Alta Voce SDK - Real-time voice processing

Integrate voice enhancement into your own applications

Voice is everywhere, let it sound.

Fitting piece illustration

Our SDK can adapt to your needs for speech-to-speech augmented voice and noise canceling.

Applications include:

Contact us to design the best voice technology for your needs.

Specifications

In addition to the demos below, we also offer a web app where you can try out our effects on your own audio.

Shaping voice gender

What if you could make your voice genderless in real time, or shape the way your gender is perceived in a call?

French natural voice

More male voice gender

More female voice gender

Even more female voice gender


English natural voice

Less female voice gender

Even less female voice gender

Male voice gender

Try it in Python

import os
os.environ["VOIMOOO_LICENSE_ID"] = "AAAA-BBBB-CCCC-DDDD"

import pyvoimooo
import pysndfile

wav, fs, enc = pysndfile.sndio.read("yourfile.wav")
syn = pyvoimooo.pvoccombo(
  wav,
  fs,
  pitch_scaling=1.5,
  frequency_warping=2,
)
pyvoimooo.writewav("yourfile_female.wav", fs, syn)

Contact us to get a license key


Better understanding

Making speech more intelligible and removing background noise.

Original
Augmented

Research with our clients shows that voices augmented with Alta Voce get to engage with more customers than in their normal, unprocessed audio conditions.

Try it in Python

import os
os.environ["VOIMOOO_LICENSE_ID"] = "AAAA-BBBB-CCCC-DDDD"

import pyvoimooo
import pysndfile

wav, fs, enc = pysndfile.sndio.read("yourfile.wav")
syn = pyvoimooo.pvoccombo(
  wav,
  fs,
  efx_inteligibility_scaling=0.5
)
pyvoimooo.writewav("yourfile_intellegible.wav", fs, syn)

Contact us to get a license key


Enhanced smile

Enhancing a speaker’s attitude, e.g. how smiling and welcoming they sound.

Less
More

Less
More

Research in the lab shows that smiling voices engage more attention from listeners, triggers spontaneous smiles, and is especially effective on younger listeners.

Try it in Python

import os
os.environ["VOIMOOO_LICENSE_ID"] = "AAAA-BBBB-CCCC-DDDD"

import pyvoimooo
import pysndfile

wav, fs, enc = pysndfile.sndio.read("yourfile.wav")
syn = pyvoimooo.pvoccombo(
  wav,
  fs,
  efx_smile_alpha=1.2
)
pyvoimooo.writewav("yourfile_smiley.wav", fs, syn)

Contact us to get a license key


It's how you say it

Varying expressivity

Martin (French)

Martin more dynamic

Martin even more dynamic

Our technology can gradually shape the expressivity in a voice, e.g. to make it more dynamic.


Varying prosody inflections

Federica (TTS, Italian)

Buongiorno 1

Buongiorno 2

Buongiorno 3

It is common knowledge that how we say something is more important than what we say. Our research proved it, and our technology allows to find the expressive features conveying the desired intention. For example, listen to Federica’s voice more trustworthy.


Federica (TTS, italian)

Federica trust