News
Meta’s open-source speech AI recognizes over 4,000 spoken languages. It can also produce text-to-speech in over 1,100 languages.
With a focus on expressive quality, reproducibility, and open access, Dia adds a distinctive new voice to the landscape of text-to-speech.
Decagon, which builds AI-powered voice experiences, saw a 30% improvement in transcription accuracy using OpenAI’s speech recognition model.
Meta's Voicebox AI promises to do for the spoken word what ChatGPT and Dall-E, respectively, did for text and image generation.
ChatGPT update enables its AI to “see, hear, and speak,” according to OpenAI. Image recognition and voice features aim to make the AI bot's interface more intuitive.
On Thursday, Microsoft researchers announced a new text-to-speech AI model called VALL-E that can closely simulate a person's voice when given a three-second audio sample.
TranscribeGlass, a company that produces glasses with live speech-to-text transcription, will launch this week. The company develops glasses that allow hearing-impaired users to engage with the world ...