资讯
Meta’s open-source speech AI recognizes over 4,000 spoken languages It can also produce text-to-speech in over 1,100 languages.
With a focus on expressive quality, reproducibility, and open access, Dia adds a distinctive new voice to the landscape of text-to-speech.
Meta defines the system as “a non-autoregressive flow-matching model trained to infill speech, given audio context and text.” It’s been trained on more than 50,000 hours of unfiltered audio.
ChatGPT update enables its AI to “see, hear, and speak,” according to OpenAI Image recognition and voice features aim to make the AI bot's interface more intuitive.
On Thursday, Microsoft researchers announced a new text-to-speech AI model called VALL-E that can closely simulate a person's voice when given a three-second audio sample.
TranscribeGlass, a company that produces glasses with live text-to-speech transcription, will launch this week. The company develops glasses that allow hearing-impaired users to engage with the world ...
当前正在显示可能无法访问的结果。
隐藏无法访问的结果