15.ai Natural Text to Speech

News

Microsoft's VibeVoice uses AI to create 90-minute podcasts with multiple speakers

VibeVoice is a new open-source AI tool that can generate a full 90 minute audio podcast recording with multiple speakers from ...

3don MSN

Microsoft’s new AI can turn plain text into a full podcast — and it’s freakishly good at it

"VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as ...

29d

I tested 3 text-to-speech AI models to see which is best - hear my results

Text-to-speech models from ElevenLabs, Hume AI, and Descript are all pushing the limits of AI-generated voice technology.

WinBuzzer13h

Microsoft Releases VibeVoice Open-Source AI Model for Generating Multi-Speaker Podcasts

Microsoft has launched VibeVoice, a new open-source AI model capable of generating up to 90 minutes of multi-speaker audio ...

ZDNet2mon

Text-to-speech with feeling - this new AI model does everything but ...

Text-to-speech with feeling - this new AI model does everything but shed a tear ElevenLabs' 'most expressive' v3 model can speak with a huge range of emotions in more than 70 languages.

MediaNama2d

Microsoft Launches New Speech Generation AI Model MAI-Voice-1 Despite AI Capacity Constraints

In its latest blog, Microsoft announced the launch of a new speech generation AI model, MAI-Voice-1, for Copilot and Podcasts features.

VentureBeat2mon

Voice AI that actually converts: New TTS model boosts sales 15% for ...

Capturing natural conversations Rime’s model generates audio tokens that are decoded into speech using a codec-based approach, which Rime says provides for “faster-than-real-time synthesis.” ...

WinBuzzer6d

OpenAI Launches Production-Ready Voice API with Cheaper, More Expressive ‘gpt-realtime’ Model

OpenAI's Realtime API is now generally available, featuring the new gpt-realtime model for more natural voice agents at a 20% lower cost for developers.

15d

Google Docs gets Gemini audio feature to read documents aloud

Google Docs introduces Gemini audio, a text-to-speech feature allowing users to listen to documents. Available on the web for AI Pro and Ultra subscribers, it includes natural-sounding voices, ...

Morningstar23d

Day1/5: SkyReels-A3: The Art of Natural Speech for Digital Humans

From August 11 to August 15, a new model will be unveiled each day, covering cutting-edge models for multimodal AI scenarios. On August 11, Skywork officially launched the SkyReels-A3 model.

Forbes5d

Why Speech Diversity Is The Missing Piece For Truly Autonomous AI Agents

So why do today’s AI systems struggle with natural speech? The problem isn’t just about technology; it’s about the way these ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results