A two-person startup called Nari Labs has introduced Dia, a 1.6-billion-parameter text-to-speech (TTS) model designed to produce naturalistic dialogue directly from text prompts, and one of ...
ChatTTS is an open-source AI voice text-to-speech (TTS) model that has gained significant popularity on GitHub due to its impressive features and user-friendly design. This model is specifically ...
Just in time for Halloween 2024, Meta has unveiled Meta Spirit LM, the company’s first open-source multimodal language model capable of seamlessly integrating text and speech inputs and outputs.
Diffusion Bee harnesses the power of the open source text-to-image AI Stable Diffusion, turning it into a one-click Mac App. Brace yourself for a new creativity Big Bang. Impossibly realistic and ...
Genmo Inc., an artificial intelligence content generation platform, today announced the preview release of its new open-source model Mochi 1, capable of video generation. The company said Mochi 1 ...
If you’re venturing into audio, music, and speech generation, a new open-source AI text-to-speech (TTS) toolkit called Amphion might be worth further ...