Speech to Text Tutorial JavaScript

Speech-to-Text Tools for Modern Dev Teams

You know that feeling when a meeting ends and half the discussion is just… gone? Not in memory exactly, not in notes ...

IEEE

SYNTHE-SEES: Face Based Text-to-Speech for Virtual Speaker

Abstract: Recent research on virtual voice generation is limited in that it produces low-quality voices and generates inconsistent voices from different facial images of the same speaker. To ...

GitHub

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

VoiceCraft is a token-infilling neural codec language model that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including ...

IEEE

An Automated Method to Correct Artifacts in Neural Text-to-Speech Models

Abstract: Recent advances in deep learning technology have enabled high-quality speech synthesis, and text-to-speech models are widely used in a variety of applications. However, even state-of-the-art ...

GitHub

Moshi: a speech-text foundation model for real time dialogue

Finally, the code for the web UI client used in the Moshi demo is provided in the client/ directory. If you want to fine tune Moshi, head out to kyutai-labs/moshi ...
