Speech Synthesis, Recognition, and More With SpeechT5
We’re happy to announce that SpeechT5 is now available in 🤗 Transformers, an open-source library that offers easy-to-use implementations of state-of-the-art machine learning models.

SpeechT5 was originally described in the paper SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing by Microsoft Research Asia. The official checkpoints published by the paper’s authors are available on the Hugging Face Hub.

If you want to jump right in, here are some demos on Spaces:

Speech Synthesis (TTS)
Voice Conversion
Automatic Speech Recognition