This repository provides synthesized samples, training and evaluation data, source code, and parameters for the paper One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech. It contains an implementation of Tacotron 2 that supports multilingual experiments and implements different approaches to encoder parameter sharing, combining ideas from Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning, End-to-End Code-Switched TTS with Mix of Monolingual Recordings, and Contextual Parameter Generation for Universal Neural Machine Translation.

We provide data for comparing three multilingual text-to-speech models. The first shares the whole encoder and uses an adversarial classifier to remove speaker-dependent information from the encoder. The second uses a separate encoder for each language. The third, the generated model, follows the contextual parameter generation approach: a parameter generator conditioned on language embeddings produces the encoder parameters.
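The contextual parameter generation idea behind the generated model can be illustrated with a minimal NumPy sketch (not taken from the repository; all names and dimensions are hypothetical): a small generator maps a language embedding to the weights of an encoder layer, so all languages share the generator while each language gets its own effective encoder parameters.

```python
import numpy as np

rng = np.random.default_rng(0)

EMB_DIM = 4              # language embedding size (illustrative)
IN_DIM, OUT_DIM = 8, 8   # encoder layer dimensions (illustrative)

# Generator: a single linear map from a language embedding to the
# flattened weight matrix of one encoder layer.
G = rng.standard_normal((EMB_DIM, IN_DIM * OUT_DIM)) * 0.1

def generate_encoder_weights(lang_emb: np.ndarray) -> np.ndarray:
    """Produce a language-specific weight matrix from a language embedding."""
    return (lang_emb @ G).reshape(IN_DIM, OUT_DIM)

def encode(x: np.ndarray, lang_emb: np.ndarray) -> np.ndarray:
    """Run one encoder layer using the generated, language-specific weights."""
    W = generate_encoder_weights(lang_emb)
    return np.tanh(x @ W)

# Two languages obtain different encoder parameters from the same generator.
de_emb = rng.standard_normal(EMB_DIM)
fr_emb = rng.standard_normal(EMB_DIM)
x = rng.standard_normal((3, IN_DIM))   # toy batch of 3 input frames
out_de = encode(x, de_emb)
out_fr = encode(x, fr_emb)
```

Only the generator's parameters are trained across all languages, which is what makes the approach a form of meta-learning: knowledge is shared through the generator rather than through a single shared encoder.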
Features
- Interactive demos showcasing the code-switching abilities and joint multilingual training of the generated model (trained on an enhanced CSS10 dataset)
- Many samples synthesized using the three compared models
- Our best model, which supports code-switching and voice cloning, is available for download
- Training and evaluation data for comparing the three models
- An implementation of Tacotron 2 supporting multilingual experiments, with different approaches to encoder parameter sharing