With the latest developments in generative AI, it is trivial to create speech in a language of your choice. You can generate a male or female voice in in any manner of speaking you choose. The voice can sound happy, sad, angry or excited. One thing that is trickier though is to have a way to generate speech in your own voice. With their new Professional Voice Cloning feature,@elevenlabsio makes it not only possible but easily accessible.
In this tutorial, you will learn how to build a web-based voice-to-voice cloning app using Gradio. This voice translator is built using these technologies:
1. Gradio - for the interface
2. AssemblyAI - for transcription
3. Python translate module - for translation of text
4. Elevenlabs - for reading translated text in your own voice
🔑 The API Keys you need:
1. AssemblyAI API key - https://www.assemblyai.com/?utm_source=youtube&utm_medium=referral&utm_campaign=yt_mis_66
2. Elevenlabs API key - https://elevenlabs.io/
👩💻 You can find the code for the simple and complex apps in this repo: https://github.com/AssemblyAI-Community/Voice-to-Voice-translator
00:00 Introduction
01:26 Project structure
02:32 Gradio interface
07:04 Transcibing input audio
10:17 Translate text
13:35 Generate audio in your own voice
21:31 Test the app
24:31 Keep learning
▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬
🖥️ Website: https://www.assemblyai.com/?utm_source=youtube&utm_medium=referral&utm_campaign=yt_mis_66
🐦 Twitter: https://twitter.com/AssemblyAI
🦾 Discord: https://discord.gg/Cd8MyVJAXd
▶️ Subscribe: https://www.youtube.com/c/AssemblyAI?sub_confirmation=1
🔥 We're hiring! Check our open roles: https://www.assemblyai.com/careers
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#MachineLearning #DeepLearning