Apps Page Background Image

Free F5-TTS Online

F5-TTS is an advanced AI text-to-speech generator that produces high-quality, human-like speech. It enables users to create lifelike voice outputs with remarkable accuracy and emotional depth. With capabilities in text-to-speech conversion, podcast generation, and multi-style emotional speech, F5-TTS offers a versatile solution for various audio needs. Additionally, it supports both English and Chinese, making it an invaluable tool for diverse applications and audiences.

Quick Start of F5-TTS

How to Use F5-TTS Text-to-Speech Generator?

  • 1. Upload Reference Audio: Go to the Batched TTS tab. Upload a reference audio clip that you want to clone. Enter the text you wish to generate as audio. If you're having issues, try converting your reference audio to WAV or MP3, clipping it to 15s, and shortening your prompt.
  • 2. Choose the TTS Model: Select either F5TTS (A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching) or E2TTS (Embarrassingly Easy Fully Non-Autoregressive Zero Shot voice cloning). It is recommended to choose F5 TTS for smoother and better sound quality.
  • 3. Synthesize Audio: Click on the “Synthesize” button. Wait for the processing to complete. Once finished, you can preview the audio and download it.
  • 4. Multilingual Capability: F5TTS supports to clone multiple languages. Even if your reference audio is in English, you can enter text in Chinese, and F5-TTS will clone the voice and read it out in Chinese.
  • 5. Set Up Speakers in Podcast Tab: Name the first speaker (e.g., Mike) and upload their reference audio. Name the second speaker (e.g., Lily) and upload her reference audio.
  • 6. Write Your Podcast Script: Format your script like this: "Mike: Hi everyone, I am Mike! Welcome to my channel. Lily: Hi, I am Lily! Happy to be here. Mike: Today, we will discuss how to use F5-TTS on MimicPC. Lily: Looking forward to it!"
  • 7. Generate the Podcast: Click on “Generate Podcast” and wait for the result. If satisfied with the outcome, click to download the podcast.
  • 8. Input Reference Audio for Multi-Style Emotional Speech: Start by uploading a regular reference audio. This emotion type is mandatory.
  • 9. Add Speech Types: Click on “Add Speech Type” and name the new type (e.g., “Surprised”). Repeat the process to upload different audio clips for each speech type.
  • 10. Format Your Text: Input your text following the specified format: "{Regular} Hello, everyone! Today, I want to introduce you to MimicPC. {excited} It's a powerful tool that comes with many built-in Al generator tools. {sad} I know that for most Al tools, like ComfyUI, Stable Diffusion, and F5FTT, the installation process can be quite difficult. {happy} But the great news is that with MimicPC,all the apps are ready to use online-no installation needed!"
  • 11. Generate Emotional Speech: Click on “Generate Emotional Speech.” Wait for the result, then download the audio file.
Quick Start of Image
Quick Start of Image
Quick Start of Image
Quick Start of Image
Quick Start of Image
Quick Start of Image
KOL Background Image

F5-TTS: Emotion-Driven Text-to-Speech Generator

kol Video Imageicon play image

FAQ About F5-TTS