Generate speech from text

อัปเดตครั้งล่าสุดเมื่อ 27 ต.ค. 2025

Learn how to use text prompts to generate audio clips with varying voices, tones, and accents using Firefly.

Try it in the app
Generate speech from text in a few simple steps.

Generate Speech (beta) allows you to generate natural-sounding audio clips and voiceovers. You can use controls such as accent, language, speed, and pitch to customize the voice characteristics that best fit your needs.

On the Firefly homepage, select Generate from the left panel and then select Generate speech.

On the Generate speech page, copy and paste the text to convert into speech or select Add Text and upload a file in DOCX or TXT format.

After adding the text, navigate to the Speech settings panel on the left and use the Model dropdown to select Firefly Speech.

เคล็ดลับ:

You can also use a partner model, such as ElevenLabs Multilingual v2, to generate speech from text.

Use the Voice dropdown menu and select a voice.

Use this panel to adjust accent, pitch, and speed to give your voice a unique style.

The Speech settings section in the left panel displays the Firefly Speech model and an expanded Voice dropdown menu listing all available voices.
Select the voice that best suits your project’s requirements and aligns with your creative goals.

หมายเหตุ:

The list of voices will load only if you're signed in to your Firefly account.

Use the Select a language dropdown and select a language and the delivery accent from the list of languages, such as English (US), English (UK), and English (India).

The Speech settings section displays the Accent dropdown menu and the speed and pitch sliders to adjust for speech generation.
Customize the selected voice by adjusting its accent, speed, and pitch.

A. Select a language B. Speed C. Pitch 

If you want to change the Speed and Pitch of the generated speech, adjust the following speech settings:
  • Speed: Drag the speed bar to the right to increase or to the left to decrease the speed of the spoken audio.
  • Pitch: Drag the pitch bar to the right to increase or to the left to decrease the pitch of the spoken audio.
เคล็ดลับ:
  • Navigate to the bottom of the left panel and select the icon to play a sample audio of the voice you’ve selected and adjusted speed, pitch, and accent.
  • You can also add the voice to your favourites by selecting the icon.

In the main text editor window, you can make additional edits to the text entered:

  • Play: Preview selected text in your uploaded content before generating it.
The Generate speech page displays the text editor window with the Play button highlighted to preview voice output.
Use the Play button to quickly preview how the text sounds with the selected voice settings.

  • Fix Pronunciation: Fix pronunciation and add additional guidance on how certain words should sound.
  • Find & Replace: Select words and replace them.
  • Add Text: Add additional text to the uploaded content by importing a TXT or DOCX file.
  • Add Pause: Add pauses to make the audio track sound more natural.
  • Add Tone: Add tonality to your audio and define the intonation of the generated speech.

Select Generate.

Once you’re satisfied with the generation and how it sounds, select Download to save a copy of the audio file in WAV or MP3 format.