Frequently asked questions about Enhance Speech

Last updated on Oct 27, 2025

Do you have questions about Enhance Speech? We’ve got you covered with answers to the most common questions.

Enhance speech is optimized for a wide range of browsers on desktop and mobile. The latest versions of the following browsers are supported:

  • Google Chrome
  • Apple Safari 
  • Microsoft Edge
  • Mozilla Firefox
Note

The upload and download speed of audio content will depend on your internet connection speed and device memory. If you encounter any issues, try updating your browser to the latest version or clearing your browser cache.

Both audio and video file formats are supported.

Note

Bulk and video upload are available to Premium users. Learn more about different plans for the Enhance Speech feature.

File format type

Supported file formats

Audio

  • .mp3
  • .wav
  • .m4a
  • .flac
  • .ogg
  • .aac

 

Video

  • .mp4
  • .mov
  • .m4v

Below are the details regarding the maximum supported file sizes and durations for uploading audio and video files.

File format type

Max file size

Max duration

Audio

  • Free plan: Up to 500 MB
  • Premium plan: Up to 1 GB
  • Free plan: Up to 30 minutes per file and 1 hour maximum per day
  • Premium plan: Up to 4 hours per day

Video

  • Up to 1 GB
  • Up to 4K resolution
  • Up to 2 hours per day

When downloading from Enhance speech, we do not convert the format. Whatever format you import, you'll receive back in the download. For example, if you import an .mp3 file, you'll get back an .mp3 file.

On the free plan, you can process files up to 30 minutes in duration (maximum 500 MB), with a daily limit of 1 hour. On the Premium plan, you can process up to 4 hours of content per day, with individual files up to 1 GB in size. Learn more here.

Enhance speech is language agnostic and does not generate or translate across different languages. The results often depend on the speaker's audibility and the amount of background noise. It’s important to note that clearer and more polished inputs (speaker pronunciation and enunciation) will produce better outputs. This tool enhances audio by filtering out noise and artifacts, adjusting pitch and volume levels, and normalizing the audio.

First, check the supported formats to ensure you uploaded an accepted file type. If that is not the issue, check the daily limits enforced by your current plan. You can do this by navigating to your avatar and/or initials in the top right corner and viewing Enhance speech usage.