Starting with the December 2014 release of Premiere Pro CC (8.2), the Speech-to-Text feature or Speech Analysis functionality is removed from Premiere Pro.
The Speech Analysis functionality in Premiere Pro analyzes speech and converts spoken words into text-based, searchable metadata.
At the time of implementing Speech Analysis, it was anticipated that the underlying engine for this feature would improve at a faster rate than it did. Over time, user feedback and research confirm that this feature is not feasible to a degree of accuracy that is useful to users.
Given this focus, Speech Analysis is removed in Premiere Pro. That is, you can no longer analyze any new media and generate speech-to-text metadata in Premiere Pro.
You can, however, continue to view the speech-to-text metadata for any clips that were previously analyzed using Premiere Pro CC 8.1 or earlier versions. The generated speech analysis metadata is embedded in the media files, so the text continues to be visible and usable when the analyzed clips are imported and used in the latest version of Premiere Pro.
What is the last version of Premiere Pro that has the Speech Analysis functionality?
Are any other Adobe products affected by the removal of the Speech Analysis functionality?
Adobe Media Encoder and Adobe Prelude support the Speech Analysis functionality. Starting with the December 2014 release, Speech Analysis will no longer be supported in Adobe Media Encoder CC (8.2) and Adobe Prelude CC (3.2).
In Adobe Media Encoder CC 8.2, you can export speech XMP metadata for any clips that were previously analyzed using Premiere Pro CC 8.1 or earlier. Select Export Master Speech Track And Sequence Markers in the Preferences dialog to include the speech XMP metadata while exporting.
In Prelude CC 3.2, Speech Transcription markers are removed from the Marker Type panel. However, when you open a clip that has Speech Transcription markers created from a previous version of Prelude, the markers reappear.
The Speech Analysis functionality can analyze spoken words and generate text metadata. You can edit and search the generated text metadata like any other metadata properties. You can also navigate to the times at which specific words are spoken, to better align edits, advertising, and subtitles.
Analyze speech to create text metadata
In Premiere Pro, Speech Analysis is available for clips when there is at least one component audio clip in the merged clip. You can choose to analyze all audio tracks in a merged clip, or a single one.
To analyze a single track for speech, select the clip that you want to analyze and then click Analyze in the Speech Analysis section of the Metadata panel.
The Speech Analysis section is at the bottom of the Metadata panel.
Select the language in which you want to create the text metadata.
The speech analysis feature can use any of several language-specific and dialect-specific libraries, such as libraries for Spanish and UK English.
To download more language-specific and dialect-specific libraries, see this link on the Adobe website.
You can improve the quality of speech analysis by using a reference script or an embedded Adobe Story script.
If you import files with a speech metadata into After Effects, each word appears as a layer marker on layers based on these footage items.
Edit speech metadata
Navigate to a specific word in speech metadata
Improve speech analysis using reference scripts
Accuracy of speech-to-text conversion depends on the clarity of the spoken words and the quality of the recorded dialog. Dialog recorded in a noisy environment or with poor microphone placement cannot produce highly accurate results even with a reference script. You can nevertheless use a reference script to improve speech analysis.
A reference script is a text document containing dialog similar to the dialog recorded in your assets. There are two types of reference scripts:
- A script that contains similar dialog, but was not necessarily written for the current project. For example, a series of medical training scripts for different products can be combined into a single text document. You can use this text document as a reference script. With this type of reference script, speech analysis produces results more accurate than it does when using only the default language models.
- A script that matches the recorded dialog. This type of reference script provides the highest accuracy possible. For example, you can use the script that the talent read during the shoot as a reference script. Alternatively, you could use a transcript typed from the assets for the purposes of close captioning.
Speech Analysis supports reference scripts only in the UTF-8 encoded text format, including Adobe Story scripts, which have the .astx filename extension.
The closeness of the match between the embedded script text and the recorded dialog determines the accuracy of matched-script text. If 100% accuracy is important, edit and revise the script text first. Ensure that the script matches the recorded dialog before using it as a reference script.
- In the Analyze Content dialog box, from the Reference Script menu, select Add.
- Browse to the reference script text or .astx file, select it, and click Open.
- In the Import Script dialog box, type a name for the reference script, and select the language of the script.
Note: You can view the text of the file in a scrolling window.
- Select Script Text Matches Dialog only if the imported script covers the recorded dialog verbatim. For example, if the reference script is the script from which the talent read their lines, select Script Text Matches Dialog. Select this option even if the recorded dialog is shorter than what the script file covers.
- Click OK.The Import Script dialog closes, and the reference script is selected in the Reference Script menu.
- Click OK in the Analyze Content dialog.
Improve speech analysis using Adobe Story scripts
Speech analysis is more accurate if Adobe Story script data is associated with a clip. Then, Adobe Premiere Pro automatically uses the Adobe Story script as a reference script.
When Adobe Premiere Pro finds enough matches with the embedded script, it replaces the analyzed speech text with the embedded script text. Adobe Premiere Pro carries over the correct spelling, proper names, and punctuation from the reference script, benefits that standard speech analysis cannot provide.
You need to match the scene number of the clips to the scene numbers in the Adobe Story script. Adobe Premiere Pro needs the information so that it can match the clips to the correct scenes in the script. You can assign scene numbers to clips in the Project panel or the Metadata panel.
- Select one or more clips in the Project panel, right-click, and choose Attach Script File or select File > Adobe Story > Attach Script File.
Note: An Adobe Story script cannot be attached to a Merged Clip. If, however, a Story script was attached to a component clip before the merge, the merged clip may be analyzed using the previously attached Story script.
- In the Analyze Content dialog, under the Script section, select Use Embedded Adobe Story Script.
When a clip has a Story script attached to it, the script is displayed in the Embedded Adobe Story Script view in the Speech Analysis section of the Metadata panel. Compare the script displayed in this view to the results of the speech analysis displayed in the Analysis Text view below it.
Note: The Embedded Adobe Story Script view is read only. You can't perform editing operations in it.
Trim with Speech Analysis
You can add In and Out points to speech analysis text to select a portion of a master clip. You can insert or overwrite the selected portion directly from the Speech Analysis section of the Metadata panel.