This page documents changes in the Asynchronous Speech-to-Text API only. View all changes.
Added
- Added Whisper Fusion transcriber support that has better support for rare words. See Submit Transcription Job for more details
Added
- Premium Diarization feature goes out of Beta and becomes publicly available.
Added
- Added Forced Alignment feature
- Added Asynchronous Speech-to-Text API
forced_alignmentsubmission option.
Added
- Updated US deployment of the Asynchronous Speech-to-Text API to support new low-cost transcription using Reverb Turbo model.
- It can be used by specifying
"transcriber": "low_cost"in the request.
Added
- Updated US deployment of the Asynchronous Speech-to-Text API to support asynchronous Translation and Summarization
Added
- Updated US deployment of the Asynchronous Speech-to-Text API
languagesubmission option to support new languages: Afrikaans, Armenian, Azerbaijani, Belarusian, Bosnian, Estonian, Galician, Icelandic, Kannada, Kazakh, Macedonian, Marathi, Nepali, Serbian, Swahili, Tagalog, Thai, Ukrainian, Urdu, Vietnamese, Welsh, and multilingual English/Spanish. - Updated US deployment of the Asynchronous Speech-to-Text API HIPAA-supported
languagelist to all languages: Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Estonian, Farsi, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Mandarin, Marathi, Nepali, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh, and multilingual English/Spanish. - Updated US deployment of the Asynchronous Speech-to-Text API media file duration limits. All languages except Telugu supports file duration up to 17 hours, and Telugu supports up to 6 hours.
Added
- Updated Asynchronous Speech-to-Text API
diarization_typesubmission option.
Changed
- Updated Asynchronous Speech-to-Text API to support English US (
en-us) and English UK (en-gb) language values.
Added
- Updated Asynchronous Speech-to-Text API
speakers_countsubmission option.
Added
- Updated Asynchronous Speech-to-Text API
speaker_channels_countsubmission option documentation to include valid languages (en, es, fr).
Added
- Added a new job submission
remove_atmosphericsoption to the Asynchronous Speech-to-Text API. This option enables you to remove atmospherics such as<laugh>, <affirmative>etc from the ASR output.
Changed
- Changed the submission option
verbatimof async transcription job. Now it can be used with bothmachineandhumantranscribers. The option default value depends on the transcriber.
| Transcriber | Description |
|---|---|
machine | the default is true. To turn it off false should be explicitly provided |
human | the default is false To turn it on true should be explicitly provided |
Removed
- Deprecated
machine_v2as an option fortranscriber. Usingmachineis now the recommended option. Usage ofmachine_v2will silently route tomachine.
Added
- Added support for asynchronous non-English Speech-to-Text transcription in the European Union deployment.
Changed
balance_secondsresponse value deprecated and replaced withfree_balance,purchased_balance,total_balanceandinvoiced_balancevalues instead. Thebalance_secondsvalue will continue to be included in the response but will always have a value of0.
Added
- Added a new job submission
skip_postprocessingoption to the Asynchronous Speech-to-Text API. This option enables you to skip the post-processing steps (inverse text normalization or ITN, casing and punctuation) of a transcription job.
Changed
- Human Transcription feature
segments_to_transcribeminimum segment length lowered from 2 minutes to 1 minute
Added
- Added support for asynchronous English Speech-to-Text API in the European Union deployment. Read more about Rev AI's global deployments and its API limitations for its European Union deployment.
Added
source_configas a replacement for the deprecatedmedia_urlto a provide a source URL for a job- Support for authorization headers when accessing URLs for
source_config
Added
custom_vocabularyas a possiblefailurefor a failed job
Added
- (Open Beta)
machine_v2as an option fortranscriberto run our Reverb ASR model for improved Word Error Rate.
Added
- Human Transcription feature
- (Open Beta)
transcriberoption to enable asynchronous transcription job submissions to be transcribed by a human. - (Open Beta)
verbatimoption to enable asynchronous transcription job submissions - (Open Beta)
rushoption to enable asynchronous transcription job submissions - (Open Beta)
segments_to_transcribeoption to enable asynchronous transcription job submissions
- (Open Beta)
Added
- Ability to rotate access tokens
Added
- (Open Beta) Support for transcription for more languages
Changed
- Max allowed stream duration increased from 2 to 3 hours
Changed
languagejob option is out of Open Beta and in General Availability.
Added
custom_vocabulariesjob option support for Rev AI's non-English languages. These are French, German, Portuguese and Spanish.
Added
- Limit on non-English language transcription audio of 12 hours or less.
Changed
- Reverted minor breaking change introduced on November 9 involving job failure types. "duration_out_of_range" failure type has been reverted to "duration_exceeded" and a new failure type of "duration_too_short" was introduced to cover the minimum case. See get job endpoint documentation response schema for full enum of failures.
Added
- (Open Beta) Support for Spanish, Portuguese, French, and German language transcription
Changed
- Changed
"duration_exceeded"job failure type to"duration_out_of_range"to account for both too short and too long durations of files.
Added
- (Open Beta)
custom_vocabulary_idoption to enable job submission with the id of a pre-completed custom vocabulary