This page documents all notable changes to this project.
2025-01-06 / Asynchronous Speech-to-Text API
Added
- Added support for the Whisper Fusion transcriber, which handles rare words better. See Submit Transcription Job for more details.
2024-12-06 / Java SDK
Added
- Added option for setting deployment configuration for the Asynchronous Speech-to-Text API Client and Language Identification API Client. See Rev AI Global Deployments for more details
2024-10-18 / Asynchronous Speech-to-Text API
Added
- Added Forced Alignment feature
- Added Asynchronous Speech-to-Text API `forced_alignment` submission option.
2024-10-08 / Asynchronous Speech-to-Text API
Added
- Premium Diarization feature goes out of Beta and becomes publicly available.
2024-10-02 / Asynchronous Speech-to-Text API
Added
- Updated US deployment of the Asynchronous Speech-to-Text API to support new low-cost transcription using Reverb Turbo model.
- It can be used by specifying `"transcriber": "low_cost"` in the request.
2024-01-05 / Java SDK
Fixed
- Updated `summarization` parameter to use `SummarizationModel` for the Asynchronous Speech-to-Text API Client
- Updated `translation` parameter to use `TranslationModel` for the Asynchronous Speech-to-Text API Client
2024-01-04 / Asynchronous Speech-to-Text API
Added
- Updated US deployment of the Asynchronous Speech-to-Text API to support asynchronous Translation and Summarization
2023-12-28 / Java SDK
Added
- `summarization` and `translation` parameters to the Asynchronous Speech-to-Text API Client
2023-11-16 / Asynchronous Speech-to-Text API
Added
- Updated US deployment of the Asynchronous Speech-to-Text API `language` submission option to support new languages: Afrikaans, Armenian, Azerbaijani, Belarusian, Bosnian, Estonian, Galician, Icelandic, Kannada, Kazakh, Macedonian, Marathi, Nepali, Serbian, Swahili, Tagalog, Thai, Ukrainian, Urdu, Vietnamese, Welsh, and multilingual English/Spanish.
- Updated US deployment of the Asynchronous Speech-to-Text API HIPAA-supported `language` list to all languages: Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Estonian, Farsi, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Mandarin, Marathi, Nepali, Norwegian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, Welsh, and multilingual English/Spanish.
- Updated US deployment of the Asynchronous Speech-to-Text API media file duration limits. All languages except Telugu support file durations up to 17 hours; Telugu supports up to 6 hours.
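The duration limits in this entry could be checked on the client before uploading. The sketch below is only an illustration: the function names are hypothetical, and it assumes Telugu is submitted with the ISO 639-1 code `te`, which this changelog does not state.

```python
# Limits taken from the entry above: 17 hours for every language
# except Telugu, which is limited to 6 hours.
DEFAULT_MAX_HOURS = 17
TELUGU_MAX_HOURS = 6


def max_duration_seconds(language: str) -> int:
    """Return the maximum media duration in seconds for a language code.

    Assumption: Telugu is identified by the ISO 639-1 code "te".
    """
    hours = TELUGU_MAX_HOURS if language.lower() == "te" else DEFAULT_MAX_HOURS
    return hours * 3600


def is_within_limit(language: str, duration_seconds: float) -> bool:
    """True when a file of the given duration fits the documented limit."""
    return duration_seconds <= max_duration_seconds(language)
```

A pre-flight check like this only avoids an obviously doomed upload; the API remains the authority on what it accepts.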
2023-10-31 / Node SDK
Added
- Added `diarization_type` parameter to Asynchronous Speech-To-Text API job options.
2023-09-07 / Asynchronous Speech-to-Text API
Added
- Updated Asynchronous Speech-to-Text API `diarization_type` submission option.
2023-08-10 / Asynchronous Speech-to-Text API
Changed
- Updated Asynchronous Speech-to-Text API to support English US (`en-us`) and English UK (`en-gb`) language values.
2023-07-19 / Asynchronous Speech-to-Text API
Added
- Updated Asynchronous Speech-to-Text API `speakers_count` submission option.
2023-06-14 / Forced Alignment API
Added
- (Open Beta) Added Forced Alignment API documentation
2023-05-15 / Asynchronous Speech-to-Text API
Added
- Updated Asynchronous Speech-to-Text API `speaker_channels_count` submission option documentation to include valid languages (en, es, fr).
2023-03-29 / Node SDK
Added
- Added `enable_speaker_switch` parameter to Streaming Speech-To-Text API job options.
2023-03-29 / Node SDK
Added
- Fixed an issue with handling API error responses when the error object response property is undefined.
2023-03-22 / Asynchronous Speech-to-Text API
Added
- Added a new job submission `remove_atmospherics` option to the Asynchronous Speech-to-Text API. This option enables you to remove atmospherics such as `<laugh>`, `<affirmative>`, etc. from the ASR output.
2023-01-02 / Asynchronous Speech-to-Text API
Changed
- Changed the submission option `verbatim` of async transcription job. Now it can be used with both `machine` and `human` transcribers. The option default value depends on the transcriber.

| Transcriber | Description |
|---|---|
| `machine` | The default is `true`. To turn it off, `false` must be explicitly provided. |
| `human` | The default is `false`. To turn it on, `true` must be explicitly provided. |
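
The transcriber-dependent default described above can be mirrored in client code to make the effective setting explicit. This is a minimal sketch with a hypothetical helper name; it models only the documented defaults and does not call the API.

```python
from typing import Optional

# Defaults from the table above: verbatim is on for machine, off for human.
VERBATIM_DEFAULTS = {"machine": True, "human": False}


def effective_verbatim(transcriber: str, verbatim: Optional[bool] = None) -> bool:
    """Return the verbatim setting that applies to a job.

    An explicit value always wins; otherwise the transcriber's
    documented default is used.
    """
    if verbatim is not None:
        return verbatim
    return VERBATIM_DEFAULTS[transcriber]
```

Passing `verbatim` explicitly in every submission avoids depending on the default at all, which may be the safer choice when switching transcribers.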
2022-11-14 / Node SDK
Added
- Added functionality to enable the caller to specify which Rev AI region to use for submission. Use new `RevAiApiClientConfig` when constructing your `RevAiApiClient`. This is available for the Asynchronous Speech-To-Text API, Streaming Speech-To-Text API, and Language Id API clients. See https://github.com/revdotcom/revai-node-sdk#usage for details
- Added `speaker_names` parameter to Asynchronous Speech-To-Text API job options. This is available for Human Transcription only
- Added `skip_postprocessing` parameter to both Asynchronous Speech-To-Text API and Streaming Speech-To-Text API.
2022-10-27 / Streaming Speech-to-Text API
Changed
- Streams are now billed based on the maximum of stream duration and audio duration. Refer to the Billing section for more details.
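As a rough illustration of the rule above, the billed duration is the larger of the two measurements. The function name is hypothetical and both inputs are assumed to be in seconds; any rounding or minimum charge is covered by the Billing section, not by this sketch.

```python
def billable_seconds(stream_duration: float, audio_duration: float) -> float:
    """Return the duration a stream is billed for.

    Per the entry above, this is the maximum of how long the stream
    stayed open and how much audio was sent.
    """
    return max(stream_duration, audio_duration)
```

For example, a stream held open for 60 seconds while sending 45 seconds of audio would be billed on 60 seconds under this rule.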
2022-09-29 / Streaming Speech-to-Text API
Added
- Added support for Streaming Speech-to-Text API (non-English Languages) in the European Union deployment.
2022-09-24 / Rev AI API
Changed
- All Rev AI job identifiers are now 16 characters in length (increased from 12 characters previously).
2022-09-22 / Streaming Speech-to-Text API
Added
- Added support for Streaming Speech-to-Text API (English only) in the European Union deployment.
2022-09-15 / Asynchronous Speech-to-Text API
Removed
- Deprecated `machine_v2` as an option for `transcriber`. Using `machine` is now the recommended option. Usage of `machine_v2` will silently route to `machine`.
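Because the API routes the deprecated value silently, existing clients keep working. A client that wants its stored configuration to match the recommended value could normalize it locally, as in this sketch with a hypothetical helper name.

```python
def normalize_transcriber(transcriber: str) -> str:
    """Map the deprecated "machine_v2" value to the recommended "machine".

    Any other value is returned unchanged.
    """
    return "machine" if transcriber == "machine_v2" else transcriber
```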
2022-09-02 / Language Identification API
Added
- Added support for Language Identification API in the European Union deployment. Read more about Rev AI's global deployments and its API limitations for its European Union deployment.
2022-08-25 / Asynchronous Speech-to-Text API
Added
- Added support for asynchronous non-English Speech-to-Text transcription in the European Union deployment.
2022-08-04 / Streaming Speech-to-Text API
Added
- Added a new `priority` option to the Streaming Speech-to-Text API. Possible values are `speed` and `accuracy`. Only available for English and Spanish languages with `machine_v2` transcriber.
2022-08-03 / Asynchronous Speech-to-Text API
Changed
- `balance_seconds` response value deprecated and replaced with `free_balance`, `purchased_balance`, `total_balance` and `invoiced_balance` values. The `balance_seconds` value will continue to be included in the response but will always have a value of `0`.
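Code that still reads `balance_seconds` will now always see `0`, so it may be worth switching to the replacement fields. The sketch below assumes the account response has already been parsed into a flat mapping; the helper name is hypothetical and the units of the new fields are not stated in this changelog.

```python
from typing import Any, Dict, Mapping

# Replacement fields named in the entry above.
NEW_BALANCE_FIELDS = (
    "free_balance",
    "purchased_balance",
    "total_balance",
    "invoiced_balance",
)


def read_balances(account: Mapping[str, Any]) -> Dict[str, Any]:
    """Collect the replacement balance fields from a parsed account response.

    The deprecated balance_seconds field is ignored because it is
    always 0. Fields missing from the response are skipped.
    """
    return {name: account[name] for name in NEW_BALANCE_FIELDS if name in account}
```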
2022-08-02 / Java SDK
Added
- `skip_punctuation` parameter to the Streaming Speech-to-Text API Client
- `skip_punctuation` parameter to the Asynchronous Speech-to-Text API Client
2022-08-02 / Python SDK
Added
- `skip_punctuation` parameter to the Streaming Speech-to-Text API Client
- `skip_punctuation` parameter to the Asynchronous Speech-to-Text API Client
2022-07-25 / Streaming Speech-to-Text API
Changed
- Streaming Speech-to-Text API v2 moved from Open Beta to General Availability
2022-06-23 / Asynchronous Speech-to-Text API
Added
- Added a new job submission `skip_postprocessing` option to the Asynchronous Speech-to-Text API. This option enables you to skip the post-processing steps (inverse text normalization or ITN, casing and punctuation) of a transcription job.
2022-06-22 / Asynchronous Speech-to-Text API
Changed
- Human Transcription feature `segments_to_transcribe` minimum segment length lowered from 2 minutes to 1 minute
2022-06-09 / Language Identification API
Changed
- Multipart/form-data submission no longer requires `options` parameter
2022-06-08 / Java SDK
Fixed
- Error response when submitting forbidden parameters, such as `media_url` for HIPAA accounts. See https://docs.rev.ai/resources/tutorials/introduction-to-auth-options/ for proper usage.
2022-06-08 / Node SDK
Fixed
- Response when submitting forbidden parameters, such as `media_url` for HIPAA accounts. See https://docs.rev.ai/resources/tutorials/introduction-to-auth-options/ for proper usage.
2022-06-03 / Java SDK
Added
- Support for Language Identification API
2022-06-02 / Asynchronous Speech-to-Text API
Added
- Added support for asynchronous English Speech-to-Text API in the European Union deployment. Read more about Rev AI's global deployments and its API limitations for its European Union deployment.
2022-06-01 / Node SDK
Added
- Support for Language Identification API
2022-05-26 / Node SDK
Added
- Support for Sentiment Analysis API
- Support for Topic Extraction API
2022-05-26 / Streaming Speech-to-Text API
Added
- Added `machine_v2` transcriber. Routes to our new improved v2 model.
- Added `enable_speaker_switch` option. Only available for v2 streams.
2022-05-26 / Streaming Speech-to-Text API
Added
- Added a new `skip_postprocessing` option to the Streaming Speech-to-Text API. This option allows you to skip the post-processing steps (inverse text normalization or ITN, casing and punctuation) of a transcription job. Only available for English and Spanish languages.
2022-05-23 / Language Identification API
Changed
- Language Identification API moved from Open Beta to General Availability
2022-05-20 / Java SDK
Added
- `language` parameter to the Streaming Speech-to-Text API Client
2022-05-20 / Streaming Speech-to-Text API
Changed
- The 8 languages for the Streaming Speech-to-Text API are out of Open Beta and in General Availability: French, German, Italian, Japanese, Korean, Mandarin, Portuguese, and Spanish.
2022-05-19 / Java SDK
Added
- `notificationConfig` and `sourceConfig` parameters to `RevAiJobOptions`
- Support for Topic Extraction API
- Support for Sentiment Analysis API
Deprecated
- `mediaUrl` option. Use `sourceConfig` instead
- `callbackUrl` option. Use `notificationConfig` instead
2022-05-19 / Python SDK
Added
- `language` parameter to the Streaming Speech-to-Text API Client
2022-05-17 / Node SDK
Added
- `language` parameter to the Streaming Speech-to-Text API Client
2022-05-13 / Node SDK
Added
- `notification_config` and `source_config` parameters to job options
Deprecated
- `submitJobUrl` function. Use the `submitJob` function instead and provide a URL in the job options as shown in the code examples
- `media_url` option. Use `source_config` instead
- `callback_url` option. Use `notification_config` instead
2022-05-13 / Python SDK
Added
- `notification_config` and `source_config` parameters to all job submissions
- Support for Language Identification API
- Support for Sentiment Analysis API
Deprecated
- `media_url` option. Use `source_config` instead
- `callback_url` option. Use `notification_config` instead
2022-05-09 / Asynchronous Speech-to-Text API
Added
- `source_config` as a replacement for the deprecated `media_url` to provide a source URL for a job
- Support for authorization headers when accessing URLs for `source_config`
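Existing job options that still use `media_url` could be converted with a small helper like the one below. This is a sketch under stated assumptions: the helper name is hypothetical, and the inner key names `url` and `auth_headers` are assumed rather than taken from this changelog, so verify them against the API reference.

```python
from typing import Any, Dict, Optional


def with_source_config(options: Dict[str, Any],
                       auth_header: Optional[str] = None) -> Dict[str, Any]:
    """Return a copy of job options that uses source_config instead of media_url.

    Assumptions: source_config carries the URL under "url" and optional
    request headers under "auth_headers". The input dict is not modified,
    and an existing source_config is left untouched.
    """
    migrated = dict(options)
    media_url = migrated.pop("media_url", None)
    if media_url is not None and "source_config" not in migrated:
        source: Dict[str, Any] = {"url": media_url}
        if auth_header is not None:
            source["auth_headers"] = {"Authorization": auth_header}
        migrated["source_config"] = source
    return migrated
```

Returning a copy rather than mutating the input keeps the original options available for logging or retries.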
2022-04-29 / Language Identification API
Added
- Added v1 route for Language Identification API: `languageid/v1`
2022-04-25 / Python SDK
Added
- Support for Topic Extraction API
- `speaker_names` option to asynchronous client for human transcription
2022-04-25 / Streaming Speech-to-Text API
Added
- Added 8 new languages to the Streaming Speech-to-Text API in Open Beta: French, German, Italian, Japanese, Korean, Mandarin, Portuguese, and Spanish.
2022-04-20 / Streaming Speech-to-Text API
Added
- RTMP audio streaming documentation
2022-04-07 / Language Identification API
Added
- (Open Beta) `processed_duration_seconds` to Language Identification Job Details
2022-03-31 / Language Identification API
Added
- (Open Beta) Added Language Identification API documentation
2022-03-24 / Asynchronous Speech-to-Text API
Added
- `custom_vocabulary` as a possible `failure` for a failed job
2022-02-07 / Node SDK
Added
- `transcriber` to asynchronous client
- `verbatim`, `rush`, `test_mode` and `segments_to_transcribe` options to asynchronous client for human transcription
- `start_ts` and `transcriber` to streaming client
Fixed
- Fixed a bug where binary data containing bytes equivalent to string "EOS" prematurely ends the streaming session
2022-02-07 / Node SDK
Added
- Support for Node 14, 16 and 17 (supported versions now include: 8, 10, 12, 14, 16, 17)
- `custom_vocabulary_id` to asynchronous client
- `detailed_partials` to streaming client
2022-02-01 / Java SDK
Added
- `custom_vocabulary_id` and `transcriber` to asynchronous client
- `verbatim`, `rush`, `test_mode` and `segments_to_transcribe` options to asynchronous client for human transcription
- `detailed_partials`, `start_ts` and `transcriber` to streaming client
2022-01-31 / Python SDK
Added
- `transcriber` to asynchronous client
- `verbatim`, `rush`, `test_mode` and `segments_to_transcribe` options to asynchronous client for human transcription
- `start_ts` and `transcriber` to streaming client
2022-01-14 / Asynchronous Speech-to-Text API
Added
- (Open Beta) `machine_v2` as an option for `transcriber` to run our Reverb ASR model for improved Word Error Rate.
2021-12-07 / Asynchronous Speech-to-Text API
Added
- Human Transcription feature
- (Open Beta) `transcriber` option to enable asynchronous transcription job submissions to be transcribed by a human.
- (Open Beta) `verbatim` option to enable asynchronous transcription job submissions
- (Open Beta) `rush` option to enable asynchronous transcription job submissions
- (Open Beta) `segments_to_transcribe` option to enable asynchronous transcription job submissions
2021-12-07 / Sentiment Analysis API
Added
- (Open Beta) Added Sentiment Analysis API documentation
2021-12-07 / Topic Extraction API
Added
- (Open Beta) Added Topic Extraction API documentation
2021-10-12 / Python SDK
Added
- `detailed_partials` parameter to the streaming client
- CI now runs on GitHub Actions. This replaces Travis CI.
2021-10-06 / Node SDK
Fixed
- Fixed a bug where the HTTP client library was artificially lowering the max file size for multipart upload to 10MB. The API limit is 2GB. More information in revai-node-sdk issue #72.
2021-08-06 / Asynchronous Speech-to-Text API
Added
- Ability to rotate access tokens
2021-05-26 / Asynchronous Speech-to-Text API
Added
- (Open Beta) Support for transcription for more languages
2021-05-06 / Streaming Speech-to-Text API
Added
- Enable offsetting hypotheses timestamps by providing `start_ts` to streaming jobs
2021-04-22 / Asynchronous Speech-to-Text API
Changed
- Max allowed stream duration increased from 2 to 3 hours
2021-03-09 / Python SDK
Added
- `custom_vocabulary_id` option to enable job submission with the id of a pre-submitted custom vocabulary
2021-02-10 / Node SDK
Fixed
- Bug fixes for streaming client
2021-02-03 / Asynchronous Speech-to-Text API
Changed
- `language` job option is out of Open Beta and in General Availability.
Added
- `custom_vocabularies` job option support for Rev AI's non-English languages. These are French, German, Portuguese and Spanish.
2021-02-02 / Asynchronous Speech-to-Text API
Added
- Limit on non-English language transcription audio of 12 hours or less.
2021-01-29 / Node SDK
Fixed
- Bug fix for streaming client crash on unsafeEnd
2021-01-17 / Java SDK
Added
- `language` job option to the Asynchronous Speech-to-Text API. Transcribe audio in languages other than English. See Asynchronous Speech-to-Text API docs for the full list of supported languages.
2021-01-17 / Python SDK
Added
- `language` job option to the Asynchronous Speech-to-Text API. Transcribe audio in languages other than English. See Asynchronous Speech-to-Text API docs for the full list of supported languages.
- Relax dependency pinned version requirements.
2021-01-15 / Node SDK
Added
- `language` job option to the Asynchronous Speech-to-Text API. Transcribe audio in languages other than English. See Asynchronous Speech-to-Text API docs for the full list of supported languages.
- Upgraded dependency on axios to version 0.21.1 to fix noted vulnerability.
2020-12-04 / Asynchronous Speech-to-Text API
Changed
- Reverted minor breaking change introduced on November 9 involving job failure types. "duration_out_of_range" failure type has been reverted to "duration_exceeded" and a new failure type of "duration_too_short" was introduced to cover the minimum case. See get job endpoint documentation response schema for full enum of failures.
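Clients that inspect a failed job's `failure` value may want to recognize all three duration-related names, since `duration_out_of_range` was returned between November 9 and this revert. The sketch below uses a hypothetical helper name and illustrative messages; only the three failure strings come from this changelog.

```python
from typing import Optional

# Duration-related failure types named in the 2020-11-09 and 2020-12-04 entries.
# "duration_out_of_range" was only returned between those two dates.
DURATION_FAILURES = {
    "duration_exceeded": "media is longer than the allowed maximum",
    "duration_too_short": "media is shorter than the allowed minimum",
    "duration_out_of_range": "media duration is outside the allowed range",
}


def describe_duration_failure(failure: str) -> Optional[str]:
    """Return a readable message for a duration failure, or None otherwise."""
    return DURATION_FAILURES.get(failure)
```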
2020-11-18 / Asynchronous Speech-to-Text API
Added
- (Open Beta) Support for Spanish, Portuguese, French, and German language transcription
2020-11-09 / Asynchronous Speech-to-Text API
Changed
- Changed "duration_exceeded" job failure type to "duration_out_of_range" to cover files whose duration is either too short or too long.
2020-09-01 / Node SDK
Added
- `delete_after_seconds` option for both Streaming and Asynchronous Speech-to-Text APIs
Fixed
- Bug in Streaming Speech-to-Text API client where the client closed the WebSocket connection after 1 minute of not sending any data
2020-08-31 / Asynchronous Speech-to-Text API
Added
- (Open Beta) `custom_vocabulary_id` option to enable job submission with the id of a pre-completed custom vocabulary
2020-08-27 / Streaming Speech-to-Text API
Added
- (Open Beta) `detailed_partials` option to show timestamps and confidence scores in partial hypotheses
2020-07-22 / Java SDK
Added
- `CustomVocabularyClient`: Interact with the Custom Vocabulary API for pre-uploading custom vocabulary
- `remove_disfluencies` option for both Asynchronous and Streaming Speech-to-Text API clients
- `filter_profanity` option for streaming client
2020-07-22 / Python SDK
Added
- `delete_custom_vocabulary(id)`: Delete your custom vocabulary by id
- `get_list_of_custom_vocabularies()`: Get a list of recent custom vocabulary submissions' information
- `remove_disfluencies` job option for the streaming client: Remove filler words (disfluencies) from the resulting transcript. This option was previously available for the Asynchronous Speech-to-Text API client.
Changed
- Improved examples
Fixed
- Bug fixes and improvements
2020-07-01 / Node SDK
Added
- `deleteCustomVocabulary(id)`: Delete your custom vocabulary by id
- `getListOfCustomVocabularyInformations()`: Get a list of recent custom vocabulary submissions' information
- `remove_disfluencies` job option for Streaming Speech-to-Text API: Remove filler words (disfluencies) from the resulting transcript. This option was previously available for the Asynchronous Speech-to-Text API.
Changed
- Improved examples
Fixed
- Bug fixes and improvements
2020-06-24 / Streaming Speech-to-Text API
Added
- (Closed Beta) Option to show timestamps and confidence scores in partial hypotheses. Email support@rev.ai for access.
- (Closed Beta) Stream to Rev AI with RTMP. Email support@rev.ai for access.
2020-05-23 / Java SDK
Added
- Initial release of the Java SDK available on Maven Central Repository
Changes previous to the oldest date in this document are not noted in this changelog.