getapp-logo

App comparison

Add up to 4 apps below to see how they compare. You can also use the "Compare" buttons while browsing.

GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Top Rated Speech Recognition Software with Automatic transcription - Page 2

Last updated: May 2026

1 filter applied

Features


Integrated with

No filters available


Pricing model


Devices supported


Organization types


User rating


85 software options

Taption logo

AI-driven subtitles, translations and video editing

learn more
Taption is a feature-rich platform that automatically generates high-quality transcripts, translations, and subtitles for videos. The platform's leading AI technology converts audio or video content into text in over 40 languages, allowing users to create embedded bilingual subtitles, labeled speaker transcripts, and translations for their video projects. Taption's intuitive editing tools make it easy to trim and adjust the text to align with video edits, ensuring a polished final product.

Read more about Taption

Users also considered
Amazon Transcribe logo

Automatic speech recognition platform

learn more
Amazon Transcribe is an automatic speech recognition platform that helps businesses convert speech to text and generate read or review transcripts. It includes a call analytics API, which allows developers to process live as well as recorded audio/video inputs and perform transcriptions.

Read more about Amazon Transcribe

Users also considered
Speech Recognition Engine logo

AI-enabled speech recognition engine

learn more
LumenVox’s speech and voice software leverages artificial intelligence, natural language understanding, and deep machine learning technologies to deliver speech recognition technology. It includes neural networks to improve the ability to add new languages and dialects and serve a more diverse base of users.

Read more about Speech Recognition Engine

Users also considered
SoapBox logo

Speech recognition software designed to detect kid speech

learn more
SoapBox Labs builds speech recognition technology for kids. Its proprietary technology has been built to deliver private and accurate results for kids ages 2 to 12 of all accents and dialects. Soapbox Labs also takes into account kids’ unpredictable speech patterns and behaviors.

Read more about SoapBox

Users also considered
Kili logo

Training data platform for enterprise AI

learn more
Kili is a training data platform designed to help businesses in banking, manufacturing, and healthcare industries streamline the entire training process for artificial intelligence (AI) and machine learning models, from connecting with raw data sources to preparing models, processing information, and training. The platform enables organizations to handle multiple machine learning projects and process videos, images, text, and other types of data.

Read more about Kili

Users also considered
Voice To Text Online logo

Speech recognition in 55 languages via browser

learn more
Voice to Text Online converts spoken words into written text in real-time across more than fifty-five languages. The browser-based tool processes audio locally using the Web Speech API, requiring no account creation or software installation. Users can transcribe live speech, save projects for later editing, and export transcripts in multiple formats including TXT, SRT, VTT, and JSON with automatic speaker identification and confidence-level indicators.

Read more about Voice To Text Online

Users also considered
DeepTranscript logo

Transcribe large volumes of conversation

learn more
DeepTranscript is an automatic speech recognition provider for professionnals designed for large volumes and high accuracy. Let's collect all data available in conversations, talks, interview with our plug and play API.

Read more about DeepTranscript

Users also considered
Translation Worldwide Software logo

Translation management tool for healthcare & medical sector

learn more
Translation Worldwide Software by JBI SOFTWARE is designed to help businesses across healthcare, legal, medical, insurance, banking, and other industries manage language translation projects. The artificial intelligence (AI)-enabled solution allows employees to handle text interpretation and translation processes and reduce lawsuits.

Read more about Translation Worldwide Software

Users also considered
Descript logo

Transcription management and video & audio editing software

learn more
Descript is a transcription software that is designed for businesses in multiple industries, such as marketing, sales, user research, online learning, and customer support. It helps team members collaborate on projects, send feedback, create shared folders, add comments, and track document versions.

Read more about Descript

Users also considered
Verbit logo

Verbit makes video and audio accessible and more engaging.

learn more
Verbit provides accurate captions & transcription of live and recorded video to make them accessible & engaging to all audiences. These tools help educators, government entities, legal agencies, business leaders and media producers meet the needs of those with disabilities & all engage viewers.

Read more about Verbit

Users also considered
GoSpeech logo

Saas solution for transcription and subtitling

learn more
Saas solution to convert speech to text based on artificial intelligence

Read more about GoSpeech

Users also considered
Matesub logo

Subtitling & Transcription in 200+ Languages

learn more
Matesub is a cloud-based closed captioning tool that utilizes AI technology to generate subtitles for videos. This solution offers compatibility with a range of video formats, facilitating transcription and translation across 85 languages. With Matesub, users can generate culturally sensitive subtitles that resonate with international audiences. Additionally, its WYSIWYG frame-level editor helps capture the contextual nuances of the source material.

Read more about Matesub

Users also considered
ELSA Speech Recognition API logo

Leading Speech Recognition API for Language Learning

learn more
ELSA uses proprietary speech recognition technology and artificial intelligence to help language learners improve their English pronunciation

Read more about ELSA Speech Recognition API

Users also considered
inspeech logo

Extract the wealth hidden in your client’s voice

learn more
Transform the valuable information contained in the calls you already have into competitive intelligence, customer satisfaction and new business.

Read more about inspeech

Users also considered
SpeechTexter logo

Speech recognition and conversion software

learn more
SpeechTexter is a speech recognition and conversion software that helps corporates, teachers, lawyers, writers, and students convert audio files into text. It offers a multi-language speech recognizer as well as document and email transcriber, enabling users to transcribe documents in real-time.

Read more about SpeechTexter

Users also considered
3Play Media logo

Closed captioning and transcription solution

learn more
3Play Media is designed to help businesses across media, entertainment, eCommerce, fitness, education, and government sectors handle closed captioning, transcription, audio description, live captioning, and subtitling operations. It enables users to manage podcasts, enhance search engine optimization (SEO) activities, and track audience engagement via a unified platform.

Read more about 3Play Media

Users also considered
Rev.ai logo

Speech recognition software using asynchronous API

learn more
Rev.ai’s suite of speech-to-text APIs allows businesses to build downstream applications. Speech recognition software built from speech engine trained to transcribe content on various topics with various accents for various industries.

Read more about Rev.ai

Users also considered
INVOX Medical logo

Real-time dictation and transcription of medical reports.

learn more
INVOX Medical is a speech recognition software for real-time dictation and transcription of medical reports. It is compatible with any medical or EHR software and we have specific dictionaries for more than 15 medical specialties to ensure maximum accuracy in dictation transcription.

Read more about INVOX Medical

Users also considered
Maestra logo

Speech to text, closed captioning & transcription software

learn more
Maestra is a speech to text software designed to help educators, researchers, marketers, journalists, and media houses automatically add transcriptions, captions, subtitles, and voiceovers to audio and video files in real-time. The platform enables professionals to translate text into various languages including English, French, Spanish, and German.

Read more about Maestra

Users also considered
Amberscript logo

Web-based speech recognition software

learn more
AmberScript is a suite of software products that allow you to transform audio and video files into searchable text and subtitles. Create closed captions and subtitles to improve accessibility, save money, and time.

Read more about Amberscript

Users also considered
DeepScribe logo

DeepScribe AI Scribe: Fast | Accurate | Scalable | Secure

learn more
DeepScribe is Healthcare's most trusted and widely adopted AI Medical Scribe, used by hundreds of healthcare systems across the US.

Clinicians using DeepScribe have seen charts closed within 1.6 minutes, documentation time decreased by 75%, and increased patient capacity by 2 patients/day.

Read more about DeepScribe

Users also considered
Snowfly logo

Employee engagement, gamification & corporate wellness tool

learn more
Snowfly is an employee engagement and gamification software designed to help businesses measure the performance of employees and engage them through incentives and rewards. It enables organizations to create, implement, and manage recognition programs to improve employee experience (EX) and satisfaction.

Read more about Snowfly

Users also considered
BigHand Workflow Management logo

Speech recognition and transcription software

learn more
Automatically delegate legal tasks to the right support staff, at the right cost to the firm, with BigHand Workflow Management. Assign support tasks & receive work seamlessly, whilst using the output reports to make data-driven decisions.

Read more about BigHand Workflow Management

Users also considered
Trint  logo

Automated transcription platform with AI

learn more
Trint is a cloud-based audio and video transcription solution which leverages artificial intelligence (AI), machine learning, and natural language processing (NLP) to automatically transcribe audio from a range of file formats and generate an interactive, searchable, editable & shareable transcript

Read more about Trint

Users also considered
LilySpeech logo

Speech to text software

learn more
LilySpeech is a windows application that allows users to type with voice. It is highly accurate and works with any Windows application.

Read more about LilySpeech

Users also considered