getapp-logo

App comparison

Add up to 4 apps below to see how they compare. You can also use the "Compare" buttons while browsing.

GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Speech Recognition Software - Page 2

Last updated: April 2026

Filter results

Features


Integrated with

No filters available


Pricing model


Devices supported


Organization types


User rating


101 software options

Rythmex logo

Speech to text, transcription, medical transcription

learn more
Rythmex is an AI transcription solution offering real-time, multilingual transcription services in a user-friendly interface. With its intuitive features, API integrations, and robust data security, Rythmex is the go-to solution for individuals and businesses seeking accurate transcription solution.

Read more about Rythmex

Users also considered
Twixor logo

Customer communications management software

learn more
Twixor EnCaps is a low-code customer engagement platform that helps businesses deliver personalized interactions. The software utilizes generative AI and natural language processing functionalities on messaging platforms to create customer journeys. The CX platform combines digital assistant and intelligent process automation to deliver personalized interactions.

Read more about Twixor

Users also considered
Taption logo

AI-driven subtitles, translations and video editing

learn more
Taption is a feature-rich platform that automatically generates high-quality transcripts, translations, and subtitles for videos. The platform's leading AI technology converts audio or video content into text in over 40 languages, allowing users to create embedded bilingual subtitles, labeled speaker transcripts, and translations for their video projects. Taption's intuitive editing tools make it easy to trim and adjust the text to align with video edits, ensuring a polished final product.

Read more about Taption

Users also considered
Amazon Transcribe logo

Automatic speech recognition platform

learn more
Amazon Transcribe is an automatic speech recognition platform that helps businesses convert speech to text and generate read or review transcripts. It includes a call analytics API, which allows developers to process live as well as recorded audio/video inputs and perform transcriptions.

Read more about Amazon Transcribe

Users also considered
CallHippo logo

Cloud-based phone system for sales, support & growing teams

learn more
CallHippo is a Virtual Phone System that is easy-to-use while offering robust functionality with advanced features, extensive reporting, and seamless integrations to empower sales and service teams to have effective conversations with customers. 24x7 World Class Support. Instant Setup

Read more about CallHippo

Users also considered
Machine Learning on AWS logo

Machine learning and AI solutions from AWS

learn more
AWS provides machine learning (ML) and artificial intelligence (AI) solutions designed to help businesses analyze data insights, personalize the customer experience, optimize business processes, and more.

Read more about Machine Learning on AWS

Users also considered
Reportex logo

Audio transcription & editing solution

learn more
Reportex from Sony is a cloud-based audio transcription and editing solution which allows users to automatically transcribe audio from multiple file formats, edit and correct transcriptions, create and share video clips of transcribed audio, download edited files, and more

Read more about Reportex

Users also considered
Speech Recognition Engine logo

AI-enabled speech recognition engine

learn more
LumenVox’s speech and voice software leverages artificial intelligence, natural language understanding, and deep machine learning technologies to deliver speech recognition technology. It includes neural networks to improve the ability to add new languages and dialects and serve a more diverse base of users.

Read more about Speech Recognition Engine

Users also considered
Castel Detect Live logo

Cloud-based speech recognition solution

learn more
Castel Detect LIVE is a voice recognition solution which helps firms of all sizes manage contact center speech analytics with alerts, reminders, scripting and call scoring. The platform allows users to regulate quality assurance via live calls analysis, post-call audits, and data-driven feedback.

Read more about Castel Detect Live

Users also considered
SoapBox logo

Speech recognition software designed to detect kid speech

learn more
SoapBox Labs builds speech recognition technology for kids. Its proprietary technology has been built to deliver private and accurate results for kids ages 2 to 12 of all accents and dialects. Soapbox Labs also takes into account kids’ unpredictable speech patterns and behaviors.

Read more about SoapBox

Users also considered
Kili logo

Training data platform for enterprise AI

learn more
Kili is a training data platform designed to help businesses in banking, manufacturing, and healthcare industries streamline the entire training process for artificial intelligence (AI) and machine learning models, from connecting with raw data sources to preparing models, processing information, and training. The platform enables organizations to handle multiple machine learning projects and process videos, images, text, and other types of data.

Read more about Kili

Users also considered
Translation Worldwide Software logo

Translation management tool for healthcare & medical sector

learn more
Translation Worldwide Software by JBI SOFTWARE is designed to help businesses across healthcare, legal, medical, insurance, banking, and other industries manage language translation projects. The artificial intelligence (AI)-enabled solution allows employees to handle text interpretation and translation processes and reduce lawsuits.

Read more about Translation Worldwide Software

Users also considered
VideoText logo

AI-powered video transcription and subtitle tool

learn more
VideoText is an AI-powered transcription platform that converts video and audio files into text-based content. The software generates timestamped transcripts, automated summaries, chapter markers, and subtitle files in SRT and VTT formats. VideoText supports ninety-nine languages for transcription and offers translation capabilities into more than seventy languages, along with speaker detection and labeling features.

Read more about VideoText

Users also considered
DeepTranscript logo

Transcribe large volumes of conversation

learn more
DeepTranscript is an automatic speech recognition provider for professionnals designed for large volumes and high accuracy. Let's collect all data available in conversations, talks, interview with our plug and play API.

Read more about DeepTranscript

Users also considered
Verbit logo

Verbit makes video and audio accessible and more engaging.

learn more
Verbit provides accurate captions & transcription of live and recorded video to make them accessible & engaging to all audiences. These tools help educators, government entities, legal agencies, business leaders and media producers meet the needs of those with disabilities & all engage viewers.

Read more about Verbit

Users also considered
GoSpeech logo

Saas solution for transcription and subtitling

learn more
Saas solution to convert speech to text based on artificial intelligence

Read more about GoSpeech

Users also considered
SpeechTexter logo

Speech recognition and conversion software

learn more
SpeechTexter is a speech recognition and conversion software that helps corporates, teachers, lawyers, writers, and students convert audio files into text. It offers a multi-language speech recognizer as well as document and email transcriber, enabling users to transcribe documents in real-time.

Read more about SpeechTexter

Users also considered
Matesub logo

Subtitling & Transcription in 200+ Languages

learn more
Matesub is a cloud-based closed captioning tool that utilizes AI technology to generate subtitles for videos. This solution offers compatibility with a range of video formats, facilitating transcription and translation across 85 languages. With Matesub, users can generate culturally sensitive subtitles that resonate with international audiences. Additionally, its WYSIWYG frame-level editor helps capture the contextual nuances of the source material.

Read more about Matesub

Users also considered
inspeech logo

Extract the wealth hidden in your client’s voice

learn more
Transform the valuable information contained in the calls you already have into competitive intelligence, customer satisfaction and new business.

Read more about inspeech

Users also considered
ELSA Speech Recognition API logo

Leading Speech Recognition API for Language Learning

learn more
ELSA uses proprietary speech recognition technology and artificial intelligence to help language learners improve their English pronunciation

Read more about ELSA Speech Recognition API

Users also considered
Rev.ai logo

Speech recognition software using asynchronous API

learn more
Rev.ai’s suite of speech-to-text APIs allows businesses to build downstream applications. Speech recognition software built from speech engine trained to transcribe content on various topics with various accents for various industries.

Read more about Rev.ai

Users also considered
3Play Media logo

Closed captioning and transcription solution

learn more
3Play Media is designed to help businesses across media, entertainment, eCommerce, fitness, education, and government sectors handle closed captioning, transcription, audio description, live captioning, and subtitling operations. It enables users to manage podcasts, enhance search engine optimization (SEO) activities, and track audience engagement via a unified platform.

Read more about 3Play Media

Users also considered
INVOX Medical logo

Real-time dictation and transcription of medical reports.

learn more
INVOX Medical is a speech recognition software for real-time dictation and transcription of medical reports. It is compatible with any medical or EHR software and we have specific dictionaries for more than 15 medical specialties to ensure maximum accuracy in dictation transcription.

Read more about INVOX Medical

Users also considered
ELSA Speak logo

Personalized AI-powered language learning software

learn more
Proprietary Speech Recognition and A.I-enabled technology to help students speak English more fluently and effectively. The ELSA API can detect and correct pronunciation errors in scripted and unscripted speech input, providing immediate feedback and recommendations.

Read more about ELSA Speak

Users also considered
Mosaicx logo

Virtual agent and messaging outreach

learn more
Mosaicx uses conversational AI to offer agent-like experiences without human agents. A comprehensive set of service modules means automation creates a better customer experience than ever before.

Read more about Mosaicx

Users also considered