GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Speech Recognition Software - Page 2

Last updated: April 2026

60 software options

wolkvox

Communication management system for contact centers with AI

Design interactive customer experiences with our ASR functionality, which allows you to interact with IVRs, virtual agents and other IT systems.

ClearTouch Operator

Cloud Contact Center Platform Provider

ClearTouch's speech recognition converts spoken conversations into structured, searchable data in real time. It helps analyze customer intent, detect trends, ensure compliance, and uncover service gaps—enabling faster decision-making, improved agent performance, and a better customer experience.

Taption

AI-driven subtitles, translations and video editing

Taption is a feature-rich platform that automatically generates high-quality transcripts, translations, and subtitles for videos. The platform's leading AI technology converts audio or video content into text in over 40 languages, allowing users to create embedded bilingual subtitles, labeled speaker transcripts, and translations for their video projects. Taption's intuitive editing tools make it easy to trim and adjust the text to align with video edits, ensuring a polished final product.

GoSpeech

SaaS solution for transcription and subtitling

AI-based SaaS solution that converts speech to text.

ELSA Speak

Personalized AI-powered language learning software

Proprietary speech recognition and AI-enabled technology that helps students speak English more fluently and effectively. The ELSA API can detect and correct pronunciation errors in scripted and unscripted speech input, providing immediate feedback and recommendations.

CallHippo

Cloud-based phone system for sales, support & growing teams

CallHippo is an easy-to-use virtual phone system that offers robust functionality, including advanced features, extensive reporting, and seamless integrations, to help sales and service teams have effective conversations with customers. It comes with 24x7 world-class support and instant setup.

Twilio

Build, Scale, and Operate Customized Communication Solutions

Twilio offers an API for phone services that enables companies to make and receive phone calls and to send and receive text messages. It allows programmers to integrate various communication methods and to use existing web development skills and code to solve communication problems.

Matesub

Subtitling & Transcription in 200+ Languages

Matesub is a cloud-based closed captioning tool that utilizes AI technology to generate subtitles for videos. This solution offers compatibility with a range of video formats, facilitating transcription and translation across 85 languages. With Matesub, users can generate culturally sensitive subtitles that resonate with international audiences. Additionally, its WYSIWYG frame-level editor helps capture the contextual nuances of the source material.

Amberscript

Web-based speech recognition software

Amberscript is a suite of software products that transform audio and video files into searchable text and subtitles. Create closed captions and subtitles to improve accessibility and save time and money.

INVOX Medical

Real-time dictation and transcription of medical reports.

INVOX Medical is speech recognition software for real-time dictation and transcription of medical reports. It is compatible with any medical or EHR software and includes specific dictionaries for more than 15 medical specialties to ensure maximum accuracy in dictation transcription.

Snowfly

Employee engagement, gamification & corporate wellness tool

Snowfly is an employee engagement and gamification software designed to help businesses measure the performance of employees and engage them through incentives and rewards. It enables organizations to create, implement, and manage recognition programs to improve employee experience (EX) and satisfaction.

Rev.ai

Speech recognition software using an asynchronous API

Rev.ai's suite of speech-to-text APIs allows businesses to build downstream applications. The software is built on a speech engine trained to transcribe content on a range of topics, in a range of accents, for a range of industries.

LilySpeech

Speech to text software

LilySpeech is a Windows application that allows users to type with their voice. It is highly accurate and works with any Windows application.

3Play Media

Closed captioning and transcription solution

3Play Media is designed to help businesses across media, entertainment, eCommerce, fitness, education, and government sectors handle closed captioning, transcription, audio description, live captioning, and subtitling operations. It enables users to manage podcasts, enhance search engine optimization (SEO) activities, and track audience engagement via a unified platform.

Trint

Automated transcription platform with AI

Trint is a cloud-based audio and video transcription solution that leverages artificial intelligence (AI), machine learning, and natural language processing (NLP) to automatically transcribe audio from a range of file formats and generate an interactive, searchable, editable, and shareable transcript.

Maestra

Speech to text, closed captioning & transcription software

Maestra is speech-to-text software designed to help educators, researchers, marketers, journalists, and media houses automatically add transcriptions, captions, subtitles, and voiceovers to audio and video files in real time. The platform enables professionals to translate text into various languages, including English, French, Spanish, and German.

SubCap

Automatic subtitles software for videos.

SubCap is a mobile app that adds automatic subtitles to videos. Users can upload a video from the gallery or record a new one, and the app automatically transcribes the audio to text. SubCap's auto-caption maker uses artificial intelligence to generate the subtitles.

Yactraq

Speech Analytics for Contact Centers

Yactraq is an interactive analytics tool that integrates emotion detection, speech analytics, and predictive intelligence to understand customer communications across multiple channels (phone, email, social media).

TalkMark

AI‑enabled transcription and summarization tool

TalkMark is an advanced AI-powered transcription and summarization tool that converts speech to text with 95%+ accuracy and offers speaker identification, fast processing, and secure EU-hosted infrastructure. It is aimed at professionals, students, creators, and enterprises.

TENIOS Voice API

Platform for integrating telephony applications

TENIOS Voice API facilitates the seamless integration of speech services into your cloud telephony using standard web technologies. The API includes a variety of functions that enable software applications to initiate and receive calls, eliminating the need for developers to handle telecommunications technologies.

Wavel

Full Stack Voice AI Solutions for Videos And Localization

Wavel is an AI-powered video assistant that uses advanced text-to-speech and speech-to-text technology to create captions, subtitles, and dubbing in more than 40 languages. It offers voiceover customization with 250+ emotions and integrates with popular platforms like YouTube and Vimeo.

Dictalogic

Cloud speech recognition solution

Dictalogic converts voice to text on the fly: you record audio and send it for transcription as you normally would, and the audio is converted to text before it reaches the transcriber. Multiple assignment options are available to explore.

Speech Recognition Cloud

Speech recognition for doctors, professionals & students

Speech Recognition Cloud is cloud-based speech recognition software for doctors, professionals, and students. It provides fast, high-accuracy speech-to-text dictation in Windows apps and the browser. A free option is available, along with a specialised Medical option for clinical terminology and workflows.

SpeechWrite 360

Speech recognition and mobile dictation software

SpeechWrite 360 is a cloud-based dictation and voice recognition workflow solution designed to meet the needs of modern professionals requiring flexible and mobile working capabilities. Hosted in the secure Amazon Web Services cloud infrastructure, SpeechWrite 360 requires no onsite servers or IT resources. Users always have access to the latest software version with no additional upgrades or maintenance.

Ebby

Cloud-based transcription software

Ebby helps lawyers, podcasters, journalists, researchers, and academic professionals convert audio recordings into text documents using AI technology. The built-in editor automatically synchronizes and plays audio or video files with text data, letting users review and edit transcripts in real-time.
