GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Speech Recognition Software with API (2026) - Page 2

Last updated: March 2026

52 software options

Google Cloud Speech-to-Text

Speech-to-Text Solution

Google Cloud Speech-to-Text enables users to convert audio into text so they can work faster and more efficiently.

Speech Recognition Engine

AI-enabled speech recognition engine

LumenVox’s speech and voice software uses artificial intelligence, natural language understanding, and deep machine learning to deliver speech recognition. Its neural networks make it easier to add new languages and dialects and serve a more diverse base of users.

Rythmex

Speech to text, transcription, medical transcription

Rythmex is an AI transcription solution offering real-time, multilingual transcription services in a user-friendly interface. With its intuitive features, API integrations, and robust data security, Rythmex is aimed at individuals and businesses seeking an accurate transcription solution.

Twixor

Customer communications management software

Twixor EnCaps is a low-code customer engagement platform that helps businesses deliver personalized interactions. The software utilizes generative AI and natural language processing functionalities on messaging platforms to create customer journeys. The CX platform combines digital assistant and intelligent process automation to deliver personalized interactions.

Mosaicx

Virtual agent and messaging outreach

Mosaicx uses conversational AI to offer agent-like experiences without human agents. A comprehensive set of service modules means automation creates a better customer experience than ever before.

Verbit

Verbit makes video and audio accessible and more engaging.

Verbit provides accurate captions and transcription of live and recorded video to make it accessible and engaging to all audiences. These tools help educators, government entities, legal agencies, business leaders, and media producers meet the needs of people with disabilities and engage all viewers.

Matesub

Subtitling & Transcription in 200+ Languages

Matesub is a cloud-based closed captioning tool that utilizes AI technology to generate subtitles for videos. This solution offers compatibility with a range of video formats, facilitating transcription and translation across 85 languages. With Matesub, users can generate culturally sensitive subtitles that resonate with international audiences. Additionally, its WYSIWYG frame-level editor helps capture the contextual nuances of the source material.

GoSpeech

SaaS solution for transcription and subtitling

SaaS solution that uses artificial intelligence to convert speech to text.

Rev.ai

Speech recognition software using asynchronous API

Rev.ai’s suite of speech-to-text APIs allows businesses to build downstream applications. The software is built on a speech engine trained to transcribe content on a variety of topics, in a variety of accents, for a variety of industries.

3Play Media

Closed captioning and transcription solution

3Play Media is designed to help businesses across media, entertainment, eCommerce, fitness, education, and government sectors handle closed captioning, transcription, audio description, live captioning, and subtitling operations. It enables users to manage podcasts, enhance search engine optimization (SEO) activities, and track audience engagement via a unified platform.

SoapBox

Speech recognition software designed to detect kid speech

SoapBox Labs builds speech recognition technology for kids. Its proprietary technology is built to deliver private and accurate results for kids ages 2 to 12 of all accents and dialects. SoapBox Labs also takes into account kids’ unpredictable speech patterns and behaviors.

DeepTranscript

Transcribe large volumes of conversation

DeepTranscript is an automatic speech recognition provider for professionals, designed for large volumes and high accuracy. Its plug-and-play API collects the data available in conversations, talks, and interviews.

Kili

Training data platform for enterprise AI

Kili is a training data platform designed to help businesses in banking, manufacturing, and healthcare industries streamline the entire training process for artificial intelligence (AI) and machine learning models, from connecting with raw data sources to preparing models, processing information, and training. The platform enables organizations to handle multiple machine learning projects and process videos, images, text, and other types of data.

inspeech

Extract the wealth hidden in your client’s voice

Transform the valuable information contained in the calls you already have into competitive intelligence, customer satisfaction and new business.

ELSA Speech Recognition API

Leading Speech Recognition API for Language Learning

ELSA uses proprietary speech recognition technology and artificial intelligence to help language learners improve their English pronunciation.

Voci

Speech analytics software with event-level metadata

Voci is a cloud-based and on-premise speech analytics software designed to help businesses gain insights into voice data using AI technology and deep learning algorithms. It lets teams transcribe large volumes of audio into analyzable text via high-speed DDR4 SDRAMs.

AI-Powered Voice Assistants

Customer experience software for eCommerce businesses

AI-Powered Voice Assistants is a conversational marketing software that helps businesses recognize speech, interpret human language and optimize communications. Administrators can automate various repetitive tasks including insurance premium payment reminders and debt collection processes.

Voximal

A phone platform based on Asterisk, powered by Voximal

Voximal is a VoiceXML engine for Asterisk that offers state-of-the-art text-to-speech and speech-to-text, available on-premise or online.

GoVivace

Conversational AI and speech analytics solution

GoVivace is a conversational AI and speech analytics solution. It provides intelligent omnichannel chatbots and voice bots for businesses of all sizes.

TENIOS Voice API

Platform for integrating telephony applications

TENIOS Voice API integrates speech services into your cloud telephony using standard web technologies. The API includes functions that let software applications initiate and receive calls, so developers do not need to handle telecommunications technologies themselves.

Listener

Speech to Text - fast and reliable

Listener is a product that transcribes speech to text in real-time. It supports multiple languages and domains and provides high accuracy, speech adaptation, timestamps, speaker diarization, and flexible model deployment.

Authenti

Voice biometrics solution for identification & verification

Authenti is a voice biometric solution that verifies identity using vocal characteristics. It can be used for applications such as transactions, IVR, remote access, digital signatures, multi-factor authentication, and workforce management, as well as in healthcare, travel, and education.

Wavel

Full Stack Voice AI Solutions for Videos And Localization

Wavel is an AI-powered video assistant that uses advanced text-to-speech and speech-to-text technology to create captions, subtitles, and dubbing in 40+ languages. It offers voiceover customization with 250+ emotions and integrates with popular platforms like YouTube and Vimeo.

Call automation bot

Automate customer phone calls

Call Automation Bot is a cloud-based conversational AI platform that provides features such as call center coordination, call forwarding, speech-to-text processing, intent recognition, and solution flows.

Reteta

Cloud-based medical transcription tool for doctors.

Reteta is a cloud-based healthcare technology solution that transforms patient-physician conversations into comprehensive medical diagnoses and treatment notes. The platform provides automated speech recognition (ASR) models that allow medical professionals to recognize medical terminology, medication names, and multiple speakers to generate detailed clinical documentation.
