getapp-logo

App comparison

Add up to 4 apps below to see how they compare. You can also use the "Compare" buttons while browsing.

GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Speech Recognition Software - Page 3

Last updated: April 2026

1 filter applied

Features


Integrated with

No filters available


Pricing model


Devices supported


Organization types


User rating


78 software options

SubCap logo

Automatic subtitles software for videos.

learn more
Subcap is a mobile app that provides videos with automatic subtitles. Subcap allows users to upload a video from the gallery or take a video simultaneously. It automatically transcribes the audio to text. To generate subtitles, artifiial intelligence is used for Subcap’s auto-captions maker.

Read more about SubCap

Users also considered
Dictalogic logo

Cloud speech recognition solution

learn more
With the use of digital transformation, we allow a voice to text conversion on the fly, where you just record audio and send it to transcribe as you normally would and the audio converts to text before it reaches the transcriber. We have multiple options on assignment for you to explore.

Read more about Dictalogic

Users also considered
Speech Recognition Cloud logo

Speech recognition for doctors, professionals & students

learn more
Speech Recognition Cloud is cloud-based speech recognition software for doctors, professionals and students. Fast, high-accuracy speech-to-text dictation in Windows apps and the browser. Free option available, plus specialised Medical for clinical terminology and workflows.

Read more about Speech Recognition Cloud

Users also considered
SpeechWrite 360 logo

Speech recognition and mobile dictation software

learn more
SpeechWrite 360 is a cloud-based dictation and voice recognition workflow solution designed to meet the needs of modern professionals requiring flexible and mobile working capabilities. Hosted in the secure Amazon Web Services cloud infrastructure, SpeechWrite 360 requires no onsite servers or IT resources. Users always have access to the latest software version with no additional upgrades or maintenance.

Read more about SpeechWrite 360

Users also considered
Wavel logo

Full Stack Voice AI Solutions for Videos And Localization

learn more
Wavel is an AI-powered video assistant that uses advanced text-to-speech and speech-to-text technology to create captions, subtitles, and dubbing in over 40+ languages. It offers voiceover customization with 250+ emotions and integrates with popular platforms like YouTube and Vimeo

Read more about Wavel

Users also considered
Ebby logo

Cloud-based transcriptions software

learn more
Ebby helps lawyers, podcasters, journalists, researchers, and academic professionals convert audio recordings into text documents using AI technology. The built-in editor automatically synchronizes and plays audio or video files with text data, letting users review and edit transcripts in real-time.

Read more about Ebby

Users also considered
Irma logo

Cloud-based and AI-enabled meeting notes tool

learn more
Irma is a cloud-based AI meeting assistant that helps automatically capture meeting notes.

Read more about Irma

Users also considered
Aveni Assist logo

Automated meeting capture and compliance for advisers

learn more
Aveni Assist transcribes adviser–client meetings with high accuracy, applies speaker diarisation, and links content to CRM records. Compliance checks analyse transcripts for risks and maintain a searchable audit trail.

Read more about Aveni Assist

Users also considered
Listener logo

Speech to Text - fast and reliable

learn more
Listener is a product that transcribes speech to text in real-time. It supports multiple languages and domains and provides high accuracy, speech adaptation, timestamps, speaker diarization, and flexible model deployment.

Read more about Listener

Users also considered
TENIOS Voice API logo

Integration platform for integrating telephony applications

learn more
TENIOS Voice API facilitates the seamless integration of speech services into your cloud telephony using standard web technologies. This API includes a variety of functions that enable software applications to initiate and receive calls, eliminating the need for developers to handle TK technologies.

Read more about TENIOS Voice API

Users also considered
Philips SpeechExec logo

Use the power of your voice with professional dictation

learn more
Philips SpeechExec Pro Dictation and Transcription Software is designed for authors to focus on recording with their preferred voice recorder, download dictations quickly, and automatically route to assistants or speech recognition to transcribe files.

Read more about Philips SpeechExec

Users also considered
Voximal logo

A phone platform based on Asterisk propulsed by Voximal

learn more
Voximal is Asterisk's VoiceXML engine with state-of-the-art of latest text to speech and speech to text on-premise or online offer.

Read more about Voximal

Users also considered
GoVivace logo

Conversational AI and speech analytics solution

learn more
GoVivace is a conversational AI and speech analytics solution. It provides intelligent omnichannel chatbots and voice bots for businesses of all sizes.

Read more about GoVivace

Users also considered
ValueFlow logo

AI voice agents for conducting interviews

learn more
Interview agents that conduct voice-based interviews automatically. Simply share pre-configured interview links with your audience.

Read more about ValueFlow

Users also considered
Vocova logo

AI transcription & translation for audio/video

learn more
Vocova is an AI-powered transcription tool that converts audio and video files into text across more than one hundred languages. The software features automatic speaker identification, word-level timestamps, and the ability to import content directly from over one thousand platforms including YouTube, TikTok, and various podcast hosts. Users can translate transcripts into more than one hundred forty languages and export results in multiple formats such as PDF, DOCX, SRT, and VTT.

Read more about Vocova

Users also considered
Infercall logo

AI phone answering service for small businesses

learn more
Infercall is an AI-powered phone answering service that handles incoming calls for businesses around the clock. The platform trains on business information by extracting data from websites and uploaded documents, enabling it to answer questions about services, pricing, and availability. Infercall supports simultaneous call handling, appointment scheduling, lead qualification, call transfers, and provides full call transcripts and recordings with analytics.

Read more about Infercall

Users also considered
AICHE logo

AI-enabled software that transforms voice into text

learn more
AICHE transforms voice into polished text with one hotkey. Speak naturally - the AI delivers clean, structured output instantly copied to your clipboard. Available on Windows, Mac, Linux with privacy-first zero audio retention.

Read more about AICHE

Users also considered
Amical logo

AI-based open-source speech-to-text application

learn more
Amical is an open-source speech-to-text application powered by generative AI technology that enables users to convert spoken words into text without using a keyboard. The application automatically understands context across different platforms, formatting dictation appropriately whether for professional emails or casual social media posts, while maintaining user privacy and delivering accurate transcriptions.

Read more about Amical

Users also considered
Picovoice logo

Developer-first platform for adding voice to anything

learn more
The first and only ubiquitous on-device voice AI platform. Picovoice offers speech-to-text, voice search, wake word, intent and voice activity detection engines. Its stack can run on anything from embedded devices to web browsers, providing an immersive experience not achievable by any Big Tech.

Read more about Picovoice

Users also considered
Akkadu logo

AI subtitles & interpretation for global chat.

learn more
Akkadu offers a range of innovative solutions for making meetings and events multilingual, whether they are on-site, hybrid, or online. With Akkadu, users can add remote simultaneous interpretation (RSI), AI subtitles, or human live captioning to enhance language accessibility for participants.

Read more about Akkadu

Users also considered
CommPeak Speech-to-Text logo

AI call insights with transcripts, summaries, tone analysis

learn more
Turn call recordings into actionable insights with AI analysis. Access searchable transcripts, AI-generated summaries, and sentiment analysis to identify which conversations need follow-up attention. Use real conversations to coach agents and strengthen training programs.

Read more about CommPeak Speech-to-Text

Users also considered
Help Genie logo

Support That Sells

learn more
Help Genie is a fully branded AI voice and chat support platform for small and medium businesses. Every Genie is trained on your documentation, speaks in your brand voice, and handles calls and chats 24/7, no technical setup required.

Read more about Help Genie

Users also considered
Sesame logo

Voice biometric identification system

learn more
Sesame by Utopia.AI is a cloud-based voice biometric identification solution which uses natural speech to identify callers in real time, by creating voice prints from previous calls without requiring caller enrollment. The software can also analyze caller vocabulary, sentiment, and emotional state.

Read more about Sesame

Users also considered
Heynds logo

AI-enabled writing and speech assistant

learn more
Heynds is an AI Writing and Speech Assistant desktop app for Mac and Windows, coming soon to Linux. It's designed to make your writing workflow much faster and easier. You can say goodbye to slow typing, writer's block, and endless editing.

Read more about Heynds

Users also considered
Authenti logo

Voice biometrics solution for identification & verification

learn more
Authenti is a voice biometric solution that verifies identity using vocal characteristics. It can be used for various applications, such as transactions, IVR, remote access, digital signatures, multi-factor authentication, and workforce management, healthcare, travel and education.

Read more about Authenti

Users also considered