GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Online Speech Recognition Software - Page 4

Last updated: March 2026

89 software options

GoVivace

Conversational AI and speech analytics solution

GoVivace is a conversational AI and speech analytics solution. It provides intelligent omnichannel chatbots and voice bots for businesses of all sizes.

CommPeak Speech-to-Text

AI call insights with transcripts, summaries, tone analysis

Turn call recordings into actionable insights with AI analysis. Access searchable transcripts, AI-generated summaries, and sentiment analysis to identify which conversations need follow-up attention. Use real conversations to coach agents and strengthen training programs.

Aveni Assist

Automated meeting capture and compliance for advisers

Aveni Assist transcribes adviser–client meetings with high accuracy, applies speaker diarisation, and links content to CRM records. Compliance checks analyse transcripts for risks and maintain a searchable audit trail.

CogniAIX

AI-based tool turning conversations into tracked tasks

CogniAIX is an AI-based productivity tool that transcribes conversations and extracts actionable items from audio recordings. The software allows users to upload audio files or record directly through their microphone, then automatically identifies decisions, commitments, and action items from the transcribed content. CogniAIX converts these extracted elements into assigned tasks with designated owners and provides automated task tracking and follow-up capabilities.

Amical

AI-based open-source speech-to-text application

Amical is an open-source speech-to-text application powered by generative AI technology that enables users to convert spoken words into text without using a keyboard. The application automatically understands context across different platforms, formatting dictation appropriately whether for professional emails or casual social media posts, while maintaining user privacy and delivering accurate transcriptions.

Reteta

Cloud-based medical transcription tool for doctors

Reteta is a cloud-based healthcare technology solution that transforms patient-physician conversations into comprehensive medical diagnoses and treatment notes. The platform's automated speech recognition (ASR) models recognize medical terminology, medication names, and multiple speakers, helping medical professionals generate detailed clinical documentation.

Speak2mail

Voice to text email generator for Gmail

Speak2mail is a Chrome extension that converts voice dictation into email text directly within Gmail. The tool uses speech recognition technology to transcribe spoken words in real time and applies artificial intelligence to generate context-aware email responses. Speak2mail integrates with Gmail, supports writing in fourteen languages, and includes translation capabilities for composing messages in recipients' preferred languages.

Vocova

AI transcription & translation for audio/video

Vocova is an AI-powered transcription tool that converts audio and video files into text across more than one hundred languages. The software features automatic speaker identification, word-level timestamps, and the ability to import content directly from over one thousand platforms including YouTube, TikTok, and various podcast hosts. Users can translate transcripts into more than one hundred forty languages and export results in multiple formats such as PDF, DOCX, SRT, and VTT.

ValueFlow

AI voice agents for conducting interviews

ValueFlow provides interview agents that conduct voice-based interviews automatically. Users share pre-configured interview links with their audience.

VALT

Speech recognition solution

VALT is browser-based audio/video capture software for healthcare, education, government, and corporate use. It lets users record, manage, stream, and search content, with features including live observation of nine sessions, customizable data templates, and secure sharing.

Irma

Cloud-based and AI-enabled meeting notes tool

Irma is a cloud-based AI meeting assistant that helps automatically capture meeting notes.

AICHE

AI-enabled software that transforms voice into text

AICHE transforms voice into polished text with one hotkey. Speak naturally, and the AI delivers clean, structured output that is instantly copied to your clipboard. It is available on Windows, Mac, and Linux, with privacy-first zero audio retention.

Speech Recognition Cloud

Speech recognition for doctors, professionals & students

Speech Recognition Cloud is cloud-based speech recognition software for doctors, professionals, and students. It provides fast, high-accuracy speech-to-text dictation in Windows apps and the browser. A free option is available, along with a specialised Medical option for clinical terminology and workflows.

Gladia

Multilingual speech to text transcription API

Gladia provides an audio transcription API that converts speech to text through both asynchronous and real-time processing capabilities. The platform supports over one hundred languages and offers features including speaker diarization, sentiment analysis, named entity recognition, and word-level timestamps with sub-three-hundred-millisecond latency for real-time transcription.
