GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Big Data Software with Data Cleansing (2026) - Page 3

Last updated: April 2026

103 software options

Keatext

Your most impactful improvements start here.

Keatext is a text analytics solution that delivers AI-based recommendations and ready-to-share reports leveraging GPT to improve customer experience.

Amazon EC2 Spot

Web-based software for increasing Amazon EC2 capacity

Amazon EC2 Spot is designed to help software developers increase compute capacity in the AWS Cloud and resize instances to match business requirements. Organizations can use spot instances to launch, run, and maintain various fault-tolerant applications, including big data, CI/CD, stateless web servers, API endpoints, analytics tools, and rendering workloads.

Cloudera Enterprise

Data cloud platform that delivers self-service analytics

Cloudera is an enterprise data cloud platform designed to help businesses in financial services, manufacturing, telecommunications, retail, technology, insurance, healthcare, public sector, education, energy, and utilities use self-service analytics across multi-cloud and hybrid environments.

OpenText Analytics Database

Big data analytics and machine learning solution

Vertica is a powerful big data analytics platform that enables organizations to analyze their data on-premises, in the cloud, or on Hadoop. The analytics capabilities are enhanced by machine learning and predictive analytics, which enable users to uncover insights and identify patterns in their data.

DataHero

Visualize data from spreadsheets & cloud services

DataHero is a cloud-based data visualization solution that allows users to create custom dashboards and charts with data from cloud services and spreadsheets.

Qrvey

Embedded Analytics for SaaS Companies

Qrvey is the embedded analytics platform designed specifically for SaaS companies.

Upsolver

Data lake platform, on-premises or in the cloud

Upsolver’s data lake platform helps developers integrate, manage, and structure streaming data for analysis, whether on-premises or in the cloud, through a set of advanced stream processing algorithms and an intuitive drag-and-drop interface.

Mozart Data

The easiest and fastest way to set up a modern data stack.

Backed by award-winning data analyst support, Mozart Data’s all-in-one modern data platform empowers anyone to centralize, organize, and analyze their data without engineering resources. Instead of piecing together tools, companies get everything needed to spin up a reliable data stack in an hour.

Pachyderm

The Leader in Data Versioning and Pipelines for MLOps

Pachyderm is the leader in data versioning and pipelines for MLOps. We help data science teams operationalize the data tasks in their ML lifecycle to iterate on data more quickly and reliably. Pachyderm’s data foundation allows data science teams to automate and scale their machine learning lifecycle.

Chaossearch

Turn your AWS S3 into a hot, searchable analytic data lake.

CHAOSSEARCH is a fully managed log analytics platform for big data that leverages your AWS S3 as a data store. Our revolutionary technology radically lowers costs for analyzing log data at scale (i.e. Big Data!), passing those savings on to you! Try CHAOSSEARCH for your big data analysis challenges!

Hopsworks

Machine learning (ML) application building platform

Hopsworks is an open-source enterprise feature store, a productivity platform for developing and operating machine learning (ML) pipelines at scale. Users can manage their AI data, including features and models, across feature, training, and inference pipelines.

Datameer

Datameer Cloud: Your Data Transformation Solution

Datameer Cloud simplifies data transformation for data engineers. Optimize analytics, job management, and data accessibility with ease.

Trendalyze

Visualize, search & monitor for micro-trends in data

Leverage Hadoop platforms and AWS, Azure, GCP, and OCI big data cloud services.

WinPure

Web-based email verification solution

WinPure is a web-based solution that helps businesses across healthcare, banking, education, insurance, and other sectors validate and verify email addresses. It lets managers maintain email lists, flag email addresses with fake or improper names, and upload files in CSV format.

Zerve

Platform for managing data science and AI/ML development

Zerve is the agentic data workspace designed for anyone who works with data, from solo analysts and data scientists to business users and teams. Zerve brings together exploration, advanced analysis, collaboration, and production deployment in a single AI-native environment.

Alooma

Data pipeline as a service

Alooma's data pipeline as a service exports, transforms, and loads data into BigQuery, Redshift, and Snowflake for analytics, AI, machine learning, BI, and reporting.

SAP BW/4HANA

Enterprise data warehouse

SAP BW/4HANA is an enterprise data warehouse based on SAP HANA. The solution is designed to simplify modelling and administration and includes an intuitive user experience. SAP BW/4HANA allows organizations to share data across the entire enterprise, streamline processes, and support innovations with a single source for real-time insights.

Tonic

Realistic test data generation for developers

Tonic.ai offers a developer platform for data de-identification, synthesis, and provisioning to keep test data secure, accessible, and in sync across testing and development environments. Get the data you need to shorten your sprints, catch more bugs, and ship better products faster.

OpenText Analytics Cloud

Data visualization and analysis solution for business

OpenText Magellan is a fully integrated AI and analytics platform that lets business users access, blend, and explore data quickly, and apply advanced and predictive analytics techniques through a drag-and-drop experience that doesn't depend on IT or a data expert.

BOEM

A data toolbox for business management & decision-making.

BOEM is cloud-based software, designed specifically for managers, that collects, processes, and analyzes data in real time. Using KPIs and artificial intelligence, it sends automatic reports and notifications to help businesses grow.

Millimetric.ai

AI-enabled key performance indicator analysis platform

Millimetric.ai is automated KPI analysis software designed to help businesses use artificial intelligence and machine learning algorithms to monitor, identify, and understand anomalies, trends, and relationships across enterprise datasets. Marketing teams can connect the platform with several data sources and conduct root cause diagnostics on issues in real time.

Etleap

Cloud-based Redshift ETL tool

Etleap is a cloud-based Redshift ETL tool that allows users to combine data from multiple sources in a Redshift warehouse and apply custom data transformations.

Intelligent Engagement Platform

Customer engagement and experience software

NGDATA offers an intelligent engagement platform that builds rich customer data profiles to create truly personalized customer experiences with in-built real-time interaction management.

Lumada DataOps Suite

Data automation and management software

Lumada DataOps unlocks business value by enabling businesses to operationalize data management with automation and collaboration. Lumada DataOps helps businesses build DataOps practices to improve operations via an intelligent data operations platform. Users can automate data pipeline scalability, lower costs, and activate production deployments through continuous integration and delivery across hybrid cloud environments.

Luz Analytics

Knowing is better.

A competitor insights platform with a database that allows brands to track their competitors’ product sales volume down to the SKU level.
