DataCleaner is a Free App Data Quality Analysis, Profiling, Cleansing, Duplicate Detection and more
DataCleaner is a data quality analysis application and a solution platform for DQ solutions. It's core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging.
Features
- Profiles and analyzes your database within minutes!
- Access almost any datastore - Oracle, MySQL, PostgreSQL, MS SQL Server, MongoDB, CUBRID, CSV files, Excel spreadsheets, dbase and more
- Discover patterns in your textual data with the Pattern Finder
- Find out which values occur the most with the Value Distribution profile
- Cleanse your contact details with name and address validations
- Detect duplicates using fuzzy logic and configurable weights and thresholds
- Merge your duplicates and create a single version of the truth
- Write data back to relational databases, CSV files, Excel spreadsheets or MongoDB databases
Platforms
- Windows
- Linux
- macOS
License
GNU Library or Lesser General Public License version 3.0 (LGPLv3)