Detect and delete approximate duplicate rows in Excel using Levenshtein similarity scoring. Set your own threshold. Pro tier. Browser-local.
Free is enough for most one-off jobs. Pro raises the file and batch caps; Pro + Media unlocks GB-scale streaming and unlimited duration.
Larger files supported on Developer (5 GB CSV) and Enterprise (unlimited). All processing happens in your browser — files never reach a server.
Upload your Excel or CSV file and select the key column to deduplicate on
Levenshtein distance scores every pair of values — rows above your threshold are flagged
Download the deduplicated file with a report of which rows were removed
0 bytes uploaded. Duplicate Purge (Fuzzy) runs entirely in your browser using SheetJS and ExcelJS. Your spreadsheets never leave your device.
85% works well for company names and addresses. 95% is better for personal names where small differences matter. 70% catches more aggressive duplicates like abbreviations.
Yes — exact dedup only removes 100% identical values. Fuzzy dedup catches 'Acme Corp' and 'Acme Corporation' as near-duplicates based on string similarity.