Built for real‑world documents: proposals, reports, letters, and more.
Find 100% duplicates and similar files with small edits using SimHash + TF‑IDF.
Upload folders with structure preserved. Handles PDFs, DOCX, and TXT.
Password prompts for protected PDFs/DOCX so nothing gets missed.
Highlight exact and near sentence matches to review changes quickly.
Preview PDFs inline and render DOCX as clean HTML for fast review.
Soft‑delete duplicates and let them auto‑expire in 7 days.
Export an entire folder (with structure) as a single zip.
Tune near‑duplicate sensitivity live to fit your documents.
Select a folder and we preserve its structure while extracting text from PDFs, DOCX, and TXT.
We compute robust signatures and similarities to find exact and near duplicates—even with minor edits.
Preview, compare sentences, archive unwanted copies (auto‑expires), or download cleaned folders as zip.