Discover duplicate e-Books in your collection

Home » Example projects
Over the years you may have downloaded the same book many times. Books from Project Gutenberg for example come with cryptic filenames like PG5200.MOBI so it's hard to tell what you have in your collection. You may also have the same book in many different formats, e.g. both in PDF, MOBI and EPUB.

Most ebooks' tags (Author and Title) are not reliable. i-DeClone examines the text content of books to make sure that they are identical. Its AI algorithm can work around minor differences (e.g one book missing a table of contents or cover page). If two books are mostly the same, they will be identified. i-DeClone considers only the pure text content of your ebooks, ignoring formatting, embedded pictures etc.

Situation
Multiple book downloads

Applies to
e-Books (MOBI, AZW, PDF, EPUB, etc)



Step by step instructions:

➀ Connect devices to scan

If you want to scan external disks, connect them, or just scan your PC folder contents. Click on Start scan toolbar button to begin. Then click Start new project to setup scan settings from scratch.

➁ Scan options

Set the scan category to e-Books and the folder to search to whichever folder you keep your books, or something generic like My Documents or your entire PC

Set the scan mode to Find similar files and the tolerance percent level (the default 90% is a good bet).

By default the similarity algorithm will examine book text contents, and will also discover the same books in various formats as well (EPUB,AZW,MOBI,PDF...). You can double check this is the case switching to advanced tab and ensure the option Files must have same extension is not ticked.

All set, click Start scan and wait for the results. If you have lots of PDF books, the search is going to take some time, but just leave i-DeClone to do its work unattended, and get back to it when it is finished.
Comparing document text requires system plugins that can extract text (IFilters) from various e-Book formats. We have compiled all these filters in a single easy to install plugin, which you must download to scan ebooks contents.
Click to download ebook search & preview plugin
  scan options dialog

➂ Mark and remove duplicates

Use the checkboxes to mark duplicate items for removal, then remove them to clean up space. Use Mark wizard to choose the originals (which will be kept) depending e.g. on their file size. Finally click Clean-up button to start deleting the marked duplicates. This is a standard procedure explained in detail in the documentation


Try it out for yourself:   DOWNLOAD FREE TRIAL
©2021-2025 ZABKAT, all rights reserved | Contact | Privacy policy