Most ebooks' tags (Author and Title) are not reliable. i-DeClone instead examines the text content of books to decide whether they are the same. Its AI algorithm can work around minor differences (e.g. one book missing a table of contents or cover page). If two books are mostly the same, they will be identified as duplicates. i-DeClone considers only the pure text content of your ebooks, ignoring formatting, embedded pictures, etc.
Applies to
e-Books (MOBI, AZW, PDF, EPUB, etc)
Step by step instructions:
➀ Connect devices to scan
If you want to scan external disks, connect them now; otherwise you can simply scan folders on your PC. Click the Start scan toolbar button to begin, then click Start new project to set up the scan settings from scratch.
➁ Scan options
Set the scan category to e-Books and the folder to search to the folder where you keep your books, or to something generic like My Documents or your entire PC.
Set the scan mode to Find similar files and choose the tolerance percent level (the default of 90% is a good bet). By default the similarity algorithm examines the text content of books, so it also discovers the same book stored in different formats (EPUB, AZW, MOBI, PDF...). To double-check this, switch to the advanced tab and ensure that the option Files must have same extension is not ticked. When everything is set, click Start scan and wait for the results. If you have lots of PDF books, the search will take some time; just leave i-DeClone to work unattended and come back when it has finished.
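To give a feel for what a tolerance percent level means, here is a minimal Python sketch of text-only comparison. It is not i-DeClone's actual algorithm, which is not documented here; it swaps in the standard library's `difflib.SequenceMatcher` as a stand-in, and the names `normalize`, `similarity_percent` and `is_similar` are hypothetical.

```python
import re
from difflib import SequenceMatcher


def normalize(text: str) -> list[str]:
    """Reduce text to lowercase words, discarding punctuation and layout."""
    return re.findall(r"[a-z0-9]+", text.lower())


def similarity_percent(text_a: str, text_b: str) -> float:
    """Word-level similarity of two texts on a 0-100 scale."""
    words_a, words_b = normalize(text_a), normalize(text_b)
    if not words_a and not words_b:
        return 100.0  # two empty texts are trivially identical
    matcher = SequenceMatcher(None, words_a, words_b, autojunk=False)
    return 100.0 * matcher.ratio()


def is_similar(text_a: str, text_b: str, tolerance: float = 90.0) -> bool:
    """True when the similarity reaches the tolerance level (assumed 90%)."""
    return similarity_percent(text_a, text_b) >= tolerance


# The same 100-word "book", once with a short table of contents in front.
book = " ".join(f"w{i}" for i in range(100))
with_toc = "Contents Chapter One " + book
print(is_similar(book, with_toc))
```

Because only the extracted words are compared, the file format does not matter in this sketch: an EPUB and a PDF of the same book would be compared on the same footing once their text has been extracted.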
Comparing document text requires system plugins (IFilters) that can extract text from various e-Book formats. We have compiled all these filters into a single easy-to-install plugin, which you must download to scan ebook contents.
◪ Click to download ebook search & preview plugin
➂ Mark and remove duplicates
Use the checkboxes to mark duplicate items for removal, then remove them to free up space. Use the Mark wizard to choose the originals (which will be kept) based on, for example, their file size. Finally, click the Clean-up button to delete the marked duplicates. This is a standard procedure explained in detail in the documentation.
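The idea of picking an original by file size can be sketched in Python as follows. This is only an illustration: the Mark wizard's real rules are described in the documentation, and both the "keep the largest file" rule and the names `BookFile` and `mark_for_removal` are assumptions made for this example.

```python
from typing import NamedTuple


class BookFile(NamedTuple):
    path: str
    size_bytes: int


def mark_for_removal(group: list[BookFile]) -> list[BookFile]:
    """Keep the largest file of a duplicate group and return the rest.

    Assumption: the largest file is treated as the original. On a tie,
    the first of the largest files in the list is kept.
    """
    if len(group) < 2:
        return []  # nothing to remove when there is no duplicate
    original = max(group, key=lambda f: f.size_bytes)
    return [f for f in group if f is not original]


# Hypothetical duplicate group: one book in three formats.
group = [
    BookFile("books/novel.epub", 410_000),
    BookFile("books/novel.pdf", 2_300_000),
    BookFile("books/novel.mobi", 520_000),
]
marked = mark_for_removal(group)
print([f.path for f in marked])
```

Nothing is deleted here; the function only returns the files that would be marked, which mirrors the separation between marking and the final Clean-up step.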