i have nextcloudpi and the following problem. On my harddisk which is connected to the raspberry i have a huge amount of images and a lot of them are duplicates but with different file name. I´m looking for a command line tool to find duplicate images. The scanning of the images should analyze the images not the file name.
Is it possible with fdupe, or is this tool only scanning for duplicate files with same file name?
i have fdupe installed on my Pi. Will it really find duplicate images even if they have a different file name? DupeGuru works fine for me on my winodws laptop. I connect the raspi images folder to my computer and scan - that is working but needs a long time.
I would go directly to my raspi and try searching on the command line, therefore i have installed fdupe…my question is only will it really find duplicate images ?
ok seems that fdupe is the easiest way to find duplicate files and while it takes the md5 checksum it will work correctly.
i think antoher way finding duplicate images can be done on the command line like follows:
find Pictures/ -type f -exec md5sum ‘{}’ ‘;’ | sort | uniq --all-repeated=separate -w 15 > dupes.txt
Finds duplicates by generating and matching an md5sum hash on each file, and then using sort and uniq to print all the photo filenames in a text file, with duplicates listed together and separated by a blank line. It finds only duplicates, and will not count files that are not duplicated