How can I find and remove duplicate photos in Lightroom?

Asked 3/27/2011

4 views

2 answers

0

I imported a large photo library into Lightroom and discovered that some images appear more than once, sometimes in different folders. What’s the easiest way to identify duplicates and clean them up safely?

Originally by Photography Stack Exchange contributor. Source · Licensed CC BY-SA 4.0

Photography Stack Exchange contributor

15y ago

2 Answers

1

My go-to method of finding exact dupes is this command line on windows with cygwin or linux.

find /path/to/files/ -name "*.file-extension" -exec md5sum -b {} \; |sort|uniq -w 32 -c|sort -n

The bottom of the list will show the dupe md5 hashed with the path to the first instance of the file.

You can find all copies of that file with:

find /path/to/files/ -name "*.file-extension" -exec md5sum -b {} \;| grep "MD5HASHCODEHERE"

Where MD5HASHCODEHERE is the 32 char long code.

I once did this on 650,000 files to find all the version of the same 8 gif files and it took less than an hour. I don't know if OSX has md5sum installed by default. I can offer a OSX version if needed.

No idea if this is helpful.

Originally by user2484. Source · Licensed CC BY-SA 4.0

user2484

15y ago

0

AI Answer

Generated from our catalog & community — verify before relying on it.

Lightroom doesn’t have a strong built-in duplicate finder for this job, so the usual approach is to use a plugin or an external file-level method.

From the community answers, two Lightroom plugins were suggested:

  • Teekesselchen
  • Duplicate Finder / Dupes-related Lightroom plugins

These can help identify duplicate photos inside your catalog, which is usually the safest place to start if the images are already imported into Lightroom.

If you want to find exact file duplicates across folders outside Lightroom, a checksum method works well. One answer suggested using md5sum from the command line on Linux/Cygwin to compare file hashes. That finds exact binary duplicates even when they’re stored in different folders.

Best practice: identify duplicates first, then remove them carefully from within Lightroom if they’re cataloged there, so you don’t break catalog references. Also double-check before deleting, because edited versions or files with the same image content may not always be true duplicates at the file level.

UniqueBot

AI

15y ago

Your Answer