March 30, 2026
How Perceptual Hashing Finds Your Best Photo From Duplicates

You just got back from a weekend trip. You shot 400 photos. And now, scrolling through your camera roll, you're staring at 15 nearly identical shots of the same sunset, 12 versions of the same group photo, and a burst of 20 frames where your friend was mid-laugh. They all look the same, but they're not. One is slightly sharper. One has better lighting. One caught the perfect expression.
Finding that one best photo out of dozens of near-duplicates is tedious work. It's the kind of task that makes people abandon their photo libraries entirely. But there's a surprisingly elegant technology working behind the scenes to solve this problem: perceptual hashing. It's how tools like Photopicker can scan hundreds of photos, group the near-duplicates together, and automatically select the best version from each cluster.
Let's break down how this works, why it's different from what you might expect, and what happens after duplicates are found.
What Perceptual Hashing Actually Does (And Why Regular Comparisons Fail)
When most people think about finding duplicate files, they think about exact matches. Your computer can compare two files byte-by-byte and tell you if they're identical. This works great for documents or spreadsheets, but it falls apart completely with photos.
Here's why: take a single photo and resize it. Every byte in the file changes. Crop it slightly. Different bytes. Adjust the brightness by 1%. Different again. Save it as a JPEG instead of PNG. Completely different file. To a byte-level comparison, these are all unique photos. To your eyes, they're obviously the same image.
Perceptual hashing solves this by ignoring the file data entirely and focusing on what the image looks like. Instead of reading bytes, a perceptual hash algorithm analyzes the visual content of a photo and generates a compact fingerprint, typically just 64 bits long, that represents the image's visual structure.
The key insight is that visually similar images produce similar hashes, even if the underlying file data is completely different. Two photos taken half a second apart during a burst, one slightly brighter than the other, will generate hashes that are nearly identical. Meanwhile, two completely different photos will produce hashes with no meaningful similarity.
How the Fingerprint Gets Created
The most widely used approach is called pHash, short for perceptual hash. The pHash library documents the core technique, which works roughly like this:
- Shrink the image to a tiny size, typically 32x32 pixels. This strips away fine details and forces the algorithm to focus on broad visual structure.
- Convert to grayscale. Color information is discarded because two versions of the same photo with slightly different white balance should still match.
- Apply a frequency transform. Specifically, a Discrete Cosine Transform (DCT) pulls out the dominant visual patterns in the image, similar to how JPEG compression works internally.
- Extract the low-frequency components. These represent the large-scale visual structure of the image: where the bright and dark regions are, the general layout of shapes and contrast.
- Generate a binary hash. Each component is compared to the average value. Above average becomes a 1, below average becomes a 0. The result is a compact string of bits.
This 64-bit fingerprint captures the essence of what the photo looks like. It's remarkably stable across resizing, minor cropping, brightness adjustments, and compression changes. And because the hash is so small, comparing thousands of photos against each other becomes computationally trivial.
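As a rough sketch of the steps above (not any particular library's implementation), the whole pipeline fits in a few lines of NumPy. The median threshold here is a common robust stand-in for the average comparison described in the last step:

```python
import numpy as np

def phash(gray, hash_size=8, img_size=32):
    """Sketch of a perceptual hash: shrink, DCT, keep low frequencies, threshold.
    `gray` is a 2-D float array; a real pipeline would decode and convert
    an image file to grayscale first."""
    # Naive nearest-neighbour shrink to img_size x img_size; real systems
    # use proper resampling, but the idea is the same.
    h, w = gray.shape
    ys = np.arange(img_size) * h // img_size
    xs = np.arange(img_size) * w // img_size
    small = gray[np.ix_(ys, xs)].astype(float)

    # 2-D DCT-II via an explicit cosine matrix (avoids a scipy dependency).
    n = img_size
    k = np.arange(n)
    dct = np.cos(np.pi * (2 * k[None, :] + 1) * k[:, None] / (2 * n))
    freq = dct @ small @ dct.T

    # Keep the low-frequency top-left block and threshold each component.
    low = freq[:hash_size, :hash_size]
    return (low > np.median(low)).flatten()  # 64 boolean "bits"
```

Because a brightness change only rescales or shifts the frequency components, the bits that come out the other side are almost entirely unchanged, which is exactly the stability the article describes.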
There's also a complementary technique called dHash (difference hash), which works by comparing the brightness of adjacent pixels rather than using frequency analysis. It's simpler, faster, and particularly good at catching photos that differ only in small ways, like consecutive burst shots. Many robust duplicate detection systems, including the one Photopicker uses, combine both pHash and dHash to catch the widest range of near-duplicates.
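A dHash sketch is even shorter, since no frequency transform is involved; each bit just records whether a pixel is brighter than its right-hand neighbour (again assuming an already-grayscale array, not any specific library's code):

```python
import numpy as np

def dhash(gray, hash_size=8):
    """Difference hash sketch: compare each sampled pixel to its
    right-hand neighbour. Brightness shifts preserve every comparison,
    so the hash is naturally exposure-invariant."""
    h, w = gray.shape
    # Sample a hash_size x (hash_size + 1) grid so each row yields
    # hash_size left/right comparisons.
    ys = np.arange(hash_size) * h // hash_size
    xs = np.arange(hash_size + 1) * w // (hash_size + 1)
    small = gray[np.ix_(ys, xs)]
    return (small[:, 1:] > small[:, :-1]).flatten()  # 64 bits
```

Note that a uniform brightness or contrast change never flips an "is brighter than" comparison, which is why dHash is so good at matching burst frames whose only difference is auto-exposure drift.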
Measuring Similarity With Hamming Distance
Once every photo has its hash, finding duplicates becomes a matter of comparing fingerprints. The comparison uses something called Hamming distance, which simply counts how many bits differ between two hashes.
If two 64-bit hashes differ by 0 bits, the images are perceptually identical. If they differ by 5 or fewer bits, they're almost certainly near-duplicates, perhaps the same scene with slightly different framing or exposure. If they differ by 20 or more bits, they're probably completely different photos.
This threshold-based approach is what makes perceptual hashing so powerful for real-world photo management. It doesn't demand perfection. It tolerates the messy reality of how people actually take photos: burst modes, slight camera movements, auto-exposure adjustments between frames, and minor composition shifts.
From Duplicate Groups to Picking the Winner
Finding near-duplicates is only half the problem. Once you know that 12 photos are essentially the same shot, you still need to figure out which one to keep. This is where perceptual hashing hands off the work to something more sophisticated.
The grouping process itself is called duplicate clustering. When you upload a batch of photos, the system computes perceptual hashes for every image, then compares them all against each other. Photos with Hamming distances below the threshold get grouped into clusters. Each cluster represents one "moment" or "shot" that you captured multiple times.
For smaller batches of up to a few thousand photos, every photo gets compared against every other photo directly. For larger collections, the system uses hash-prefix bucketing to narrow down comparisons, only checking photos whose hashes already share a partial match. This keeps the process fast even when you're working with thousands of images.
But here's where things get interesting. Within each cluster, the system needs to pick a winner. And perceptual hashing can't do that, because all the photos in a cluster are, by definition, visually similar. The hash says "these look alike" but it can't say "this one looks better."
That's where AI scoring takes over. Each photo in a duplicate cluster gets evaluated across multiple dimensions: technical quality, aesthetic appeal, composition strength, sharpness, and exposure accuracy. These scores get combined into a composite rating using a weighted formula: 30% quality, 25% aesthetic, 20% composition, 15% sharpness, and 10% exposure.
The photo with the highest composite score within each cluster becomes the winner. The rest get flagged as duplicates with a scoring penalty applied, pushing them lower in your final ranked results.
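Given those weights, the winner selection reduces to a weighted sum and a sort. A sketch, assuming each photo's per-dimension scores arrive as a plain dict on a 0-100 scale:

```python
# Weights from the composite formula described in the text.
WEIGHTS = {"quality": 0.30, "aesthetic": 0.25, "composition": 0.20,
           "sharpness": 0.15, "exposure": 0.10}

def composite(scores):
    """Weighted composite rating from the five per-dimension scores."""
    return sum(WEIGHTS[dim] * scores[dim] for dim in WEIGHTS)

def pick_winner(cluster):
    """Return (winner_id, losers) for a cluster of {photo_id: scores}.
    Losers would then receive the duplicate scoring penalty."""
    ranked = sorted(cluster, key=lambda pid: composite(cluster[pid]),
                    reverse=True)
    return ranked[0], ranked[1:]
```

Because the weights sum to 1.0, a photo scoring 80 on every dimension gets a composite of 80, which keeps the combined rating on the same 0-100 scale as the inputs.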
If you're curious about what each of those scoring dimensions actually measures and how AI evaluates something as subjective as "aesthetic appeal," there's a detailed breakdown at What AI Photo Quality Scores Actually Measure Explained.
A Real-World Example
Imagine you uploaded 300 photos from a family gathering. The perceptual hashing step might identify 40 duplicate clusters. Maybe one cluster contains 8 nearly identical shots of your grandmother blowing out birthday candles. Seven of those have slight motion blur, closed eyes, or slightly off exposure. One frame caught the perfect moment: sharp focus on her face, candles glowing warmly, a genuine smile.
The AI scores that frame highest for sharpness and aesthetic quality. It becomes the cluster winner. Instead of scrolling through 8 versions trying to spot the differences yourself, you get the best one surfaced automatically, and the other 7 are deprioritized in your results.
This pattern repeats across every cluster. By the end, your 300 photos might be distilled down to 180 unique moments, each represented by its strongest version. The duplicates aren't deleted. They're still accessible. But the cognitive load of choosing between near-identical shots disappears entirely.
Why This Matters More Than Manual Culling
Professional photographers have been manually culling photos for decades. They load images into software like Lightroom, zoom to 100%, and flip between similar shots looking for the sharpest focus, the best expression, the cleanest composition. It works, but it's slow and mentally exhausting.
For everyone else (people sorting through vacation photos, parents organizing pictures of their kids, event coordinators processing hundreds of shots from a company retreat), manual culling simply doesn't happen. The photos sit in a camera roll or cloud folder, unsorted, because the effort required to find the best ones exceeds most people's patience.
Perceptual hashing changes that equation dramatically. The entire comparison process, even for hundreds of photos, takes seconds. You're not zooming in on pixels or flipping between tabs. The algorithm handles the tedious matching work, and the AI scoring handles the quality judgment.
Where Human Eyes Still Struggle
There's another advantage that doesn't get discussed enough: perceptual hashing catches duplicates that humans miss. When you're scrolling through a camera roll, you notice the obvious duplicates: the three consecutive shots you took because you weren't sure the first one worked. But you often miss the subtler cases.
Maybe you took a photo, then walked five steps to the right and took another. Your brain registers these as "different shots" because you moved, but perceptually they're nearly identical: same subject, same lighting, same composition with a tiny perspective shift. Or maybe you took a photo, reviewed it on your camera screen, then took another one, but now 50 other photos sit between them in your timeline. You'd never think to compare them.
Perceptual hashing doesn't care about sequence or timing. It compares every photo against every other photo based purely on visual content. It finds duplicates that are separated by hundreds of other images in your timeline. It catches the ones your eyes gloss over because they weren't taken back-to-back.
This is especially valuable for event photography, where you might take 500 or more photos across several hours. Without automated duplicate detection, the difference between a well-curated album and an overwhelming dump of images comes down to how much time someone is willing to spend sorting. Perceptual hashing removes that bottleneck.
The Tiered Results That Follow
After duplicate clustering and scoring, the photos get sorted into tiers. S-tier captures your top 10%, the standout shots with an overall score of 80 or above. A-tier covers the next 20%, strong images scoring 60 and up. B-tier fills the middle, and everything else falls into a pass category. Duplicate cluster losers receive scoring penalties that naturally push them toward lower tiers, even if the photo itself isn't bad. It just wasn't the best version of that particular shot.
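Combining the percentile cuts with the score floors described above might look like the following sketch. The B/Pass boundary (40, here) is an assumption, since the text doesn't specify one:

```python
def assign_tiers(scored, s_cut=80, a_cut=60, b_cut=40):
    """Tier photos by composite score, per the thresholds in the text:
    S = top 10% scoring >= 80, A = next 20% scoring >= 60,
    B = the middle, Pass = everything else. The b_cut floor is a
    hypothetical value for illustration."""
    ranked = sorted(scored.items(), key=lambda kv: kv[1], reverse=True)
    n = len(ranked)
    tiers = {}
    for rank, (pid, score) in enumerate(ranked):
        if rank < n * 0.10 and score >= s_cut:
            tiers[pid] = "S"
        elif rank < n * 0.30 and score >= a_cut:
            tiers[pid] = "A"
        elif score >= b_cut:
            tiers[pid] = "B"
        else:
            tiers[pid] = "Pass"
    return tiers
```

Note how both conditions must hold: a photo ranked in the top 10% but scoring below 80 falls through to A or B, which keeps a weak batch from inflating its S-tier.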
This tiering system means you can jump straight to your S-tier results and find a curated set of your strongest, most unique photos without wading through redundant copies. If you want to explore further, the lower tiers are there. But most people find what they need in the top two tiers.
For anyone dealing with large photo collections, whether from a vacation, a wedding, or just years of accumulated camera roll clutter, this combination of perceptual hashing and AI scoring is the fastest path from "I have too many photos" to "Here are my best shots."
Try It on Your Own Photo Collection
The beauty of perceptual hashing is that it works on any collection of photos, regardless of how they were taken, when they were taken, or how disorganized they are. Burst shots from your phone, continuous shooting mode from a DSLR, screenshots that are nearly identical, event photos from multiple angles. It handles all of it.
If you've been putting off sorting through your photos because the task feels overwhelming, you don't need to do it manually. Photopicker lets you upload up to 500 photos at once (or up to 10GB) with no signup required. The system runs perceptual hashing to find duplicates, scores every image with AI across five quality dimensions, clusters the near-duplicates, picks the best version from each group, and ranks everything into tiers.
You'll see your results with watermarked previews on the free tier. If you want to download your ranked photos as a clean ZIP file, Starter and Pro plans unlock full-resolution downloads along with expanded upload limits for larger collections.
Stop scrolling through identical sunset photos trying to spot the sharpest one. Let perceptual hashing do the comparison, and let AI pick the winner. Your best photos are already in there. You just need the right tool to find them.