Deduplication: Why Computers See Differences in Files that Look Alike to You, by Craig Ball, Ball In Your Court


An employee of an e-discovery service provider asked me to help him explain to his boss why deduplication works well for native files but frequently fails when applied to TIFF images. The question intrigued me because answering it requires that we dip our toes into the shallow end of cryptographic hashing and dispel a common misconception about electronic documents. . . .