BudVitoff
Honorable Member
- Joined
- Sep 8, 2012
- Messages
- 70
- Thread Author
- #1
My daughter is running Win10 on a Dell laptop. Because she has four kids, she has thousands of photos and the list keeps growing. Thanks to some difficulties with downloading photos from her iPhone in the recent past, she has many duplicate photos and, unfortunately, many descriptive filenames have changed.
As an old programmer, I can’t stop thinking that I should able to help her detect and remove the duplicates. The basis for this is that although the filenames may be different, the actual photos should be identical, bit for bit, even including the supporting data on lens settings, orientation, etc. I don’t know "C" or its derivatives. The last language I used that might allow me to manipulate the layout of a record is BASIC. (I would be happiest if I could go back to my old mainframe days when I could work down at the bit level if necessary.)
There are packages that remove duplicates. I’ve worked with one from the CleanMyPC people. I’ve also thought about using iTunes to find the duplicates. The only idea I’ve had touches on setting the actual filename aside as a comment, converting the photo bits to hex (ASCII characters, should work just as well while cutting the filename length in half) and using the result as a "filename" to check for duplicates. Actually, I would only need to convert a relatively small initial part of the photo — this would give me some false positives but nothing more serious than that. That’s as far as I’ve gotten.
I’d appreciate any helpful comments and suggestions on this venture.
As an old programmer, I can’t stop thinking that I should able to help her detect and remove the duplicates. The basis for this is that although the filenames may be different, the actual photos should be identical, bit for bit, even including the supporting data on lens settings, orientation, etc. I don’t know "C" or its derivatives. The last language I used that might allow me to manipulate the layout of a record is BASIC. (I would be happiest if I could go back to my old mainframe days when I could work down at the bit level if necessary.)
There are packages that remove duplicates. I’ve worked with one from the CleanMyPC people. I’ve also thought about using iTunes to find the duplicates. The only idea I’ve had touches on setting the actual filename aside as a comment, converting the photo bits to hex (ASCII characters, should work just as well while cutting the filename length in half) and using the result as a "filename" to check for duplicates. Actually, I would only need to convert a relatively small initial part of the photo — this would give me some false positives but nothing more serious than that. That’s as far as I’ve gotten.
I’d appreciate any helpful comments and suggestions on this venture.