a tale of fail and win (image recovery/management under linux)
use git-annex-assistant to create backups on several destinations
- use test-repo first, do some tests
- try on smaller directories w/ actually valuable data, create backups first
- annex-ize several other directories
- remove picture-backup from external HDD to make space for new backup via git annex (very bad idea)
- annex-ize several GB of pictures dating back to 2004 (RAWs and JPGs)
- fail somehow several times, remove .git directory, start anew
- (do some other stuff)
- get back to the picture-dir, realize that it is _empty_ (besides some folders), .git directory contains nothing
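The "create backups first" step only helps if the backup is actually complete before anything else gets deleted. A minimal sketch of such a check, assuming GNU coreutils (`sha256sum`) and a plain directory copy as the backup; the function names `manifest` and `verify_backup` are made up for this example and are not part of git-annex:

```shell
#!/bin/sh
# Hypothetical helper: print one "checksum  ./relative/path" line per file.
manifest() {
    (cd "$1" && find . -type f -exec sha256sum {} + | sort)
}

# Hypothetical helper: compare a source directory with its backup copy.
# Prints the differing manifest lines and returns non-zero on any mismatch.
verify_backup() {
    src=$1
    dst=$2
    a=$(mktemp) && b=$(mktemp) || return 2
    manifest "$src" > "$a"
    manifest "$dst" > "$b"
    # diff exits 0 only when both manifests are identical.
    status=0
    diff "$a" "$b" || status=$?
    rm -f "$a" "$b"
    return "$status"
}

# Usage (paths are placeholders):
#   verify_backup ~/pictures /mnt/external/picture-backup && echo "backup ok"
```

This is meant for plain directories before they are annex-ized. Only remove or repurpose the old backup once the check comes back clean.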
use ntfsundelete and some proprietary tools to recover files that were only marked as deleted from the NTFS volume (900 GB)
- use git annex fsck on the recovered .git data, get only some pictures back, not very much (about 2k files)
run photorec several times to recover .jpg and .cr2 (RAW) data
try to use picasa on the files to get some sorting (and kick out unwanted data such as images from games etc.)
- picasa somehow mangles the raw-files :(
- picasa does not properly use the exif-provided file-creation date, but a mixture of that and the files’ date :(
fiddle around with exiftool to get back the timestamp from the files’ exif-data
find . -type f -name "*.jpg" -exec exiftool "-FileModifyDate<DateTimeOriginal" {} \;
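exiftool can also walk the directory tree on its own, which avoids starting one process per file and covers the RAW files in the same run. A small sketch, assuming exiftool is installed; the wrapper name `restore_mtime_from_exif` is made up for this example:

```shell
#!/bin/sh
# Hypothetical wrapper: copy DateTimeOriginal into the file modification
# time for every JPG and CR2 file below the given directory (default: .).
restore_mtime_from_exif() {
    dir=${1:-.}
    # -r recurses into subdirectories, -ext limits processing to the listed
    # extensions. The quotes keep the shell from treating "<" as an input
    # redirection.
    exiftool -r -ext jpg -ext cr2 '-FileModifyDate<DateTimeOriginal' "$dir"
}

# Usage (path is a placeholder):
#   restore_mtime_from_exif ~/recovered-pictures
```

Files without a DateTimeOriginal tag should simply keep their current timestamp, so it is worth trying this on a copy of a few files first.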
try digikam
- somehow works
- slow on previews when using ‘import from files’
- slow on DB handling
- hangs itself when moving about 6k (?) files from one folder to another
- switch to MySQL as backend
- somehow fail, try google
- realize that the internal MySQL server won’t do, install external one
- use ‘Settings’ -> ‘Database migration’ before switching via the config
- speed is better
- use the duplicate detection to remove redundant files (takes time …)
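Several photorec runs tend to produce a lot of byte-identical copies, and those can be listed quickly from the shell before letting digikam do its slower search. This is not what digikam does internally. It is a plain checksum comparison that only catches exact duplicates, sketched here assuming GNU coreutils (`sha256sum`, and `uniq` with `-w` and `--all-repeated`); the function name `list_exact_duplicates` is made up for this example:

```shell
#!/bin/sh
# Hypothetical helper: print groups of byte-identical files below a
# directory. Each line is "checksum  path", groups are separated by a
# blank line.
list_exact_duplicates() {
    dir=${1:-.}
    # uniq compares only the first 64 characters (the SHA-256 hex digest)
    # and prints every member of each repeated group.
    find "$dir" -type f -exec sha256sum {} + |
        sort |
        uniq -w64 --all-repeated=separate
}

# Usage (path is a placeholder):
#   list_exact_duplicates ~/recovered-pictures > duplicates.txt
```

Resized or re-encoded copies of the same picture have different checksums, so they still need the duplicate detection in digikam.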