• 0 Posts
  • 3 Comments
Joined 1 year ago
cake
Cake day: July 2nd, 2023

help-circle


  • You can of course remove the metadata, but you can’t really be sure you removed all watermarks hidden in the actual content, unless you can make two downloads from different sources have the same hash with whatever method you’re going to use. That way you’d know for certain that you caught whatever was inserted to identify you. Anything other than metadata will be very hard to find and remove in an automated way unless you already know exactly what you’re looking for though.

    That said, this is how I’ve cleaned up metadata in batches of PDF files using qpdf and exiftool in the past:

    for file in *.pdf; do
        exiftool -all:all= -overwrite_original "$file"
        qpdf --linearize --replace-input "$file"
    done