this post was submitted on 02 Jul 2023
Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ
That wouldn't work for something like Pathfinder PDFs from the Paizo website. They add a text watermark with the name and email associated with your account on their site to each page of the document. It's not metadata, it's part of the actual page data.
Why would the checksum differ between downloads if there was a watermark with user-identifiable data?
Just checked one of my Paizo PDFs, and in addition to my account name and email address, the watermark also contains the date and time that I downloaded the PDF. Presumably they append the file creation time when the PDF is being signed, so two downloads from the same account would still produce different checksums.
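To illustrate the checksum point, here's a minimal sketch. It does not touch real PDFs: `make_stamped_copy` is a hypothetical stand-in that just appends an account name and a timestamp to some placeholder bytes, and the stamp format is made up for the example. The only thing it shows is that any change in the embedded timestamp changes the SHA-256 digest of the whole file.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as a hex string."""
    return hashlib.sha256(data).hexdigest()


def make_stamped_copy(body: bytes, account: str, downloaded_at: str) -> bytes:
    """Hypothetical stand-in for a per-download stamp.

    Appends the account and download time to the file contents.
    The real watermark format is unknown; this is only an illustration.
    """
    stamp = f"\n% {account} {downloaded_at}\n".encode("utf-8")
    return body + stamp


# Placeholder contents, not a valid PDF.
body = b"%PDF-1.7 placeholder page content"

first = make_stamped_copy(body, "user@example.com", "2023-07-02T10:00:00Z")
second = make_stamped_copy(body, "user@example.com", "2023-07-02T10:05:00Z")
same = make_stamped_copy(body, "user@example.com", "2023-07-02T10:00:00Z")

# Same account, different download time: the digests differ.
print(sha256_hex(first) == sha256_hex(second))

# Same account and same download time: the digests match.
print(sha256_hex(first) == sha256_hex(same))
```

So even if the account details are identical, a per-download timestamp alone is enough to make every copy hash differently.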
Fair, then reprinting won't help. I'd go ahead and come up with a Python script that exports all pages as PNG, edits that specific portion of every image, and recompiles them into a PDF. I'm not sure if there is a tool that can already do that out of the box.
Unfortunately, then you lose things like text and links. I think the only real solution for my specific example (which, to be clear, might not be OP's dilemma) is to crack and directly edit the binary data of the PDF file.