Back to Blog

What Your PDF Reveals About You

That innocent-looking PDF you're about to share might be telling the recipient more than you realize. Your name, your computer's username, the software you used, when you created it, and every edit you've made, it's all potentially embedded in the file.

The Metadata You Don't See

Every PDF carries invisible baggage. When you create or edit a document, your software automatically embeds information called metadata. This data doesn't appear when you view the PDF, but anyone who knows where to look can extract it.

Common metadata fields include:

  • Author: Often your full name or username
  • Creator: The application used to create the document
  • Producer: The software that generated the PDF
  • Creation date: When the document was first created
  • Modification date: The last time it was edited
  • Title and subject: Sometimes auto-filled from content

Real-World Consequences

Metadata leaks have caused real problems. Journalists have been identified through document metadata. Legal documents have revealed editing histories that contradicted official timelines. Whistleblowers have been exposed because the PDF they shared contained their organization's software license information.

In 2003, the British government published a dossier about Iraq. Metadata revealed the document had been edited by multiple government officials, contradicting claims about its origins. The metadata told a different story than the government intended to share.

Beyond Basic Metadata

PDFs can contain more than just document properties. Depending on how they were created, they might include:

  • Embedded fonts: Can reveal what software and operating system you use
  • Hidden layers: Content that was "deleted" but not actually removed
  • Comments and annotations: Review notes you thought were removed
  • Form field data: Previous entries in fillable forms
  • JavaScript: Embedded code that could track when and where the document is opened
  • Attached files: Documents embedded within the PDF

The Redaction Problem

Perhaps the most dangerous assumption is that covering text with a black box removes it. This has led to countless embarrassing and sometimes dangerous information leaks.

When you draw a black rectangle over text in most PDF editors, the text is still there underneath. Anyone can select it, copy it, or remove the rectangle to reveal what's beneath. True redaction requires actually removing the text from the document, not just hiding it visually.

Checking Your Own PDFs

Before sharing a PDF, consider what information it might contain. Most PDF readers let you view document properties, which shows basic metadata. For deeper inspection, tools like pdfinfo or online metadata viewers can reveal more.

Ask yourself:

  • Does the author field contain my real name?
  • Does the creation date reveal when I actually worked on this?
  • Did I properly redact sensitive information, or just cover it?
  • Are there any comments or annotations I forgot to remove?

Protecting Your Privacy

If you need to share PDFs without revealing metadata, you have options:

  • Print to PDF: Creating a new PDF from the print dialog often strips most metadata
  • Use dedicated tools: Some PDF tools can remove metadata and flatten documents
  • Convert and reconvert: Converting to images and back removes embedded data (though quality may suffer)
  • Check before sharing: Always inspect the final file before sending it

A Matter of Awareness

Most metadata exposure isn't malicious, it's accidental. Software adds this information by default, and most users never think to check. The solution isn't paranoia; it's awareness.

Next time you're about to share a PDF, take a moment to consider what invisible information might be traveling with it. Your documents might be saying more than you intended.