Analyze Features Contact Intelligence Encrypt Editor Watermark
security 100% Browser-Based & Secure

File Intelligence &
Metadata Analyzer

Analyze metadata, check privacy risks, and clean sensitive information — all processed locally in your browser. Your files never leave your device.

cloud_upload

Drop files here or click to upload

Analyze metadata, check privacy risks, and clean sensitive information

JPG PNG PDF DOCX MP3 MP4

Understanding Metadata

Hidden data inside your files tells more than you think. Learn what metadata is and why it matters for your privacy.

Metadata is often described as "data about data." According to the National Information Standards Organization (NISO), metadata is structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource (NISO, 2004). Every digital file you create or share—from smartphone photos to PDF contracts—embeds invisible tags that document its origin, author, location, device, and history.

In the context of digital forensics and privacy, metadata serves as a digital fingerprint. The Library of Congress notes that metadata can include creation dates, modification history, software used, and geospatial coordinates (Library of Congress, Metadata). For photographers, journalists, lawyers, and corporate teams, this hidden layer of information can be both an asset and a significant liability.

Research from Stanford University Library emphasizes that metadata management is critical for digital preservation and privacy. Without proper inspection, sharing a document can inadvertently expose sensitive organizational or personal details. UniDoc Metadata Analyzer was built to bridge this gap—offering a client-side, zero-knowledge environment to inspect and sanitize this hidden data before it reaches unintended recipients.

How It Works

A simple four-step process to inspect and clean your file metadata without ever uploading to a server.

1
Upload Your File
Drag and drop any image, PDF, Office document, audio, or video file directly into your browser.
2
Extract Metadata
Our engine parses EXIF, XMP, IPTC, PDF properties, Office Core Properties, and media tags locally.
3
Analyze Risks
Get an instant privacy risk score with clear indicators for GPS location, authorship, and device data.
4
Clean & Export
Remove sensitive metadata fields and download a clean version of your file ready for sharing.

Processing Workflow

Visualizing how your files are analyzed locally from upload to clean export.

upload_file
Local Upload
File selected directly in browser via File API
data_object
Parse & Extract
EXIF, PDF Info, Core / Extended Office XML
security_scan
Privacy Score
Risk engine flags sensitive or identifying data
cleaning_services
Sanitize & Save
Metadata stripped and file rebuilt locally

Our Promise to Your Data

Privacy isn't a feature. It's the foundation of how we built this tool.

computer

100% Client-Side Processing

Your files are processed entirely within your browser using JavaScript. We do not operate an analysis backend or cloud pipeline that handles your raw files. References a web privacy best-practice model aligned with OWASP Web Security guidelines for client-side data handling.

cloud_off

Zero Data Transmission

No file contents are transmitted to UniDoc servers. The only network requests made are for the application code itself. This zero-knowledge model mirrors privacy architectures recommended by the Electronic Frontier Foundation (EFF) for sensitive document workflows.

no_encryption

No Persistent Storage

Files remain in memory (RAM) during analysis and are immediately discarded when you close or refresh the page. We do not use localStorage, cookies, or IndexedDB to cache your uploaded documents, following NIST guidelines on ephemeral data processing.

lock

Encrypted Memory Sandbox

Analysis runs within your browser's secure JavaScript sandbox. Modern browser memory isolation acts as a natural barrier, ensuring no other application or browser tab can access the file data you are inspecting.

Analyzer Features

Comprehensive tooling to inspect, score, and clean metadata across every major file type.

photo_camera

EXIF & Image Analysis

Deep extraction of EXIF, IPTC, and XMP metadata from JPEG, PNG, TIFF, and WEBP files including GPS coordinates, camera model, and lens information using the CIPA EXIF standard.

picture_as_pdf

PDF Document Audit

Inspect PDF document info dictionaries and XMP metadata. Identify author names, creation software, PDF producer version, and modification history based on ISO 32000 structure.

article

Office Metadata View

Read Core and Extended properties embedded in DOCX, XLSX, and PPTX files. Detect author, company, template path, revision identifiers, and total editing time from OOXML internals.

audiotrack

Audio & Video Tags

Extract ID3, Vorbis comments, and container metadata from MP3, FLAC, WAV, MP4, WEBM, and AVI files. Review encoder settings, recording dates, and embedded cover art.

shield

Privacy Risk Score

Automated heuristics assign a risk level based on the presence of personally identifiable information (PII), location data, software signatures, and authorship fields.

auto_fix_high

One-Click Metadata Cleaner

Strip sensitive metadata while preserving file integrity. Cleaned files are rebuilt locally for immediate download with no server round-trip.

Where to Use These Features

Industries and scenarios where metadata inspection is essential for security and compliance.

camera

Photography & Journalism

Remove GPS coordinates and camera serial numbers from images before publishing. Protects source location and expensive equipment investment details.

gavel

Legal & Compliance

Audit document metadata during eDiscovery to ensure no privileged author or editing history is leaked. Meets requirements for metadata hygiene in legal discovery.

corporate_fare

Corporate Security

Sanitize PDF and Office files shared externally with partners or clients. Eliminates exposure of internal usernames, network paths, and organizational templates.

public

Web Publishing

Clean images and documents before uploading to websites or social platforms. Prevents unintentional leakage of editing software, personal names, and location histories.

school

Academic Research

Verify data provenance by inspecting embedded file history. Ensure that shared datasets and published figures do not contain hidden edits or personal authorship trails.

contact_emergency

Personal Privacy

Before selling or donating devices, verify that exported photos and documents have been scrubbed of personal identifiers, geotags, and account-linked metadata.

Frequently Asked Questions

Expert answers backed by standards bodies and academic research.

What are the three main types of metadata?
According to NISO and the Library of Congress, the three broad categories of metadata are:
  • Descriptive Metadata: Used for discovery and identification (e.g., title, author, keywords, description).
  • Structural Metadata: Describes how compound objects are put together (e.g., page order, section hierarchies, file formatting).
  • Administrative Metadata: Provides information to help manage a resource, such as creation date, file type, access rights, and preservation metadata (NISO, Understanding Metadata).
What is meant by metadata?

Metadata is literally "data about data." In a technical context, it refers to the embedded attributes and tags within a file that are not part of the visible content but describe the file's properties, creation, and usage history. The Dublin Core Metadata Initiative defines metadata as structured information for describing resources to facilitate their discovery and management (Dublin Core, 2020). Examples include EXIF tags in images, PDF document info dictionaries, and Office Core Properties. The Stanford University Digital Library Systems describe metadata as essential for both resource discovery and maintaining the authenticity and context of digital objects over time.

What type of file metadata can be viewed?

UniDoc Metadata Analyzer can inspect a wide range of formats and their corresponding metadata types:

  • Images: EXIF, IPTC-NAA, XMP, GPS, TIFF tags — supported via the CIPA EXIF 2.3 standard as implemented by ExifJS and other libraries.
  • PDF: Document Information Dictionary, XMP Packet (Title, Author, Subject, Keywords, Creator, Producer, CreationDate, ModDate).
  • Office: DOCX, XLSX, PPTX Core and Extended Properties — based on ISO/IEC 29500 (OOXML) specifications used by Microsoft Office and LibreOffice.
  • Audio / Video: ID3v1/v2, Vorbis Comments, MP4 atoms (moov.meta), WAV chunks. Temporal metadata, encoder settings, and embedded thumbnails are all exposed.

Libraries such as pdf-lib (GitHub: Hopding/pdf-lib) and jszip (GitHub: Stuk/jszip) are leveraged to decode these formats locally without relying on proprietary APIs.

Why should we delete metadata?

Deleting metadata reduces digital fingerprinting and privacy risks. Research published by the Electronic Frontier Foundation (EFF) highlights that EXIF data in images can reveal precise GPS coordinates, camera serial numbers, and timestamps that expose personal movement patterns and identity. In corporate environments, the SANS Institute warns that PDF and Office metadata can leak internal usernames, organization names, template paths, and editing history that may be exploited in social engineering or competitive intelligence. For journalists and activists, the Reporters Without Borders recommends stripping metadata to protect source anonymity. Cleaning metadata before publication or sharing is considered a standard practice in digital OPSEC and data hygiene.

What is the free software to view metadata?

Several open-source and free tools exist for viewing and cleaning metadata, each with distinct strengths:

  • ExifTool by Phil Harvey — a Perl-based command-line utility widely regarded as the gold standard for reading and writing metadata across thousands of file types.
  • Exif.js (GitHub: exif-js/exif-js) — a lightweight JavaScript library for extracting EXIF data from images in the browser.
  • pdf-lib (GitHub: Hopding/pdf-lib) — a TypeScript library for creating and modifying PDF documents, including reading their metadata dictionaries.
  • ExifCleaner — an open-source desktop app (GitHub: szymonpk/exifcleaner) that provides an easy GUI for stripping EXIF data using ExifTool under the hood.
  • UniDoc Metadata Analyzer — this web-based tool combines the capabilities of the above libraries into a zero-installation, client-side interface that runs in any modern browser without uploading files.
How do I view metadata on my computer?

On most operating systems, you can inspect basic metadata natively, but deeper inspection requires specialized tools:

  • Windows: Right-click a file and select Properties > Details to see basic EXIF, PDF, or Office metadata. For complete EXIF extraction, PowerShell can access extended file properties.
  • macOS: Right-click and select Get Info for basic data. Use Preview > Tools > Show Inspector for EXIF on images. The command-line tool mdls queries native macOS metadata (Spotlight attributes).
  • Linux: Use command-line tools like exiftool, pdfinfo, or exiv2 to read embedded tags directly. Package managers (apt, dnf, pacman) distribute these utilities.
  • Browser-Based: For a cross-platform solution that requires no installation, UniDoc Metadata Analyzer runs entirely in your browser using the FileReader API to inspect files locally, supported by the same standard libraries used by desktop applications.

For academic and professional standards, Harvard Library maintains guides on using command-line tools and open-source catalogs to manage metadata for digital preservation and privacy (Harvard Library, Digital Stewardship).

Sources & References

Academic, standard body, and open-source references used throughout this guide.

menu_book
NISOUnderstanding Metadata (2004)
account_balance
Library of CongressDigital Preservation & Metadata Standards
school
Stanford University LibraryMetadata and Digital Preservation Research
label_important
Dublin CoreDCMI Metadata Terms (2020)
security
OWASPClient-Side Data Security Best Practices
privacy_tip
Electronic Frontier FoundationWhat Is Metadata? & Privacy Guides
gavel
SANS InstituteMetadata Leakage in PDF and Office Documents
campaign
Reporters Without BordersGuide to Securing Metadata in Digital Journalism
code
GitHub / Hopdingpdf-lib — PDF creation and modification library
photo_library
GitHub / exif-jsEXIF JavaScript Library for image metadata
folder_zip
GitHub / StukJSZip — JavaScript library for ZIP files
local_library
Harvard LibraryDigital Stewardship & Metadata Management