GroupDocs.Metadata for Python via .NET Overview
Leave feedback
On this page
What is GroupDocs.Metadata?
GroupDocs.Metadata for Python via .NET is a native Python library that reads, edits, removes, and exports metadata across documents, images, audio, video, and many other formats. It works with the most notable metadata standards — XMP, EXIF, IPTC, Image Resource Blocks, and ID3 — as well as format-specific and built-in document properties, and exposes them all through one unified, predicate-driven API regardless of file format. It runs entirely on-premise, requires no Microsoft Office installation, and ships as a pre-built wheel on Windows, Linux, and macOS.
Typical uses include:
Privacy and compliance — strip author, GPS location, comments, and hidden data from files before they leave your organization.
Digital asset management — read and edit EXIF, IPTC, and XMP fields on images to drive search, tagging, and cataloguing.
AI / content preprocessing — extract metadata as structured data (JSON, CSV) to enrich search indexes and LLM context. See Agents and LLM Integration.
Auditing — inspect documents for format, MIME type, encryption state, and statistics without modifying them.
Batch normalization — set or update properties (titles, dates, copyright) across many files with a single predicate.
With its powerful and straightforward API, you can:
Work with the most popular metadata standards: XMP, EXIF, IPTC, Image Resource Blocks, ID3, document properties, etc.
Predefined tags manipulate common properties (author, creation date, title) uniformly across every format.
On-premise
No cloud calls, no Microsoft Office install, no network traffic.
Quick Example
fromgroupdocs.metadataimportMetadatadefquick_example():"""Read every metadata property from a document."""withMetadata("./input.docx")asmetadata:# `lambda p: True` matches every property in the fileforpropinmetadata.find_properties(lambdap:True):print(f"{prop.name} = {prop.value}")if__name__=="__main__":quick_example()
fromgroupdocs.metadataimportLicense,Metadatadefremove_all_metadata():"""Strip every property and save a clean copy (saving requires a license)."""License().set_license("./GroupDocs.Metadata.lic")withMetadata("./input.pdf")asmetadata:removed=metadata.sanitize()print(f"Removed {removed} properties")metadata.save("./clean.pdf")if__name__=="__main__":remove_all_metadata()
Where to Next
Install the package — Installation walks through PyPI and offline wheel installation for Windows, Linux, and macOS.
Run your first example — the Quick Start Guide reads and removes metadata in a few minutes.
Explore the examples — How to Run Examples clones the runnable repository and runs every documented scenario locally or in Docker.
Use it in depth — the Developer Guide covers reading, editing, removing, standards, loading, saving, and exporting.
Plug it into AI pipelines — Agents and LLM Integration explains the MCP server and the AGENTS.md shipped inside the wheel.
Technical Support
If you encounter an issue while using GroupDocs.Metadata or have a technical question, feel free to create a post in our Free Support Forum. If free support is not sufficient, you can submit a ticket to our Paid Support Helpdesk.
Was this page helpful?
Any additional feedback you'd like to share with us?
Please tell us how we can improve this page.
Thank you for your feedback!
We value your opinion. Your feedback will help us improve our documentation.