Extract Metadata from Documents Leave feedback

Extract metadata

Python

from groupdocs.parser import Parser

with Parser("./sample.pdf") as parser:
    metadata_items = parser.get_metadata()
    if metadata_items is None:
        print("Metadata extraction is not supported for this format.")
    else:
        for item in metadata_items:
            print(f"{item.name}: {item.value}")

sample.pdf

The following sample file is used in this example: sample.pdf

Steps

Create a Parser for the target document.
Call get_metadata() to receive a collection of metadata items.
Iterate through name and value pairs and process them as needed.

For deeper parsing (attachments, text, images), combine metadata extraction with other basic usage topics.

id: extract-metadata-from-documents url: parser/python-net/extract-metadata-from-documents title: Extract Metadata from Documents weight: 7 version: 25.12 description: “Extract metadata (author, title, custom properties) from PDF, Office, images, emails, and other formats using GroupDocs.Parser for Python via .NET.” productName: GroupDocs.Parser for Python via .NET hideChildren: false toc: true tags: python, parser, metadata, document-properties, v25.12

GroupDocs.Parser extracts metadata such as author, title, creation date, and custom properties from supported formats (see supported formats).

Extract metadata

Python

from groupdocs.parser import Parser

with Parser("./sample.pdf") as parser:
    metadata_items = parser.get_metadata()
    if metadata_items is None:
        print("Metadata extraction is not supported for this format.")
    else:
        for item in metadata_items:
            print(f"{item.name}: {item.value}")

sample.pdf

The following sample file is used in this example: sample.pdf

Steps

Create a Parser for the target document.
Call get_metadata() to receive a collection of metadata items.
Iterate through name and value pairs and process them as needed.

For deeper parsing (attachments, text, images), combine metadata extraction with other basic usage topics.

We value your opinion. Your feedback will help us improve our documentation.

Extract Metadata from Documents Leave feedback

On this page

Extract metadata

Steps

For deeper parsing (attachments, text, images), combine metadata extraction with other basic usage topics.

Extract metadata

Steps

Was this page helpful?

Any additional feedback you'd like to share with us?

Please tell us how we can improve this page.

Thank you for your feedback!

On this page