Render to HTML
When rendering to HTML, GroupDocs.Viewer renders each page of the source document as a separate HTML document.
GroupDocs.Viewer for Python provides two options to manage CSS, fonts, images, and other resources:
- HTML with external resources stores page resources as separate files. This allows reusing common resources and results in reducing page size and loading speed.
- HTML with embedded resources integrates page resources into HTML. This makes each document page self-sufficient but results in increasing page size and loading speed.
To render files to HTML, follow these steps:
- Create an instance of the Viewer class. Specify the source document path as a constructor parameter.
- Instantiate the HtmlViewOptions object. Specify a path to save the rendered pages.
- Call the View.view() method of the Viewer object. Specify the HtmlViewOptions object as the parameter.
The following code snippet shows how to render a .docx document to HTML with external resources:
# Create an HTML file for each page.
# Specify the HTML file names and location of external resources.
# {0} and {1} are replaced with the current page number and resource name, respectively.
with gv.Viewer("sample.docx") as viewer:
html_options = gvo.HtmlViewOptions.for_external_resources("page_{0}.html", "page_{0}/resource_{0}_{1}", "page_{0}/resource_{0}_{1}")
viewer.view(html_options)
The following code snippet shows how to render a .docx document to HTML with embedded resources:
# Create an HTML files.
# {0} is replaced with the current page number in the file name.
with gv.Viewer(test_files.sample_docx) as viewer:
html_options = gvo.HtmlViewOptions.for_embedded_resources("page_{0}.html")
viewer.view(html_options)
print(f"\nSource document rendered successfully.\nCheck output in {output_directory}.")
For details, please refer to the following pages: