Creating Structured PDF Files Using XML Templates

Hardy, Matthew and Brailsford, David and Thomas, Peter (2004) Creating Structured PDF Files Using XML Templates. In: ACM Symposium on Document Engineering (DocEng2004), 27-31 October 2004, Milwaukee, USA.

This is the latest version of this item.

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader


This paper describes a tool for recombining the logical structure from an XML document with the typeset appearance of the corresponding PDF document. The tool uses the XML representation as a template for the insertion of the logical structure into the existing PDF document, thereby creating a Structured/Tagged PDF. The addition of logical structure adds value to the PDF in three ways: the accessibility is improved (PDF screen readers for visually impaired users perform better), media options are enhanced (the ability to reflow PDF documents, using structure as a guide, makes PDF viable for use on hand-held devices) and the re-usability of the PDF documents benefits greatly from the presence of an XML-like structure tree to guide the process of text retrieval in reading order (e.g. when interfacing to XML applications and databases).

Item Type:Conference or Workshop Item (Paper)
Additional Information:Final draft of paper accepted for ACM DocEng '04 conference
Uncontrolled Keywords:XML, PDF, Logical Structure Insertion.
Schools/Departments:Faculty of Science > School of Computer Science and Information Technology
ID Code:190
Deposited By:Brailsford, David
Deposited On:28 Sep 2005
Last Modified:09 Oct 2007 16:50

Available Versions of this Item

Repository Staff Only: item control page