Creating Structured PDF Files Using XML Templates

Hardy, Matthew and Brailsford, David and Thomas, Peter (2004) Creating Structured PDF Files Using XML Templates. In: ACM Symposium on Document Engineering (DocEng2004), 27-31 October 2004, Milwaukee, USA.

This is the latest version of this item.

[img] PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (171kB)

Abstract

This paper describes a tool for recombining the logical structure from an XML document with the typeset appearance of the corresponding PDF document. The tool uses the XML representation as a template for the insertion of the logical structure into the existing PDF document, thereby creating a Structured/Tagged PDF. The addition of logical structure adds value to the PDF in three ways: the accessibility is improved (PDF screen readers for visually impaired users perform better), media options are enhanced (the ability to reflow PDF documents, using structure as a guide, makes PDF viable for use on hand-held devices) and the re-usability of the PDF documents benefits greatly from the presence of an XML-like structure tree to guide the process of text retrieval in reading order (e.g. when interfacing to XML applications and databases).

Item Type: Conference or Workshop Item (Paper)
Additional Information: Final draft of paper accepted for ACM DocEng '04 conference
Uncontrolled Keywords: XML, PDF, Logical Structure Insertion.
Schools/Departments: University of Nottingham UK Campus > Faculty of Science > School of Computer Science
Depositing User: Brailsford, Prof David
Date Deposited: 28 Sep 2005
Last Modified: 09 Oct 2007 15:50
URI: http://eprints.nottingham.ac.uk/id/eprint/190

Available Versions of this Item

Actions (Archive Staff Only)

Edit View Edit View