Automated conversion of Web-based marriage register data into a printed format with predefined layout

Brailsford, David F. (2011) Automated conversion of Web-based marriage register data into a printed format with predefined layout. In: ACM Symposium on Document Engineering (DocEng '11), 19-22 Sept 2011, Mountain View, California, USA.

The Phillimore Marriage Registers for England were published between 1896 and 1922 and defined a standard layout format for the typesetting of marriage data. However, not all English parish churches had their marriage registers analysed and printed by the Phillimore organisation within this period.

This paper tells the story of Wirksworth, a town in Derbyshire with a large church, licensed for marriages, whose marriage data was nevertheless not released to the Phillimore organisation. Hence there is no printed Phillimore Marriages volume for Wirksworth. In recent years, however, a Wirksworth web site created by John Palmer has become famous as probably the most comprehensive record of a parish’s activities anywhere on the Web.

Within a total of 120 MB of data on the web site, covering events in Wirksworth from medieval times to the present, is a set of data recording births, marriages and deaths transcribed from the original hand-written church register volumes.

The work described here covers the software tools and techniques used to create a set of awk scripts that extract all the marriage records from the Wirksworth web site data. The extracted material was then automatically re-processed, typeset and indexed to form an entirely new Phillimore-style volume for Wirksworth marriages.

Item Type: Conference or Workshop Item (Paper)
Additional Information: Published in: DocEng '11: proceedings of the 11th ACM Symposium on Document Engineering. New York : ACM, 2011, ISBN: 978-1-4503-0863-2. pp. 61-64, doi: 10.1145/2034691.2034704
Keywords: Re-typesetting, Web-to-Print, troff, genealogy, hyperlinking, indexing
Schools/Departments: University of Nottingham UK Campus > Faculty of Science > School of Computer Science
Depositing User: Brailsford, Prof David
Date Deposited: 24 Feb 2015 12:06
Last Modified: 16 Sep 2016 03:56
