Automatic extraction of dates from historical documents
Access status:
Open Access
Type
Conference paperAbstract
The essential quality of information in a digital library is accessibility. Full text search is not enough for some collections, more can be done. Historical collections, for example, contain dates and it would be useful to historians to be able to search by them. However, these ...
See moreThe essential quality of information in a digital library is accessibility. Full text search is not enough for some collections, more can be done. Historical collections, for example, contain dates and it would be useful to historians to be able to search by them. However, these dates may occur anywhere within the text of historical documents, and to be searchd they must be extracted from the documents and integrated into the collection index. Doing this manually is very expensive, and described here is a system to do it automatically. This system was implemented within the Greenstone framework used by the New Zealand Digital Library, and involved the use of some carefully designed heuristics.
See less
See moreThe essential quality of information in a digital library is accessibility. Full text search is not enough for some collections, more can be done. Historical collections, for example, contain dates and it would be useful to historians to be able to search by them. However, these dates may occur anywhere within the text of historical documents, and to be searchd they must be extracted from the documents and integrated into the collection index. Doing this manually is very expensive, and described here is a system to do it automatically. This system was implemented within the Greenstone framework used by the New Zealand Digital Library, and involved the use of some carefully designed heuristics.
See less
Date
2001-01-01Publisher
Research Institute for Humanities and Social Sciences (RIHSS), the University of Sydney.Licence
Copyright the University of SydneyCitation
Computing Arts 2001 : digital resources for research in the humanities : 26th-28th September 2001, Veterinary Science Conference Centre, the University of Sydney / hosted by the Scholarly Text and Imaging Service (SETIS), the University of Sydney Library, and the Research Institute for Humanities and Social Sciences (RIHSS), the University of SydneyShare