
| Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Radford, William Edward John | |
| dc.date.accessioned | 2015-03-12 | |
| dc.date.available | 2015-03-12 | |
| dc.date.issued | 2014-01-08 | |
| dc.identifier.uri | http://hdl.handle.net/2123/12850 | |
| dc.description.abstract | Natural language is fraught with problems of ambiguity, including name reference. A name in text can refer to multiple entities just as an entity can be known by different names. This thesis examines how a mention in text can be linked to an external knowledge base (KB), in our case, Wikipedia. The named entity linking (NEL) task requires systems to identify the KB entry, or Wikipedia article, that a mention refers to; or, if the KB does not contain the correct entry, return NIL. Entity linking systems can be complex and we present a framework for analysing their different components, which we use to analyse three seminal systems which are evaluated on a common dataset and we show the importance of precise search for linking. The Text Analysis Conference (TAC) is a major venue for NEL research. We report on our submissions to the entity linking shared task in 2010, 2011 and 2012. The information required to disambiguate entities is often found in the text, close to the mention. We explore apposition, a common way for authors to provide information about entities. We model syntactic and semantic restrictions with a joint model that achieves state-of-the-art apposition extraction performance. We generalise from apposition to examine local descriptions specified close to the mention. We add local description to our state-of-the-art linker by using patterns to extract the descriptions and matching against this restricted context. Not only does this make for a more precise match, we are also able to model failure to match. Local descriptions help disambiguate entities, further improving our state-of-the-art linker. The work in this thesis seeks to link textual entity mentions to knowledge bases. Linking is important for any task where external world knowledge is used and resolving ambiguity is fundamental to advancing research into these problems. | en |
| dc.rights | The author retains copyright of this thesis | |
| dc.subject | Natural language processing | en |
| dc.subject | Named entity linking | en |
| dc.subject | Apposition | en |
| dc.subject | Wikipedia | en |
| dc.title | Linking named entities to Wikipedia | en |
| dc.type | Thesis | en |
| dc.date.valid | 2015-01-01 | en |
| dc.type.thesis | Doctor of Philosophy | en |
| usyd.faculty | Faculty of Engineering and Information Technologies, School of Information Technologies | en |
| usyd.degree | Doctor of Philosophy Ph.D. | en |
| usyd.awardinginst | The University of Sydney | en |



There are no previous versions of the item available.