Show simple item record

FieldValueLanguage
dc.contributor.authorHansen, Silvia
dc.contributor.authorTeich, Elke
dc.date.accessioned2010-06-08
dc.date.available2010-06-08
dc.date.issued2001-01-01
dc.identifier.citationComputing Arts 2001 : digital resources for research in the humanities : 26th-28th September 2001, Veterinary Science Conference Centre, the University of Sydney / hosted by the Scholarly Text and Imaging Service (SETIS), the University of Sydney Library, and the Research Institute for Humanities and Social Sciences (RIHSS), the University of Sydneyen_AU
dc.identifier.urihttp://hdl.handle.net/2123/6217
dc.description.abstractIn the proposed talk we discuss the application of a set of computational text analysis techniques for the analysis of the linguistic features of translations. The goal of this analysis is to test two hypotheses about the specific properties of translations: Baker's hypothesis of normalization (Baker, 1995) and Toury's law of interference (Toury, 1995). The corpus we analyze consists of English and German original texts and translations of those texts into German and English, respectively. The analysis task is complex in a number of respects. First, a multi-level analysis (clause, phrases, words) has to be carried out; second, among the linguistic features selected for analysis are some rather abstract ones, ranging from functional-grammatical features, e.g., Subject, Adverbial of Time, etc, to semantic features, e.g., semantic roles, such as Agent, Goal, Locative, etc.; third, monolingual and contrastive analyses are involved. This places certain requirements on the computational techniques to be employed both regarding corpus encoding, linguistic annotation and information extraction. We show how a combination of commonly available techniques can fulfill these requirements to a large degree and point out their limitations for application to the research questions raised. These techniques range from document encoding (TEI, XML) over automatic corpus annotation (notably part-of-speech tagging; Brants, 2000) and semi-automatic annotation (O'Donnell, 1995) to query systems as implemented in e.g., the IMS Corpus Workbench (Christ, 1994), the MATE system (Mengel & Lezius, 2000) and the Gsearch system (Keller et al., 1999).en_AU
dc.description.sponsorshipHosted by the Scholarly Text and Imaging Service (SETIS), the University of Sydney Library, and the Research Institute for Humanities and Social Sciences (RIHSS), the University of Sydney.en_AU
dc.language.isoenen_AU
dc.publisherResearch Institute for Humanities and Social Sciences (RIHSS), the University of Sydney.en_AU
dc.rightsCopyright the University of Sydneyen
dc.subjectHumanities Computingen_AU
dc.subjectComputational text analysisen_AU
dc.titleTowards an integrated representation of multiple layers of linguistic annotation in multilingual corporaen_AU
dc.typeConference paperen_AU


Show simple item record

Associated file/s

Associated collections

Show simple item record

There are no previous versions of the item available.