Word classes of White Hmong: a computational approach

Meng, Weijian

Access status: Open Access

Field	Value	Language
dc.contributor.author	Meng, Weijian
dc.date.accessioned	2023-10-19T02:05:44Z
dc.date.available	2023-10-19T02:05:44Z
dc.date.issued	2016
dc.identifier.uri	https://hdl.handle.net/2123/31786
dc.description.abstract	This thesis employs a computational approach to study the word classes in White Hmong, a minority language of Mainland Southeast Asia. It proposes an automatic discovery procedure for word classes based on a careful review and comparison of existing algorithms. Motivated by the distributional hypothesis, which posits that similar words occur in similar environments, the procedure represents words as vectors defined by pairwise co-occurrence. It then measures their grammatical similarity in terms of spatial proximity and clusters them into a hierarchical taxonomy. The procedure is applied to an unannotated corpus of White Hmong, yielding a classification of its lexicon. The classification is evaluated against known grammatical properties of the language, demonstrating the linguistic meaningfulness of the results.	en
dc.language.iso	en	en
dc.rights	Other	en
dc.subject	Hmong language	en
dc.subject	word class	en
dc.subject	computational method	en
dc.subject	machine learning	en
dc.title	Word classes of White Hmong: a computational approach	en
dc.type	Thesis	en
dc.identifier.doi	10.25910/g4ta-0585
dc.type.thesis	Honours	en
dc.rights.other	The author retains copyright of this thesis. It may only be used for the purposes of research and study. It must not be used for any other purposes and may not be transmitted or shared with others without prior permission.	en
usyd.faculty	Faculty of Arts and Social Sciences	en
usyd.department	Linguistics	en
workflow.metadata.only	No	en
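
The procedure outlined in the abstract (words represented as co-occurrence vectors, similarity measured as spatial proximity, words grouped into a hierarchical taxonomy) can be illustrated with a minimal sketch. This is not the thesis's implementation: the function names, the one-word context window, cosine similarity, average linkage, and the toy English corpus are all assumptions chosen for illustration, since the abstract does not specify them.

```python
from collections import Counter, defaultdict
from math import sqrt


def cooccurrence_vectors(sentences, window=1):
    """Map each word to a Counter of (offset, neighbour) context features.

    The window size and the position-sensitive features are assumptions.
    """
    vectors = defaultdict(Counter)
    for sent in sentences:
        for i, word in enumerate(sent):
            for off in range(-window, window + 1):
                j = i + off
                if off != 0 and 0 <= j < len(sent):
                    vectors[word][(off, sent[j])] += 1
    return dict(vectors)


def cosine(u, v):
    """Cosine similarity between two sparse count vectors (assumed measure)."""
    dot = sum(c * v[k] for k, c in u.items() if k in v)
    norm = sqrt(sum(c * c for c in u.values())) * sqrt(sum(c * c for c in v.values()))
    return dot / norm if norm else 0.0


def cluster(vectors):
    """Average-linkage agglomerative clustering (assumed linkage).

    Returns the taxonomy as nested tuples of words.
    """
    # Each cluster is a pair: (tree built so far, flat list of member words).
    clusters = [(w, [w]) for w in sorted(vectors)]

    def sim(a, b):
        pairs = [(x, y) for x in a[1] for y in b[1]]
        return sum(cosine(vectors[x], vectors[y]) for x, y in pairs) / len(pairs)

    while len(clusters) > 1:
        # Merge the two clusters whose members are most similar on average.
        i, j = max(
            ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))),
            key=lambda ij: sim(clusters[ij[0]], clusters[ij[1]]),
        )
        merged = ((clusters[i][0], clusters[j][0]), clusters[i][1] + clusters[j][1])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters[0][0]


# Toy English corpus standing in for an unannotated corpus (hypothetical data).
sentences = [
    ["the", "dog", "runs"],
    ["the", "cat", "runs"],
    ["the", "dog", "sleeps"],
    ["the", "cat", "sleeps"],
]
vectors = cooccurrence_vectors(sentences)
tree = cluster(vectors)
print(tree)
```

In this toy corpus "dog" and "cat" share identical contexts, as do "runs" and "sleeps", so each pair is merged before anything else. That is the distributional hypothesis at its simplest. A real corpus would call for design decisions (frequency thresholds, weighting, choice of distance and linkage) that this sketch does not attempt to reproduce.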