Word classes of White Hmong: a computational approach
Field | Value | Language |
dc.contributor.author | Meng, Weijian | |
dc.date.accessioned | 2023-10-19T02:05:44Z | |
dc.date.available | 2023-10-19T02:05:44Z | |
dc.date.issued | 2016 | |
dc.identifier.uri | https://hdl.handle.net/2123/31786 | |
dc.description.abstract | This thesis employs a computational approach to study the word classes in White Hmong, a minority language of Mainland Southeast Asia. It proposes an automatic discovery procedure for word classes based on a careful review and comparison of existing algorithms. Motivated by the distributional hypothesis, which posits that similar words occur in similar environments, the procedure represents words as vectors defined by pairwise co-occurrence. It then measures their grammatical similarity in terms of spatial proximity and clusters them into a hierarchical taxonomy. The procedure is applied to an unannotated corpus of White Hmong, yielding a classification of its lexicon. The classification is evaluated against known grammatical properties of the language, demonstrating the linguistic meaningfulness of the results. | en_AU |
dc.language.iso | en | en_AU |
dc.subject | Hmong language | en_AU |
dc.subject | word class | en_AU |
dc.subject | computational method | en_AU |
dc.subject | machine learning | en_AU |
dc.title | Word classes of White Hmong: a computational approach | en_AU |
dc.type | Thesis | en_AU |
dc.identifier.doi | 10.25910/g4ta-0585 | |
dc.type.thesis | Honours | en_AU |
usyd.faculty | Faculty of Arts and Social Sciences | en_AU |
usyd.department | Linguistics | en_AU |
workflow.metadata.only | No | en_AU |
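
The procedure outlined in the abstract (co-occurrence vectors, similarity as spatial proximity, hierarchical clustering) can be illustrated with a minimal sketch. This is not the thesis's implementation: the toy English corpus, the one-word left/right context window, cosine similarity, average linkage, and all function names are assumptions chosen for illustration only.

```python
from collections import Counter, defaultdict
from math import sqrt


def cooccurrence_vectors(sentences):
    """Map each word to counts of its immediate left and right neighbours."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        # Pad with boundary markers so edge words also get two contexts.
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for i in range(1, len(tokens) - 1):
            vectors[tokens[i]][("L", tokens[i - 1])] += 1
            vectors[tokens[i]][("R", tokens[i + 1])] += 1
    return dict(vectors)


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * v[key] for key, count in u.items() if key in v)
    norm_u = sqrt(sum(c * c for c in u.values()))
    norm_v = sqrt(sum(c * c for c in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def cluster(vectors):
    """Average-linkage agglomerative clustering; returns a nested-tuple tree."""
    # Each cluster is (subtree, list of member words).
    clusters = [(word, [word]) for word in sorted(vectors)]

    def linkage(a, b):
        sims = [cosine(vectors[x], vectors[y]) for x in a[1] for y in b[1]]
        return sum(sims) / len(sims)

    while len(clusters) > 1:
        # Pick the most similar pair of clusters and merge it.
        i, j = max(
            ((i, j) for i in range(len(clusters))
             for j in range(i + 1, len(clusters))),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        merged = ((clusters[i][0], clusters[j][0]),
                  clusters[i][1] + clusters[j][1])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters[0][0]


# Hypothetical toy corpus (English, not White Hmong data).
corpus = ["the dog runs", "the cat runs", "a dog sleeps", "a cat sleeps"]
vecs = cooccurrence_vectors(corpus)
tree = cluster(vecs)
print(tree)
```

On this toy corpus, words that share the same neighbours ("dog" and "cat", "the" and "a", "runs" and "sleeps") are merged first, which is the distributional intuition the abstract describes. A real corpus would call for wider contexts, frequency weighting, and a more efficient clustering routine.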