Universal Dependencies for Malayalam
Publication date
2023
Published in
The Prague Bulletin of Mathematical Linguistics
Volume / Issue
Not specified (120)
ISBN / ISSN
ISSN: 0032-6585
eISSN: 1804-0462
This publication has a published version with DOI 10.14712/00326585.026
Abstract
Treebanks can play a crucial role in developing natural language processing systems, and producing gold-standard treebank data requires a uniform annotation framework. Universal Dependencies (UD) aims to develop cross-linguistically consistent annotations for the world's languages. This paper presents the key aspects of the UD-based syntactically annotated treebank for Malayalam. Sentences extracted from the IndicCorp corpus were manually annotated for morphological features and dependency relations. Language-specific properties are discussed, shedding light on several areas of Dravidian syntax that need to be examined in depth. The paper also discusses some pertinent issues in UD with respect to the Dravidian languages and provides insights for further improvements to the existing treebanks.
Keywords
Universal Dependencies, Malayalam
Permanent link
https://hdl.handle.net/20.500.14178/2314
License
Full text of this result is licensed under: Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported