Abstract
This paper presents a study on the automatic classification of default and nondefault codings for aspect-marked verbs in six Slavic and one Baltic language. As classifier a Support Vector Machine (SVM) and as verbal features Shannon Information (SI) and Average Information Content (IC) have been utilised. In all languages high accuracy of the classification has been achieved. In addition, we found indications for the validity of the Uniform Information Density principle within SI and IC.
Original language | English |
---|---|
Journal | CEUR Workshop Proceedings |
Volume | 2521 |
ISSN | 1613-0073 |
Publication status | Published - 2019 |
Event | 3rd Workshop on Natural Language for Artificial Intelligence, NL4AI 2019 - Rende, Italy Duration: 19. Nov 2019 → 22. Nov 2019 |
Conference
Conference | 3rd Workshop on Natural Language for Artificial Intelligence, NL4AI 2019 |
---|---|
Country/Territory | Italy |
City | Rende |
Period | 19/11/2019 → 22/11/2019 |
Bibliographical note
Funding Information:* Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundatino) project number: 357550571.
Publisher Copyright:
Copyright © 2019 for this paper by its authors.
Keywords
- Coding
- Information content
- Verb aspect