An automatic model and Gold Standard for translation alignment of Ancient Greek.

Tariq Yousef, Chiara Palladino, Farnoosh Shamsian, Anise d’Orange Ferreira, Michel Ferreira dos Reis

Publication: Chapter in book/report/conference proceeding; Conference contribution in proceedings; Research; Peer-reviewed

Abstract

This paper illustrates a workflow for developing and evaluating automatic translation alignment models for Ancient Greek. We designed an annotation Style Guide and a gold standard for the alignment of Ancient Greek-English and Ancient Greek-Portuguese, measured inter-annotator agreement and used the resulting dataset to evaluate the performance of various translation alignment models. We proposed a fine-tuning strategy that employs unsupervised training with mono- and bilingual texts and supervised training using manually aligned sentences. The results indicate that the fine-tuned model based on XLM-Roberta is superior in performance, and it achieved good results on language pairs that were not part of the training data.
Original language: English
Title: Proceedings of the Thirteenth Language Resources and Evaluation Conference
Editors: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Jan Odijk, Stelios Piperidis
Publisher: European Language Resources Association
Publication date: June 2022
Pages: 5894-5905
ISBN (Electronic): 9791095546726
Status: Published - June 2022
Published externally: Yes
Event: 13th Conference on Language Resources and Evaluation - Palais du Pharo, Marseille, France
Duration: 20 June 2022 to 25 June 2022

Conference

Conference: 13th Conference on Language Resources and Evaluation
Location: Palais du Pharo
Country/Territory: France
City: Marseille
Period: 20/06/2022 to 25/06/2022
