Attribution of Quoted Speech in Portuguese Text

Research output: Chapter in Book/Report/Conference proceeding; Article in proceedings; Research; peer-review

Abstract

This paper describes and evaluates a rule-based system implementing a novel method for quote attribution in Portuguese text, working on top of a Constraint-Grammar parse. Both direct and indirect speech are covered, as well as certain other text-embedded quote sources. In a first step, the system performs quote segmentation and identifies speech verbs, taking into account the different styles used in literature and news text. Speakers are then identified using syntactically and semantically grounded Constraint-Grammar rules. We rely on relational links and stream variables to handle anaphoric mentions and to recover the names of implied or underspecified speakers. In an evaluation including both literature and news text, the system performed well on both the segmentation and attribution tasks, achieving F-scores of 98-99% for the former and 89-94% for the latter.
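
As a rough illustration of the two-step idea in the abstract (segment the quote, then link it to a speaker through a speech verb), the toy Python sketch below uses a regular expression instead of Constraint Grammar. It is not the authors' system: the verb list, the pattern, and the function name `attribute_quotes` are assumptions made purely for illustration, and the sketch handles only double-quoted direct speech followed by a speech verb and a capitalised name.

```python
import re

# Hypothetical, very small speech-verb list; the lexicon used by the
# real system is not given in the abstract.
SPEECH_VERBS = ("disse", "afirmou", "perguntou", "respondeu")

# Assumed pattern: a double-quoted span, an optional comma, a speech verb,
# an optional article, and one or more capitalised name tokens.
CAP = r"[A-ZÁÉÍÓÚÂÊÔÃÕÇ]\w+"
PATTERN = re.compile(
    r'"(?P<quote>[^"]+)"\s*,?\s*'
    r"(?P<verb>" + "|".join(SPEECH_VERBS) + r")\s+"
    r"(?:o |a )?"
    r"(?P<speaker>" + CAP + r"(?: " + CAP + r")*)"
)


def attribute_quotes(text):
    """Return (quote, speech_verb, speaker) triples found in text."""
    return [
        (m.group("quote"), m.group("verb"), m.group("speaker"))
        for m in PATTERN.finditer(text)
    ]


if __name__ == "__main__":
    sample = '"Vamos embora", disse Maria. "Para onde?", perguntou o João Silva.'
    for quote, verb, speaker in attribute_quotes(sample):
        print(f"{speaker} ({verb}): {quote}")
```

A surface pattern like this cannot resolve pronouns or implied speakers, which is the part the abstract says is handled with relational links and stream variables over a full parse.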
Original language: English
Title of host publication: Proceedings of CG-MTA 2023: Constraint Grammar Workshop at NoDaLiDa 2023, Thórshavn
Publisher: Association for Computational Linguistics (ACL)
Publication date: 2023
Pages: 1-9
Publication status: Published - 2023
