gApp: a text preprocessing system to improve the neural machine translation of discontinuous multiword expressions

dc.contributor.author: Hidalgo-Ternero, Carlos Manuel
dc.contributor.author: Zhou-Lian, Xiaoqing
dc.date.accessioned: 2024-02-15T09:57:06Z
dc.date.available: 2024-02-15T09:57:06Z
dc.date.issued: 2023-09
dc.identifier.isbn: 978-2-9701733-0-4
dc.identifier.uri: https://hdl.handle.net/10115/30471
dc.description.abstract: In this paper we present research results with gApp, a text-preprocessing system designed to automatically detect discontinuous multiword expressions (MWEs) and convert them into their continuous forms so as to improve the performance of current neural machine translation (NMT) systems (see Hidalgo-Ternero, 2021, 2022; Hidalgo-Ternero & Corpas Pastor, 2020, 2022a, 2022b; Hidalgo-Ternero, Lista, & Corpas Pastor, 2022; and Hidalgo-Ternero & Zhou-Lian, 2022a, 2022b). To test its effectiveness, eight experiments have been carried out with several NMT systems such as DeepL, Google Translate, ModernMT and VIP in different language directionalities (ES/FR/IT > ES/EN/DE/FR/IT/PT/ZH) for the translation of somatisms, i.e., MWEs containing lexemes referring to human or animal body parts (Mellado Blanco, 2004). More specifically, we have analysed both flexible verb-noun idiomatic constructions (VNICs) and flexible verb + prepositional phrase (VPP) constructions. In this regard, the promising results obtained for these typologies of MWEs throughout experiments 1-8 will shed some light on new avenues for enhancing MWE-aware NMT systems.
dc.language.iso: eng
dc.rights: Attribution 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.title: gApp: a text preprocessing system to improve the neural machine translation of discontinuous multiword expressions
dc.type: info:eu-repo/semantics/workingPaper
dc.type: info:eu-repo/semantics/conferenceObject
dc.rights.accessRights: info:eu-repo/semantics/openAccess


Except where otherwise noted, this item's license is described as Attribution 4.0 International.