Logo of Humboldt-Universität zu BerlinLogo of Humboldt-Universität zu Berlin
edoc-Server
Open-Access-Publikationsserver der Humboldt-Universität
de|en
Header image: facade of Humboldt-Universität zu Berlin
View Item 
  • edoc-Server Home
  • Artikel und Monographien
  • Zweitveröffentlichungen
  • View Item
  • edoc-Server Home
  • Artikel und Monographien
  • Zweitveröffentlichungen
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.
All of edoc-ServerCommunity & CollectionTitleAuthorSubjectThis CollectionTitleAuthorSubject
PublishLoginRegisterHelp
StatisticsView Usage Statistics
All of edoc-ServerCommunity & CollectionTitleAuthorSubjectThis CollectionTitleAuthorSubject
PublishLoginRegisterHelp
StatisticsView Usage Statistics
View Item 
  • edoc-Server Home
  • Artikel und Monographien
  • Zweitveröffentlichungen
  • View Item
  • edoc-Server Home
  • Artikel und Monographien
  • Zweitveröffentlichungen
  • View Item
2022-06-09Zeitschriftenartikel DOI: 10.3389/frai.2022.868249
“The Rodney Dangerfield of Stylistic Devices”: End-to-End Detection and Extraction of Vossian Antonomasia Using Neural Networks
Schwab, Michel cc
Jäschke, Robert cc
Fischer, Frank cc
Philosophische Fakultät
Vossian Antonomasia (VA) is a well-known stylistic device based on attributing a certain property to a person by relating them to another person who is famous for this property. Although the morphological and semantic characteristics of this phenomenon have long been the subject of linguistic research, little is known about its distribution. In this paper, we describe end-to-end approaches for detecting and extracting VA expressions from large news corpora in order to study VA more broadly. We present two types of approaches: binary sentence classifiers that detect whether or not a sentence contains a VA expression, and sequence tagging of all parts of a VA on the word level, enabling their extraction. All models are based on neural networks and outperform previous approaches, best results are obtained with a fine-tuned BERT model. Furthermore, we study the impact of training data size and class imbalance by adding negative (and possibly noisy) instances to the training data. We also evaluate the models' performance on out-of-corpus and real-world data and analyze the ability of the sequence tagging model to generalize in terms of new entity types and syntactic patterns.
Files in this item
Thumbnail
frai-05-868249.pdf — Adobe PDF — 1.226 Mb
MD5: 73b3474de60ffa7cc3a30cc4fd4896c0
Cite
BibTeX
EndNote
RIS
(CC BY 4.0) Attribution 4.0 International(CC BY 4.0) Attribution 4.0 International
Details
DINI-Zertifikat 2019OpenAIRE validatedORCID Consortium
Imprint Policy Contact Data Privacy Statement
A service of University Library and Computer and Media Service
© Humboldt-Universität zu Berlin
 
DOI
10.3389/frai.2022.868249
Permanent URL
https://doi.org/10.3389/frai.2022.868249
HTML
<a href="https://doi.org/10.3389/frai.2022.868249">https://doi.org/10.3389/frai.2022.868249</a>