Article (Scientific journals)
impresso Text Reuse at Scale. An interface for the exploration of text reuse data in semantically enriched historical newspapers.
DURING, Marten; Romanello, Matteo; Ehrmann, Maud et al.
2023In Frontiers in Big Data, 6, p. 1249469
Peer Reviewed verified by ORBi
 

Files


Full Text
fdata-06-1249469.pdf
Author postprint (3.83 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
comparison; data visualization; historical newspapers; impresso; scalable reading; semantic enrichment; text reuse; user tasks; Computer Science (miscellaneous); Information Systems; Artificial Intelligence
Abstract :
[en] Text Reuse reveals meaningful reiterations of text in large corpora. Humanities researchers use text reuse to study, e.g., the posterior reception of influential texts or to reveal evolving publication practices of historical media. This research is often supported by interactive visualizations which highlight relations and differences between text segments. In this paper, we build on earlier work in this domain. We present impresso Text Reuse at Scale, the to our knowledge first interface which integrates text reuse data with other forms of semantic enrichment to enable a versatile and scalable exploration of intertextual relations in historical newspaper corpora. The Text Reuse at Scale interface was developed as part of the impresso project and combines powerful search and filter operations with close and distant reading perspectives. We integrate text reuse data with enrichments derived from topic modeling, named entity recognition and classification, language and document type detection as well as a rich set of newspaper metadata. We report on historical research objectives and common user tasks for the analysis of historical text reuse data and present the prototype interface together with the results of a user evaluation.
Disciplines :
History
Author, co-author :
DURING, Marten   ;  University of Luxembourg > Luxembourg Centre for Contemporary and Digital History (C2DH) > Digital History and Historiography ; Digital History & Historiography, Luxembourg Centre for Contemporary and Digital History, Esch-sur-Alzette, Luxembourg
Romanello, Matteo;  Institute of Archeology and Classical Studies (ASA), University of Lausanne, Lausanne, Switzerland
Ehrmann, Maud;  DHLAB, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
Beelen, Kaspar;  Digital Humanities Research Hub, School of Advanced Study, University of London, London, United Kingdom
GUIDO, Daniele ;  University of Luxembourg > Luxembourg Centre for Contemporary and Digital History (C2DH) > Digital Infrastructure ; Digital Research Infrastructure, Luxembourg Centre for Contemporary and Digital History, Esch-sur-Alzette, Luxembourg
Deseure, Brecht;  Royal Library of Belgium, Brussels, Belgium
BUNOUT, Estelle  ;  University of Luxembourg > Luxembourg Centre for Contemporary and Digital History (C2DH) > Contemporary History of Luxembourg ; Contemporary History of Luxembourg, Luxembourg Centre for Contemporary and Digital History, Esch-sur-Alzette, Luxembourg
Keck, Jana;  German Historical Institute Washington, Washington, DC, United States
APOSTOLOPOULOS, Petros ;  University of Luxembourg > Luxembourg Centre for Contemporary and Digital History (C2DH) > Digital History and Historiography ; Digital History & Historiography, Luxembourg Centre for Contemporary and Digital History, Esch-sur-Alzette, Luxembourg
 These authors have contributed equally to this work.
External co-authors :
yes
Language :
English
Title :
impresso Text Reuse at Scale. An interface for the exploration of text reuse data in semantically enriched historical newspapers.
Publication date :
2023
Journal title :
Frontiers in Big Data
eISSN :
2624-909X
Publisher :
Frontiers Media SA, Switzerland
Volume :
6
Pages :
1249469
Peer reviewed :
Peer Reviewed verified by ORBi
Name of the research project :
U-AGR-7251 - INTER/SNF/22/17498891/IMPRESSO2 (01/09/2023 - 28/02/2027) - DURING Marten
Funders :
SNF - Schweizerischer Nationalfonds zur Förderung der wissenschaftlichen Forschung [CH]
Funding number :
ID CR- SII5_173719
Funding text :
The workshop was funded by the Luxembourg Center for Contemporary and Digital History (CDH). This work is building on the research project impresso–Media Monitoring of the Past funded by the Swiss National Science Foundation (SNSF) under grant ID CR- SII5_173719. 2
Available on ORBilu :
since 05 January 2024

Statistics


Number of views
12 (0 by Unilu)
Number of downloads
6 (0 by Unilu)

Scopus citations®
 
1
Scopus citations®
without self-citations
0
WoS citations
 
1

Bibliography


Similar publications



Contact ORBilu