Contribution to collective works (Parts of books)
Exploring Corpus Linguistics Approaches in Linguistic Landscape Research with Automatic Text Recognition Software
GILLES, Peter; Ziegler, Evelyn
2021. In Ziegler, Evelyn; Marten, Heiko F. (Eds.), Linguistic Landscapes im deutschsprachigen Kontext
Peer reviewed


Keywords :
text recognition; corpus linguistics
Abstract :
[en] Taking a more quantitative approach to linguistic landscape research, we explore recent techniques for automatically extracting information from images. Google's recently released Cloud Vision API offers new perspectives on the software-assisted processing and classification of pictures. Its software interface makes it possible to extract various kinds of information from pictures automatically, among them the written text, labels describing the picture (e.g. road sign, shop sign, prohibition sign), and the colours used in the picture. Applying this technique to large-scale image collections will not only enhance analysis but may also reveal hitherto unrecognized structures. The data come from a large-scale investigation of the Ruhr Metropolis in Germany, in which 25,504 photos were taken to document the linguistic landscape of selected neighbourhoods in four cities (Ziegler et al. 2018). These data were annotated manually in various categories to analyze the occurrence, form and function of visual multilingualism. The pictures are then processed automatically by the Cloud Vision API, and the results are compared with the manual annotation. We show that the quality of the image recognition depends greatly on the quality of the picture. The textual information extracted from the pictures is stored in a database. Rather than presenting results on the linguistic landscape, this chapter is predominantly concerned with practical tools to facilitate large-scale linguistic landscape research.
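The workflow outlined in the abstract (request text, labels and colours for a photo, then store the result in a database) might look roughly like the following Python sketch. It is an illustration only, not the authors' implementation. The feature names and response fields follow the public Cloud Vision REST interface (`images:annotate`), while the table name `ocr`, the photo identifier, the helper function names and the sample response are hypothetical. No network call is made; sending the request body to the API with valid credentials is left out.

```python
import base64
import json
import sqlite3

# Cloud Vision feature types for text, labels and colour properties.
FEATURES = ("TEXT_DETECTION", "LABEL_DETECTION", "IMAGE_PROPERTIES")


def build_request(image_bytes: bytes) -> dict:
    """Build the JSON body for a POST to the images:annotate endpoint."""
    return {
        "requests": [{
            "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
            "features": [{"type": name} for name in FEATURES],
        }]
    }


def parse_response(response: dict) -> dict:
    """Extract full text, label names and dominant colours for one image."""
    result = response["responses"][0]
    texts = result.get("textAnnotations", [])
    labels = result.get("labelAnnotations", [])
    colors = (result.get("imagePropertiesAnnotation", {})
              .get("dominantColors", {})
              .get("colors", []))
    return {
        # The first text annotation holds the complete recognized text.
        "text": texts[0]["description"] if texts else "",
        "labels": [label["description"] for label in labels],
        # Channels with value 0 may be omitted in the JSON, hence the default.
        "colors": [
            (c["color"].get("red", 0), c["color"].get("green", 0), c["color"].get("blue", 0))
            for c in colors
        ],
    }


def store(conn: sqlite3.Connection, photo_id: str, parsed: dict) -> None:
    """Save one parsed result; the table layout is an assumption."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS ocr "
        "(photo_id TEXT PRIMARY KEY, text TEXT, labels TEXT, colors TEXT)"
    )
    conn.execute(
        "INSERT OR REPLACE INTO ocr VALUES (?, ?, ?, ?)",
        (photo_id, parsed["text"], json.dumps(parsed["labels"]), json.dumps(parsed["colors"])),
    )
    conn.commit()


# Hand-written sample response, standing in for a real API reply.
sample = {"responses": [{
    "textAnnotations": [{"description": "Apotheke"}],
    "labelAnnotations": [{"description": "Signage"}],
    "imagePropertiesAnnotation": {"dominantColors": {"colors": [
        {"color": {"red": 200, "green": 16}}
    ]}},
}]}

conn = sqlite3.connect(":memory:")
store(conn, "photo_0001", parse_response(sample))
```

Keeping the recognized text alongside a photo identifier would make it possible to join the automatic output with the manual annotation for comparison, as the abstract describes.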
Disciplines :
Languages & linguistics
Author, co-author :
GILLES, Peter; University of Luxembourg > Faculty of Humanities, Education and Social Sciences (FHSE) > Department of Humanities (DHUM)
Ziegler, Evelyn
External co-authors :
Language :
English
Title :
Exploring Corpus Linguistics Approaches in Linguistic Landscape Research with Automatic Text Recognition Software
Publication date :
2021
Main work title :
Linguistic Landscapes im deutschsprachigen Kontext
Editor :
Ziegler, Evelyn
Marten, Heiko F.
Publisher :
Peter Lang D, Frankfurt
ISBN :
978-3-631-84069-6; 978-3-631-84068-9; 978-3-631-84070-2; 978-3-631-79110-3
Pages :
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
Available on ORBilu :
since 21 May 2021

