An Annotation Framework for Luxembourgish Sentiment Analysis

SIRAJZADE, Joshgun; GIERSCHEK, Daniela; SCHOMMER, Christoph

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

SIRAJZADE, Joshgun; GIERSCHEK, Daniela; SCHOMMER, Christoph

2020 • In Besacier, Laurent; Sakti, Sakriani; Soria, Claudia et al. (Eds.) Proceedings of the LREC 2020 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020)

Peer reviewed

Permalink
https://hdl.handle.net/10993/43136

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

AnAnnotationFrameworkforLuxembourgishSentimentAnalysis.pdf

Publisher postprint (1.18 MB)

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

Opinion Mining; Sentiment Analysis; Corpus (Creation, Annotation, etc.); Luxembourgish Language; Crowdsourcing; Time Series

Abstract :

[en] The aim of this paper is to present a framework developed for crowdsourcing sentiment annotation for the low-resource language Luxembourgish. Our tool is easily accessible through a web interface and facilitates sentence-level annotation of several annotators in parallel. In the heart of our framework is an XML database, which serves as central part linking several components. The corpus in the database consists of news articles and user comments. One of the components is LuNa, a tool for linguistic preprocessing of the data set. It tokenizes the text, splits it into sentences and assigns POS-tags to the tokens. After that, the preprocessed text is stored in XML format into the database. The Sentiment Annotation Tool, which is a browser-based tool, then enables the annotation of split sentences from the database. The Sentiment Engine, a separate module, is trained with this material in order to annotate the whole data set and analyze the sentiment of the comments over time and in relationship to the news articles. The gained knowledge can again be used to improve the sentiment classification on the one hand and on the other hand to understand the sentiment phenomenon from the linguistic point of view.

Disciplines :

Languages & linguistics
Computer science

Author, co-author :

SIRAJZADE, Joshgun ; University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)

GIERSCHEK, Daniela ; University of Luxembourg > Faculty of Language and Literature, Humanities, Arts and Education (FLSHASE) > Identités, Politiques, Sociétés, Espaces (IPSE)

SCHOMMER, Christoph ; University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)

External co-authors :

Language :

English

Title :

An Annotation Framework for Luxembourgish Sentiment Analysis

Publication date :

May 2020

Event name :

LREC 2020 Workshop Language Resources and Evaluation Conference 11–16 May 2020, 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020)

Event organizer :

European Language Resources Association (ELRA)

Event place :

Marseille, France

Event date :

from 11-05-2020 to 16-05-2020

Audience :

International

Main work title :

Proceedings of the LREC 2020 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020)

Editor :

Besacier, Laurent

Sakti, Sakriani

Soria, Claudia

Beermann, Dorothee

Publisher :

European Language Resources Association (ELRA), Paris, France

ISBN/EAN :

979-10-95546-35-1
9791095546351

Pages :

172-176

Peer reviewed :

Peer reviewed

Focus Area :

Computational Sciences

Additional URL :

https://lrec2020.lrec-conf.org/media/proceedings/Workshops/Books/SLTUCCURLbook.pdf

Available on ORBilu :

since 11 May 2020

Statistics

Number of views

382 (50 by Unilu)

Number of downloads

179 (23 by Unilu)

More statistics