Reference : The LuNa Open Toolbox for the Luxembourgish Language
Scientific congresses, symposiums and conference proceedings : Paper published in a book
Engineering, computing & technology : Computer science
Arts & humanities : Languages & linguistics
Computational Sciences
http://hdl.handle.net/10993/40407
The LuNa Open Toolbox for the Luxembourgish Language
English
Sirajzade, Joshgun mailto [University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC) >]
Schommer, Christoph mailto [University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC) >]
2019
Advances in Data Mining, Applications and Theoretical Aspects, Poster Proceedings 2019
Perner, Petra
ibai publishing
No
International
978-3-942952-61-3
Leipzig
Germany
19th Industrial Conference on Data Mining, ICDM 2019
from 17-07-2019 to 21-07-2019
New York
USA
[en] Luxembourgish language ; POS-Tagging ; Topic Modeling ; Sentiment Analysis ; Text Preparation ; XML-Database
[en] Despite some recent work, the ongoing research for the processing of Luxembourgish is still largely in its infancy. While a rich variety of linguistic processing tools exist, especially for English, these software tools offer little scope for the Luxembourgish language. LuNa (a Tool for Luxembourgish National Corpus) is an Open Toolbox that allows researchers to annotate a text corpus written in Luxembourgish language and to build/query an annotated corpus. The aim of the paper is to demonstrate the components of the system and its usage for Machine Learning applications like Topic Modelling and Sentiment Detection. Overall, LuNa bases on a XML-database to store the data and to define the XML scheme, it offers a Graphical User Interface (GUI) for a linguistic data preparation such as tokenization, Part-Of-Speech tagging, and morphological analysis -- just to name a few.
Researchers
http://hdl.handle.net/10993/40407

File(s) associated to this reference

Fulltext file(s):

FileCommentaryVersionSizeAccess
Open access
CRC_industrial_paper_84.pdfAuthor preprint1.37 MBView/Open

Bookmark and Share SFX Query

All documents in ORBilu are protected by a user license.