Reference : Corpus of long-term instant messaging based dialogues between advanced learners of Ge...
Computer developments : Other
Engineering, computing & technology : Computer science
http://hdl.handle.net/10993/20579
Corpus of long-term instant messaging based dialogues between advanced learners of German as a foreign language and German native speakers: deL1L2IM
German
Höhn, Sviatlana mailto [University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC) >]
20-Mar-2015
ELRA
[en] Chat ; Corrective Feedback ; Learner Language
[en] The deL1L2IM corpus, created between May and August 2012 and last updated in August 2014, has been collected within the framework of a PhD project on the development of a learning method implying conversations with an artificial companion. This PhD work is presented as a qualitative investigation of instant messaging dialogues on a long-term basis (four months) between advanced learners of German and German native speakers, chatting about whatever topic they wish. The dataset is composed of 72 dialogues, each of them having a duration of 20 to 45 minutes. The whole corpus contains ca. 52,000 words and 4,800 messages and has a file size of 0.5 Mb. Nine pairs of participants – i.e. nine learners and four native speakers – were required, with 8 dialogues per pair. The interactions have undergone linguistic analysis whereby the annotation will be performed only on repair/correction sequences (incomplete learner error annotation). The goal of the project was to create an application for language modelling and to improve learner language applications, tutoring software and dialogue systems. The corpus is delivered in one written text file (in XML format, customized under TEI P5).
Researchers ; Professionals ; Students
http://hdl.handle.net/10993/20579
http://islrn.org/resources/339-799-085-669-8/
1.0

There is no file associated with this reference.

Bookmark and Share SFX Query

All documents in ORBilu are protected by a user license.