DSCo: A Language Modeling Approach for Time Series Classification

LI, Daoyuan; LI, Li; BISSYANDE, Tegawendé François D Assise; KLEIN, Jacques; LE TRAON, Yves

Download

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

DSCo: A Language Modeling Approach for Time Series Classification

LI, Daoyuan; LI, Li; BISSYANDE, Tegawendé François D Assise et al.

2016 • In 12th International Conference on Machine Learning and Data Mining (MLDM 2016)

Peer reviewed

Permalink
https://hdl.handle.net/10993/26733

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

paper.pdf

Author preprint (353.96 kB)

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

Time Series; Time Series Classification; Language Modeling

Abstract :

[en] Time series data are abundant in various domains and are often characterized as large in size and high in dimensionality, leading to storage and processing challenges. Symbolic representation of time series – which transforms numeric time series data into texts – is a promising technique to address these challenges. However, these techniques are essentially lossy compression functions and information are partially lost during transformation. To that end, we bring up a novel approach named Domain Series Corpus (DSCo), which builds per-class language models from the symbolized texts. To classify unlabeled samples, we compute the fitness of each symbolized sample against all per-class models and choose the class represented by the model with the best fitness score. Our work innovatively takes advantage of mature techniques from both time series mining and NLP communities. Through extensive experiments on an open dataset archive, we demonstrate that it performs similarly to approaches working with original uncompressed numeric data.

Disciplines :

Computer science

Author, co-author :

LI, Daoyuan ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)

LI, Li ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)

BISSYANDE, Tegawendé François D Assise ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)

KLEIN, Jacques ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > Computer Science and Communications Research Unit (CSC)

LE TRAON, Yves ; University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Computer Science and Communications Research Unit (CSC)

External co-authors :

Language :

English

Title :

DSCo: A Language Modeling Approach for Time Series Classification

Publication date :

July 2016

Event name :

12th International Conference on Machine Learning and Data Mining (MLDM 2016)

Event date :

from 16-07-2016 to 21-07-2016

Audience :

International

Main work title :

12th International Conference on Machine Learning and Data Mining (MLDM 2016)

Peer reviewed :

Peer reviewed

Focus Area :

Computational Sciences

Available on ORBilu :

since 15 April 2016

Statistics

Number of views

382 (29 by Unilu)

Number of downloads

1030 (14 by Unilu)

More statistics