Visualization of AE's Training on Credit Card Transactions with Persistent Homology

Charlier, Jérémy Henri J.; Petit, François; Ormazabal, Gaston; State, Radu; Hilger, Jean

Paper published in a journal (Scientific congresses, symposiums and conference proceedings)

2019 • In Proceedings of the International Workshop on Applications of Topological Data Analysis In conjunction with ECML PKDD 2019

Peer reviewed

Permalink
https://hdl.handle.net/10993/40330

Files

Full Text

ECMLPKDD_PHomWAE(1).pdf

Author preprint (530.68 kB)

All documents in ORBilu are protected by a user license.

Details

Keywords :

Barcodes; Encoding-Decoding; Persistence Diagrams

Abstract :

Auto-encoders are among the most popular neural network architectures for dimension reduction. They are composed of two parts: the encoder, which maps the model distribution to a latent manifold, and the decoder, which maps the latent manifold to a reconstructed distribution. However, auto-encoders are known to produce chaotically scattered data distributions in the latent manifold, resulting in an incomplete reconstructed distribution. Current distance measures fail to detect this problem because they cannot account for the shape of the data manifolds, i.e. their topological features, or for the scale at which the manifolds should be analyzed. We propose Persistent Homology for Wasserstein Auto-Encoders, called PHom-WAE, a new methodology to assess and measure the data distribution of a generative model. PHom-WAE minimizes the Wasserstein distance between the true distribution and the reconstructed distribution and uses persistent homology, the study of the topological features of a space at different spatial resolutions, to compare the nature of the latent manifold and the reconstructed distribution. Our experiments underline the potential of persistent homology for Wasserstein Auto-Encoders in comparison to Variational Auto-Encoders, another type of generative model. The experiments are conducted on a real-world data set that is particularly challenging for traditional distance measures and auto-encoders. PHom-WAE is the first methodology to propose a topological distance measure, the bottleneck distance, for Wasserstein Auto-Encoders used to compare decoded samples of high quality in the context of credit card transactions.
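
To illustrate what "topological features of a space at different spatial resolutions" means in the simplest case, the following minimal sketch computes the 0-dimensional persistence barcode of a small point cloud. It is not the authors' implementation: the function name `h0_barcode` and the sample points are hypothetical, and only connected components are tracked (no higher-dimensional features, no bottleneck distance). It assumes Python 3.8 or later for `math.dist`.

```python
import math
from itertools import combinations


def h0_barcode(points):
    """Return the finite death scales of the 0-dimensional Vietoris-Rips barcode.

    Every point starts as its own connected component at scale 0. A bar dies
    at the scale where its component merges into another one. The single
    component that never dies is omitted from the result.
    """
    n = len(points)
    parent = list(range(n))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # All pairwise Euclidean distances, processed from shortest to longest.
    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i, j in combinations(range(n), 2)
    )

    deaths = []
    for dist, i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            # Two components merge at this scale, so one bar ends here.
            parent[root_i] = root_j
            deaths.append(dist)
    return deaths


# Hypothetical sample: two well-separated pairs of points on a line.
sample = [(0, 0), (1, 0), (10, 0), (11, 0)]
print(h0_barcode(sample))  # [1.0, 1.0, 9.0]
```

In this toy sample, the two short bars (length 1.0) correspond to points merging within each pair, while the long bar (length 9.0) reflects the gap between the two pairs. Long bars of this kind are the scale-robust features that a persistence-based comparison of two point clouds would focus on.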

Research center :

Interdisciplinary Centre for Security, Reliability and Trust (SnT) > Services and Data management research group (SEDAN)

Disciplines :

Computer science

Author, co-author :

Charlier, Jérémy Henri J. ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT)

Petit, François ; University of Luxembourg > Faculty of Science, Technology and Communication (FSTC) > Mathematics Research Unit

Ormazabal, Gaston

State, Radu ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT)

Hilger, Jean ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT)

External co-authors :

yes

Language :

English

Publication date :

September 2019

Event name :

International Workshop on Applications of Topological Data Analysis In conjunction with ECML PKDD 2019

Event organizer :

ECML PKDD

Event place :

Würzburg, Germany

Event date :

from 16-09-2019 to 20-09-2019

Audience :

International

Journal title :

Proceedings of the International Workshop on Applications of Topological Data Analysis In conjunction with ECML PKDD 2019

Peer reviewed :

Peer reviewed

Available on ORBilu :

since 12 September 2019
