Publications des agents du Cirad


Deep multivariate time series embedding clustering via attentive-gated autoencoder

Ienco D., Interdonato R.. 2020. In : Lauw H.W. (ed.), Wong R.C.W. (ed.), Ntoulas A. (ed.), Lim E.P. (ed.), Ng S.K. (ed.), Pan S.J. (ed.). Advances in knowledge discovery and data mining: 24th Pacific-Asia Conference, PAKDD 2020, Singapore, May 11¿14, 2020, Proceedings, Part I. Cham : Springer, p. 318-329. (Lecture Notes in Artificial Intelligence, 12084). Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2020). 24, 2020-05-11/2020-05-14, Singapour (Singapour).

DOI: 10.1007/978-3-030-47426-3_25

Nowadays, great quantities of data are produced by a large and diverse family of sensors (e.g., remote sensors, biochemical sensors, wearable devices), which typically measure multiple variables over time, resulting in data streams that can be profitably organized as multivariate time-series. In practical scenarios, the speed at which such information is collected often makes the data labeling task uneasy and too expensive, so that limit the use of supervised approaches. For this reason, unsupervised and exploratory methods represent a fundamental tool to deal with the analysis of multivariate time series. In this paper we propose a deep-learning based framework for clustering multivariate time series data with varying lengths. Our framework, namely DeTSEC (Deep Time Series Embedding Clustering), includes two stages: firstly a recurrent autoencoder exploits attention and gating mechanisms to produce a preliminary embedding representation; then, a clustering refinement stage is introduced to stretch the embedding manifold towards the corresponding clusters. Experimental assessment on six real-world benchmarks coming from different domains has highlighted the effectiveness of our proposal.

Documents associés

Communication de congrès

Agents Cirad, auteurs de cette publication :