TS-CHIEF: A Scalable and Accurate Forest Algorithm for Time Series Classification

Shifaz, Ahmed; Pelletier, Charlotte; Petitjean, Francois; Webb, Geoffrey I.

doi:10.1007/s10618-020-00679-8

arXiv:1906.10329 (cs)

[Submitted on 25 Jun 2019 (v1), last revised 14 Feb 2020 (this version, v2)]

Abstract: Time Series Classification (TSC) has seen enormous progress over the last two decades. HIVE-COTE (Hierarchical Vote Collective of Transformation-based Ensembles) is the current state of the art in terms of classification accuracy. HIVE-COTE recognizes that time series data are a specific data type for which the traditional attribute-value representation, used predominantly in machine learning, fails to provide a relevant representation. HIVE-COTE combines multiple types of classifiers: each extracting information about a specific aspect of a time series, be it in the time domain, frequency domain or summarization of intervals within the series. However, HIVE-COTE (and its predecessor, FLAT-COTE) is often infeasible to run on even modest amounts of data. For instance, training HIVE-COTE on a dataset with only 1,500 time series can require 8 days of CPU time. It has polynomial runtime with respect to the training set size, so this problem compounds as data quantity increases. We propose a novel TSC algorithm, TS-CHIEF (Time Series Combination of Heterogeneous and Integrated Embedding Forest), which rivals HIVE-COTE in accuracy but requires only a fraction of the runtime. TS-CHIEF constructs an ensemble classifier that integrates the most effective embeddings of time series that research has developed in the last decade. It uses tree-structured classifiers to do so efficiently. We assess TS-CHIEF on 85 datasets of the University of California Riverside (UCR) archive, where it achieves state-of-the-art accuracy with scalability and efficiency. We demonstrate that TS-CHIEF can be trained on 130k time series in 2 days, a data quantity that is beyond the reach of any TSC algorithm with comparable accuracy.
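The abstract does not spell out how the trees combine different views of a series, so the following is only a toy sketch of the general idea: a forest whose nodes each sample a few candidate splitters of different kinds and keep the one with the lowest weighted Gini impurity. Everything here is an assumption made for illustration and is not the authors' implementation: the function names are hypothetical, the two splitter types (nearest class exemplar under squared Euclidean distance, and a thresholded interval mean) are simple stand-ins, and all series are assumed to have equal length.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def weighted_gini(branches):
    """Size-weighted Gini impurity over the label lists of each branch."""
    total = sum(len(b) for b in branches)
    return sum(len(b) / total * gini(b) for b in branches)

def similarity_splitter(X, y, rng):
    """Stand-in similarity split: route to the nearest of one random exemplar per class."""
    exemplars = [rng.choice([x for x, lab in zip(X, y) if lab == c])
                 for c in sorted(set(y))]
    def route(x):
        dists = [sum((a - b) ** 2 for a, b in zip(x, e)) for e in exemplars]
        return dists.index(min(dists))
    return route

def interval_splitter(X, y, rng):
    """Stand-in interval split: threshold the mean of a random interval at its median."""
    length = len(X[0])
    start = rng.randrange(length)
    end = rng.randrange(start + 1, length + 1)  # half-open interval [start, end)
    def feature(x):
        return sum(x[start:end]) / (end - start)
    threshold = sorted(feature(x) for x in X)[len(X) // 2]
    return lambda x: 0 if feature(x) < threshold else 1

def build_tree(X, y, rng, n_candidates=3):
    """Grow one tree; each node keeps the best of several heterogeneous candidates."""
    if len(set(y)) == 1:
        return {"leaf": y[0]}
    majority = Counter(y).most_common(1)[0][0]
    best = None
    for make in (similarity_splitter, interval_splitter):
        for _ in range(n_candidates):
            route = make(X, y, rng)
            groups = {}
            for x, lab in zip(X, y):
                groups.setdefault(route(x), []).append((x, lab))
            if len(groups) < 2:
                continue  # candidate did not separate anything
            score = weighted_gini([[lab for _, lab in g] for g in groups.values()])
            if best is None or score < best[0]:
                best = (score, route, groups)
    if best is None:
        return {"leaf": majority}
    _, route, groups = best
    children = {k: build_tree([x for x, _ in g], [lab for _, lab in g], rng, n_candidates)
                for k, g in groups.items()}
    return {"route": route, "children": children, "default": majority}

def predict_tree(node, x):
    """Follow the chosen splitters down to a leaf label."""
    while "leaf" not in node:
        child = node["children"].get(node["route"](x))
        if child is None:  # branch never seen in training
            return node["default"]
        node = child
    return node["leaf"]

def predict_forest(forest, x):
    """Majority vote over all trees."""
    return Counter(predict_tree(t, x) for t in forest).most_common(1)[0][0]

# Synthetic two-class data: near-zero series versus series with a bump.
rng = random.Random(0)
flat = [[rng.uniform(-0.1, 0.1) for _ in range(8)] for _ in range(10)]
bump = [[(5.0 if 3 <= i <= 4 else 0.0) + rng.uniform(-0.1, 0.1) for i in range(8)]
        for _ in range(10)]
X, y = flat + bump, ["flat"] * 10 + ["bump"] * 10
forest = [build_tree(X, y, rng) for _ in range(5)]
print(predict_forest(forest, [0.0] * 8))
print(predict_forest(forest, [0, 0, 0, 5, 5, 0, 0, 0]))
```

Because every node evaluates only a handful of randomly generated candidates rather than searching exhaustively, the cost per node stays small. That is the kind of trade-off the abstract points to when it credits tree-structured classifiers for efficiency.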

Comments:	37 pages, 10 figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1906.10329 [cs.LG]
	(or arXiv:1906.10329v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.10329
Journal reference:	Data Mining and Knowledge Discovery 34 (2020) 742-775
Related DOI:	https://doi.org/10.1007/s10618-020-00679-8

Submission history

From: Ahmed Shifaz
[v1] Tue, 25 Jun 2019 05:41:16 UTC (1,086 KB)
[v2] Fri, 14 Feb 2020 04:14:24 UTC (344 KB)
