Research on Time Series Data Classification Combine SAX and AdaBoost Algorithm
摘要:
SAX是一种典型的符号化特征表示方法.该方法在时间序列特征表示中不仅可以有效地降维、降噪,而且具有简单、直观等特点.时间序列长度不一、特征表示过程中信息损失等问题的存在,使得常规的分类算法难以很好地完成分类任务.在对时间序列数据进行基于SAX符号化的BOP表示方法的基础上,提出了结合集成学习中AdaBoost算法进行分类的新方法,实验结果表明,该方法不仅能很好地处理SAX符号化表示中的信息损失问题,而且与已有方法相比,在分类准确度方面也有了显著的提高.
Symbolic Aggregate approXimation (SAX) is a typical symbolic representation method, which is straight-for- ward and very simple, and it efficiently converts time series data to a symbolic representation with dimension reduction. The is- sues of time series data such as variable in length, and information lose during the representation, making many traditional clas- sification methods unable to apply directly. This paper focus on the SAX discretization method coupled with the Bag of Patterns (BOP) representation in classification task, and proposed the new approach by use AdaBoost Algorithm to remedy the informa- tion loss by SAX representation. The experimental results show that, the approach improved the classification accuracy obvi- ously.
作者:
宋玉 高明磊 宋伟
机构地区:
郑州大学信息工程学院
出处:
《betway官方app 学报:自然科学版》 CAS 北大核心 2015年第3期155-160,共6页
基金:
国家自然科学基金(61202207) 河南省教育厅科学技术研究重点项目(13A520453)
关键词:
时间序列 分类 SAX BOP ADABOOST
time series classification SAX BOP AdaBoost
分类号:
TP311 [自动化与计算机技术—计算机软件与理论]