基于三支决策的二阶段分类模型研究
摘要:
当对三支决策边界域进一步划分时,边界域知识存在划分信息不足,从而导致分类精度不高,针对上述问题提出一种新的基于三支决策的二阶段分类模型(TWD-TP).第一阶段根据贝叶斯规则构建三支决策中样本的条件概率,通过求解最优化损失函数得到所需阈值,然后按照三支决策规则对数据集进行划分.三支决策是基于最小风险贝叶斯决策理论的划分,在其正域、负域中包含一定的误分类样本;在第二阶段通过类标签索引分别将正域、负域中误分样本作为增量信息引入延迟决策域,形成重构边界域,最后对重构边界域进行划分.实验结果表明:所提出的TWD-TP模型不仅能在三支决策划分中筛选出高误分类特征的样本,同时其重构边界域中不能被划分的样本得到正确划分,分类精度进一步提高.
Aiming at the further division of the three-way decisions boundary domains,the problem of insufficient classification accuracy of the boundary knowledge of the three-way decisions caused by insufficient information,this paper proposes a new two-stage classification model based on three-way decisions(TWD-TP).In the first stage,the conditional probabilities of samples in three-way decisions are constructed by Bayesian rule,the required thresholds are obtained by solving the optimal loss function.Then the data sets are divided according to the three decision rules.However,the three-way decisions are based on the division of least-risk Bayes decision theory,including some misclassified samples in positive and negative domains.In the second phase,the samples of misclassification in positive domain and negative domain are introduced into the delayed decision domain as incremental information by class label index to construct new boundary domain,that is,reconstruction boundary domain.Finally,the classifier is used to perform classification verification on the reconstruction boundary domain objects.The experimental results show that the TWD-TP model proposed in this paper can not only filter out the samples with high misclassification features in the three-way decisions division,but also can correctly divide the previously undivided samples in the reconstruction boundary and improve the classification accuracy.
作者:
徐久成 徐战威 李梦凡 王楠
Xu Jiucheng;Xu Zhanwei;Li Mengfan;Wang Nan(College of Computer and Information Engineering,Henan Normal University,Xinxiang 453007,China;Henan Technology Research Center for Computational Intelligence and Data Mining,Henan Normal University,Xinxiang 453007,China)
机构地区:
betway官方app 计算机与信息工程学院 河南省高校计算智能与数据挖掘工程技术研究中心
出处:
《betway官方app 学报:自然科学版》 CAS 北大核心 2019年第3期28-34,124,共8页
基金:
国家自然科学基金(61370169 61402153 60873104) 中国博士后科学基金项目(2016M602247) 河南省科技攻关重点项目(162102210261)
关键词:
三支决策 二阶段 增量信息 边界域
three-way decisions two-stage incremental information boundary domain
分类号:
O225 [理学—运筹学与控制论] TP181 [自动化与计算机技术—控制理论与控制工程]