电子科技 ›› 2019, Vol. 32 ›› Issue (2): 47-50.doi: 10.16180/j.cnki.issn1007-7820.2019.02.010

• • 上一篇    下一篇

一种改进的字典学习方法在医疗疾病分析中的应用

骆冲,邬春学   

  1. 上海理工大学 光电信息与计算机工程学院, 上海 200093
  • 收稿日期:2018-01-18 出版日期:2019-02-15 发布日期:2019-01-02
  • 作者简介:骆冲(1991-),男,硕士研究生。研究方向:大数据,机器学习,无线传感网等。|邬春学(1964-),男,教授。研究方向:无线传感网,分布式和嵌入式系统,物联网等。
  • 基金资助:
    上海市科学计划项目(16111107502);上海市科学计划项目(17511107203)

An Improved Dictionary Learning Method for Medical Disease Analysis

LUO Chong,WU Chunxue   

  1. School of Optical Electrical & Computer Engineering,University of Shanghai for Science & Technology,Shanghai 200093,China
  • Received:2018-01-18 Online:2019-02-15 Published:2019-01-02
  • Supported by:
    Shanghai Science and Technology Innovation Action Plan Project(16111107502);Shanghai Science and Technology Innovation Action Plan Project(17511107203)

摘要:

针对传统字典学习算法忽略样本与字典原子之间关联信息及有限的训练集容易出现过拟合的现象,文中提出了通过加权机制及二范数约束的方法,建立样本与字典原子之间的权重关系,并对编码系数用二范数进行约束。采用留一交叉验证法在抑郁症数据集上比较了算法的精确度、灵敏度和错误率等指标。实验结果表明,改进的算法在医疗疾病分类中有良好的效果,分类精确度平均提高了21.87%,灵敏度和错误率等方面也有良好的表现。

关键词: 医疗大数据, 字典学习, 稀疏表示, 疾病分类, 加权机制, 范式约束

Abstract:

This paper proposed the weighted mechanism to combine the sample with the dictionary atom based on the traditional dictionary learning algorithm, which ignored the relationship between the sample and the dictionary atom. At the same time, the l2 norm regularization constraint was adopted to avoid over fitting on coding coefficients. The paper used the leave one out cross validation to compare the accuracy, sensitivity and mean error rate of the algorithm in the depression data sets. The results showed that the improved method had a good effect on the medical disease classification. The average classification accuracy was improved by 21.87%, and the sensitivity and mean error rate also displayed good performance.

Key words: medical big data, dictionary learning, sparse representation, disease classification, weighted mechanism, paradigm constraint

中图分类号: 

  • TP301.6