Electronic Science and Technology, 2024, Vol. 37, Issue (1): 81-86. doi: 10.16180/j.cnki.issn1007-7820.2024.01.012


Review of Text Classification Research Based on Deep Learning

WANG Jiawei1, YU Xiao2

  1. School of Cyber Science and Engineering, Southeast University, Nanjing 210096, China
  2. School of Continuing Education, Southeast University, Nanjing 210096, China
  • Received: 2022-09-17 Online: 2024-01-15 Published: 2024-01-11
  • About the authors: WANG Jiawei (1997-), male, master's student. Research interest: natural language processing. | YU Xiao (1973-), female, lecturer. Research interests: network management, cloud computing, network security.
  • Supported by:
    China University Industry-University-Research Innovation Fund (2020ITA07007)

Abstract:

Compared with traditional machine learning models, deep learning models attempt to imitate the way humans learn and to extract features from massive data automatically. Text classification is an important application of natural language processing and plays a key role in text information processing. In the past few years, research on text classification using deep learning methods has surged and achieved good results. This study briefly introduces text classification methods based on traditional models and on deep learning, and reviews advanced text classification methods with a focus on deep learning models. It introduces and summarizes the research progress and results of deep learning models used for text classification in recent years, and it summarizes the development trends and research difficulties of deep learning in text classification and offers an outlook on them.

Key words: deep learning, natural language processing, text classification, machine learning, neural networks, pre-trained model, attention mechanism, long short-term memory

CLC Number:

  • TP391