Research on sensitive text classification detection and classification based on improved artificial neural network

Gu, Haisheng; Li, Qing; Shen, Duanming

doi:10.1007/s10878-023-01085-8

Research on sensitive text classification detection and classification based on improved artificial neural network

Published: 28 September 2023

Volume 46, article number 20, (2023)
Cite this article

Journal of Combinatorial Optimization Aims and scope Submit manuscript

Haisheng Gu¹,
Qing Li¹ &
Duanming Shen¹

145 Accesses
1 Citation
Explore all metrics

Abstract

With the rapid development of information technology, a large number of sensitive words have appeared, which has brought great harm to network security and social stability. Therefore, how to identify and classify these sensitive information accurately has become an important issue. Combined with the improved artificial network algorithm, an adaptive classification algorithm is proposed in the experiment, which can provide local intelligent classification service according to the classification results. At the same time, the algorithm transforms the clustering model structure of the traditional network algorithm, introduces the classification information, so that it can be applied to the classification problem, and expands the application field of the data field theory. The experimental results show that: (1) From the experimental results of different text quantities, the SOM algorithm assigns the classification task to different levels of nodes, and realizes the modularization of detection. (2) The overall mean results show that the highest recall rate is 87%, which has met the basic grading criteria, and the detection accuracy of sensitive words will also be improved. (3) The experimental results show that the algorithm can accurately classify the sensitive and speed up the parameter optimization, and is superior to the comparison algorithm in many indicators. (4) From the simulation results, compared with the traditional neural network algorithm, the precision and recall of the algorithm are maintained at more than 90% and the loss is less than 0.11.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Applying NLP techniques to malware detection in a practical environment

Article Open access 06 June 2021

Review: machine learning techniques applied to cybersecurity

Article 04 January 2019

Survey of intrusion detection systems: techniques, datasets and challenges

Article Open access 17 July 2019

Data availability

The original contributions presented in this study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

References

Chen X, Zhang J, Li X et al (2011) A text classification method for chinese sensitive web page recognition. Meas Control Technol 05:27–31
Google Scholar
Dai G, She J (2012) Dictionary improvement and implementation based on double array Trie tree algorithm. Softw Guide 07:17–19
Google Scholar
Dai X, Lin X, Zhou X (2009) Research on clustering method of pattern neural network. J Beijing Inst Petrochem Technol 17(004):13–16
Google Scholar
Du D, Mu Z, Chen J, Fang X (2007) A multi-layer self-organizing neural network for simulating the cognitive process of chinese characters. J Univ Sci Technol Beijing 29(001):102–106
Google Scholar
Han K, Wang Y (2000) A nonlinear weighted method for subject extraction. Chin J Inform Sci 06:650–653
Google Scholar
He J (2008) A new method to realize the peripheral contour of Chinese characters. Comput Digit Eng 36(012):149–151
Google Scholar
Huo L, Xu W, Zhao Y (2013) Design and implementation of embedded license plate recognition system. Comput Technol Dev 05(4):206–208
Google Scholar
Li W (2020) Design of web sensitive word filtering and sensitive text classification system. Comput Knowl Technol 16(08):251–253
Google Scholar
Li G, Li X, He X (2005) Application of improved ART1 neural network in fault diagnosis of aero-engine. Microcomput Inform 21(09S):156–158
Google Scholar
Liu Z (2017) Chinese character image font recognition based on CNN network. Mod Comput 21(5):67–70
Google Scholar
Liu Q, Li Y, Yang D (2016) Summary of knowledge map construction technology. Comput Res Dev 53(3):582–600
Google Scholar
Mo H, Guan F, Lu S (2007) Research on multi-class and multi-dimensional problems of artificial immune memory classifier. Comput Eng Sci 29(2):100–103
Google Scholar
Shi X (2006) Development and application of artificial neural network. J Chongqing Inst Sci Technol Nat Sci Edit 8(002):99–101
Google Scholar
Suo H, Liu Y, Cao S (2006) A keyword extraction method based on lexical chain. Chin J Inform 06:25–30
Google Scholar
Tong X, Zhu J (2006) Rough classification of off-line handwritten Chinese character recognition with large character set. Comput Appl 26(B06):24–26
Google Scholar
Wu Y, Zhao M (2001) Neuron model with adjustable activation function and its supervised learning and application. Sci China E Ser 31(003):263–272
Google Scholar
Yang Z, Yan Y (2006) Research and development of som neural network algorithms. Comput Eng 32:16
Google Scholar
Yang F, Li H, Tian X (2009) chinese character rough classification based on RBF neural network. Comput Eng Appl 45(006):170–172
Google Scholar
Yu P, Liu B (2004) Improved ART1 network and its application in english character recognition. J Yunnan Univ Nat Sci Edit 26(5):397–400
Google Scholar
Zeng J, Wang X (2014) A novel KNN classification algorithm based on Mahalanobis distance. Mod Comput 32:7–12
Google Scholar
Zhang S, Pan X (2011) Research on a novel text classification method based on Mahalanobis distance. J Changchun Inst Technol Nat Sci Edit 02:102–105
Google Scholar
Zhou X, Lin X, Chen X, Li Y (2008) An improved artificial neural network model. Compt Eng Appl 44(009):64–66
Google Scholar
Zuo L, Tan M (2002) Data clustering based on ART1 artificial neural network. Comput Eng Sci 24(2):92–95
MathSciNet Google Scholar

Download references

Funding

The authors have not disclosed any funding.

Author information

Authors and Affiliations

Research Institute of Petroleum Exploration and Development, PetroChina, Beijing, 100083, China
Haisheng Gu, Qing Li & Duanming Shen

Authors

Haisheng Gu
View author publications
You can also search for this author in PubMed Google Scholar
Qing Li
View author publications
You can also search for this author in PubMed Google Scholar
Duanming Shen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors participated in the preparation and presentation of the manuscript.

Corresponding author

Correspondence to Haisheng Gu.

Ethics declarations

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Gu, H., Li, Q. & Shen, D. Research on sensitive text classification detection and classification based on improved artificial neural network. J Comb Optim 46, 20 (2023). https://doi.org/10.1007/s10878-023-01085-8

Download citation

Accepted: 30 August 2023
Published: 28 September 2023
DOI: https://doi.org/10.1007/s10878-023-01085-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Research on sensitive text classification detection and classification based on improved artificial neural network

Abstract

Access this article

Similar content being viewed by others

Applying NLP techniques to malware detection in a practical environment

Review: machine learning techniques applied to cybersecurity

Survey of intrusion detection systems: techniques, datasets and challenges

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Research on sensitive text classification detection and classification based on improved artificial neural network

Abstract

Access this article

Similar content being viewed by others

Applying NLP techniques to malware detection in a practical environment

Review: machine learning techniques applied to cybersecurity

Survey of intrusion detection systems: techniques, datasets and challenges

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation