Skip to main content
Log in

Research on sensitive text classification detection and classification based on improved artificial neural network

  • Published:
Journal of Combinatorial Optimization Aims and scope Submit manuscript

Abstract

With the rapid development of information technology, a large number of sensitive words have appeared, which has brought great harm to network security and social stability. Therefore, how to identify and classify these sensitive information accurately has become an important issue. Combined with the improved artificial network algorithm, an adaptive classification algorithm is proposed in the experiment, which can provide local intelligent classification service according to the classification results. At the same time, the algorithm transforms the clustering model structure of the traditional network algorithm, introduces the classification information, so that it can be applied to the classification problem, and expands the application field of the data field theory. The experimental results show that: (1) From the experimental results of different text quantities, the SOM algorithm assigns the classification task to different levels of nodes, and realizes the modularization of detection. (2) The overall mean results show that the highest recall rate is 87%, which has met the basic grading criteria, and the detection accuracy of sensitive words will also be improved. (3) The experimental results show that the algorithm can accurately classify the sensitive and speed up the parameter optimization, and is superior to the comparison algorithm in many indicators. (4) From the simulation results, compared with the traditional neural network algorithm, the precision and recall of the algorithm are maintained at more than 90% and the loss is less than 0.11.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

The original contributions presented in this study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

References

  • Chen X, Zhang J, Li X et al (2011) A text classification method for chinese sensitive web page recognition. Meas Control Technol 05:27–31

    Google Scholar 

  • Dai G, She J (2012) Dictionary improvement and implementation based on double array Trie tree algorithm. Softw Guide 07:17–19

    Google Scholar 

  • Dai X, Lin X, Zhou X (2009) Research on clustering method of pattern neural network. J Beijing Inst Petrochem Technol 17(004):13–16

    Google Scholar 

  • Du D, Mu Z, Chen J, Fang X (2007) A multi-layer self-organizing neural network for simulating the cognitive process of chinese characters. J Univ Sci Technol Beijing 29(001):102–106

    Google Scholar 

  • Han K, Wang Y (2000) A nonlinear weighted method for subject extraction. Chin J Inform Sci 06:650–653

    Google Scholar 

  • He J (2008) A new method to realize the peripheral contour of Chinese characters. Comput Digit Eng 36(012):149–151

    Google Scholar 

  • Huo L, Xu W, Zhao Y (2013) Design and implementation of embedded license plate recognition system. Comput Technol Dev 05(4):206–208

    Google Scholar 

  • Li W (2020) Design of web sensitive word filtering and sensitive text classification system. Comput Knowl Technol 16(08):251–253

    Google Scholar 

  • Li G, Li X, He X (2005) Application of improved ART1 neural network in fault diagnosis of aero-engine. Microcomput Inform 21(09S):156–158

    Google Scholar 

  • Liu Z (2017) Chinese character image font recognition based on CNN network. Mod Comput 21(5):67–70

    Google Scholar 

  • Liu Q, Li Y, Yang D (2016) Summary of knowledge map construction technology. Comput Res Dev 53(3):582–600

    Google Scholar 

  • Mo H, Guan F, Lu S (2007) Research on multi-class and multi-dimensional problems of artificial immune memory classifier. Comput Eng Sci 29(2):100–103

    Google Scholar 

  • Shi X (2006) Development and application of artificial neural network. J Chongqing Inst Sci Technol Nat Sci Edit 8(002):99–101

    Google Scholar 

  • Suo H, Liu Y, Cao S (2006) A keyword extraction method based on lexical chain. Chin J Inform 06:25–30

    Google Scholar 

  • Tong X, Zhu J (2006) Rough classification of off-line handwritten Chinese character recognition with large character set. Comput Appl 26(B06):24–26

    Google Scholar 

  • Wu Y, Zhao M (2001) Neuron model with adjustable activation function and its supervised learning and application. Sci China E Ser 31(003):263–272

    Google Scholar 

  • Yang Z, Yan Y (2006) Research and development of som neural network algorithms. Comput Eng 32:16

    Google Scholar 

  • Yang F, Li H, Tian X (2009) chinese character rough classification based on RBF neural network. Comput Eng Appl 45(006):170–172

    Google Scholar 

  • Yu P, Liu B (2004) Improved ART1 network and its application in english character recognition. J Yunnan Univ Nat Sci Edit 26(5):397–400

    Google Scholar 

  • Zeng J, Wang X (2014) A novel KNN classification algorithm based on Mahalanobis distance. Mod Comput 32:7–12

    Google Scholar 

  • Zhang S, Pan X (2011) Research on a novel text classification method based on Mahalanobis distance. J Changchun Inst Technol Nat Sci Edit 02:102–105

    Google Scholar 

  • Zhou X, Lin X, Chen X, Li Y (2008) An improved artificial neural network model. Compt Eng Appl 44(009):64–66

    Google Scholar 

  • Zuo L, Tan M (2002) Data clustering based on ART1 artificial neural network. Comput Eng Sci 24(2):92–95

    MathSciNet  Google Scholar 

Download references

Funding

The authors have not disclosed any funding.

Author information

Authors and Affiliations

Authors

Contributions

The authors participated in the preparation and presentation of the manuscript.

Corresponding author

Correspondence to Haisheng Gu.

Ethics declarations

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gu, H., Li, Q. & Shen, D. Research on sensitive text classification detection and classification based on improved artificial neural network. J Comb Optim 46, 20 (2023). https://doi.org/10.1007/s10878-023-01085-8

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10878-023-01085-8

Keywords

Navigation