Skip to main content
Log in

Neighborhood rough set with neighborhood equivalence relation for feature selection

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Feature selection of the neighborhood rough set is an important step in preprocessing the data and improving classification performance. Neighborhood granules form the basis for neighborhood rough set learning and reasoning, but granules typically have overlap, which will cause sample classification uncertainty or repeatability. For this reason, a new notion of neighborhood equivalence relation is used in this paper. Neighborhood equivalence granules solve the above problems, those granules are usually finer than that of the classical neighborhood rough set. In this paper, the neighborhood relation in traditional neighborhood rough set is replaced by neighborhood equivalence relation. Based on neighborhood equivalence relation, a Neighborhood rough set Model based on neighborhood Equivalence Relation (NMER) is proposed. We also introduce the properties of NMER and explain the significance of features. Based on the proposed NMER, a feature selection algorithm is also designed. The reduction results on twelve datasets show that the proposed feature selection algorithm can select main and useful features, confirming the effectiveness of the algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Data availability

The algorithm proposed in this paper can provide source code and experimental results of the algorithm.

References

  1. Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction. Springer, Berlin

    Google Scholar 

  2. Xie XJ, Qin XL (2017) A novel incremental attribute reduction approach for dynamic incomplete decision systems. Int J Approx Reason 93:443–462

    MathSciNet  Google Scholar 

  3. Sun L, Zhang XY, Qian YH, Xu JC, Zhang SG (2019) Feature selection using neighborhood entropy-based uncertainty measures for gene expression data classification. Inform Sci 502:18–41

    MathSciNet  Google Scholar 

  4. Pawlak Z (1982) Rough set. Int J Comput Inf Sci 11(5):341–356

    Google Scholar 

  5. Pawlak Z, Skowron A (2006) Rudiments of rough set. Inf Sci 177:3–27

    MathSciNet  Google Scholar 

  6. Greco S, Matarazzo B, Slowinski R (2002) Rough approximation by dominance relations. Int J Intell Syst 17:153–171

    Google Scholar 

  7. Sang B, Chen H, Yang L, Li T, Xu W, Luo C (2021) Feature selection for dynamic interval-valued ordered data based on fuzzy dominance neighborhood rough set. Knowl Based Syst 227:107223

    Google Scholar 

  8. Palangetić M, Cornelis C, Greco S, Słowiński R (2021) Fuzzy extensions of the dominance-based rough set approach. Int J Approx Reason 129:1–19

    MathSciNet  Google Scholar 

  9. Yang S, Yang H, De Baets B, Jah M, Shi G (2021) Quantitative dominance-based neighborhood rough sets via fuzzy preference relations. IEEE Trans Fuzzy Syst 29(3):515–529

    Google Scholar 

  10. Dai J, Zou X, Wu WZ (2022) Novel fuzzy β-covering rough set models and their applications. Inform Sci 608:286–312

    Google Scholar 

  11. Huang Z, Li J (2021) Multi-scale covering rough sets with applications to data classification. Appl Soft Comput 110(107736):1–12

    Google Scholar 

  12. Zhou J, Xu F, Guan Y, Wang H (2021) Three types of fuzzy covering-based rough set models. Fuzzy Sets Syst 423:122–148

    MathSciNet  Google Scholar 

  13. Huang Z, Li J, Qian Y (2022) Noise-tolerant fuzzy β covering based multi-granulation rough sets and feature subset selection. IEEE Trans Fuzzy Syst 30(7):2721–2735

    Google Scholar 

  14. An S, Zhao E, Wang C, Guo G, Zhao S, Li P (2023) Relative fuzzy rough approximations for feature selection and classification. IEEE Trans Cybern 53(4):2200–2210

    PubMed  Google Scholar 

  15. An S, Hu Q, Wang C (2021) Probability granular distance-based fuzzy rough set model. Appl Soft Comput 102:107064

    Google Scholar 

  16. Wang C, Qian Y, Ding W, Feng X (2022) Feature selection with fuzzy-rough minimum classification error criterion. IEEE Trans Fuzzy Syst 30(8):2930–2942

    Google Scholar 

  17. Hu QH, Zhang L, Zhang D, Pan W, An S, Pedrycz W (2011) Measuring relevance between discrete and continuous features based on neighborhood mutual information. Expert Syst Appl 38:10737–10750

    Google Scholar 

  18. Hu Q, Liu J, Yu D (2008) Mixed feature selection based on granulation and approximation. Knowl Based Syst 21(4):294–304

    Google Scholar 

  19. Hu Q, Yu D, Liu J, Wu C (2008) Neighborhood rough set based heterogeneous feature subset selection. Inform Sci 178(18):3577–3594

    MathSciNet  Google Scholar 

  20. Yang X, Liang S, Yu H, Gao S, Qian Y (2019) Pseudo-label neighborhood rough set: measures and attribute reductions. Int J Approx Reason 105:112–129

    MathSciNet  Google Scholar 

  21. Wang Q, Qian Y, Liang X, Guo Q, Liang J (2018) Local neighborhood rough set. Knowl Based Syst 153:53–64

    Google Scholar 

  22. Guo Y, Tsang ECC, Xu W, Chen D (2019) Local logical disjunction double-quantitative rough sets. Inform Sci 500:87–112

    Google Scholar 

  23. Li W, Huang Z, Jia X, Cai X (2016) Neighborhood based decision-theoretic rough set models. Int J Approx Reason 69:1–17

    MathSciNet  CAS  Google Scholar 

  24. Lin G, Qian Y, Li J (2012) NMGRS: Neighborhood-based multigranulation rough set. Int J Approx Reason 53(7):1080–1093

    MathSciNet  Google Scholar 

  25. Sun L, Wang L, Ding W, Qian Y, Xu J (2020) Neighborhood multi-granulation rough set-based attribute reduction using lebesgue and entropy measures in incomplete neighborhood decision systems. Knowl Based Syst 192:10537

    Google Scholar 

  26. Hu C, Zhang L, Wang B, Zhang Z, Li F (2019) Incremental updating knowledge in neighborhood multigranulation rough set under dynamic granular structures. Knowl Based Syst 163:811–829

    Google Scholar 

  27. Guo Y, Tsang ECC, Xu W, Chen D (2020) Adaptive weighted generalized multi-granulation interval-valued decision-theoretic rough sets. Knowl Based Syst 187:104804

    Google Scholar 

  28. Sang B, Yang L, Chen H, Xu W, Guo Y, Yuan Z (2019) Generalized multi-granulation double-quantitative decision-theoretic rough set of multi-source information system. Int J Approx Reason 115:157–179

    MathSciNet  Google Scholar 

  29. Zhang HD, Zhan JM, He YP (2019) Multi-granulation hesitant fuzzy rough sets and corresponding applications. Soft Comput 23(24):13085–13103

    Google Scholar 

  30. Luo S, Miao D, Zhang Z, Zhang Y, Hu S (2020) A neighborhood rough set model with nominal metric embedding. Inform Sci 520:373–388

    MathSciNet  Google Scholar 

  31. Yu Y, Pedrycz W, Miao D (2013) Neighborhood rough set based multi-label classification for automatic image annotation. Int J Approx Reason 54(9):1373–1387

    Google Scholar 

  32. Liu D, Li J (2019) Safety monitoring data classification method based on wireless rough network of neighborhood rough set. Saf Sci 118:103–108

    ADS  Google Scholar 

  33. Chu X, Sun B, Li X, Han K, Zhang Y, Huang Q (2020) Neighborhood rough setbased three-way clustering considering attribute correlations: an approach to classification of potential gout groups. Inform Sci 535:28–41

    MathSciNet  Google Scholar 

  34. Chen Y, Zhang Z, Zheng J, Ma Y, Xue Y (2017) Gene selection for tumor classification using neighborhood rough set and entropy measures. J Biomed Inform 67:59–68

    PubMed  Google Scholar 

  35. Behera B, Kumaravelan G (2021) Text document classification using fuzzy rough set based on robust nearest neighbor (FRS-RNN). Soft Comput 25(15):9915–9923

    Google Scholar 

  36. Swiniarski RW, Skowron A (2003) Rough set methods in feature selection and recognition. Pattern Recogn Lett 24:833–849

    ADS  Google Scholar 

  37. Wang C, Shi Y, Fan X, Shao M (2019) Attribute reduction based on k-nearest neighborhood rough set. Int J Approx Reason 106:18–31

    MathSciNet  Google Scholar 

  38. Shu W, Qian W, Xie Y (2020) Incremental feature selection for dynamic hybrid data using neighborhood rough set. Knowl Based Syst 194:105516

    Google Scholar 

  39. Yu W, Zhang M, Shen Y (2019) Learning a local manifold representation based on improved neighborhood rough set and LLE for hyperspectral dimensionality reduction. Signal Process 164:20–29

    Google Scholar 

  40. Liu J, Lin Y, Li Y, Weng W, Wu S (2018) Online multi-label streaming feature selection based on neighborhood rough set. Pattern Recognit 84:273–287

    ADS  Google Scholar 

  41. Chen H, Li T, Fan X, Luo C (2019) Feature selection for imbalanced data based on neighborhood rough set. Inform Sci 483:1–20

    Google Scholar 

  42. Xie X, Zhang XY, Yang QL (2022) Improved ID3 decision tree algorithm induced by neighborhood equivalence relation. Appl Res Comput 39:1

    Google Scholar 

  43. Sun L, Wang LY, Qian YH, Xu JC, Zhang SG (2019) Feature selection using Lebesgue and entropy measures for incomplete neighborhood decision systems. Knowl Based Syst 186:104942

    Google Scholar 

  44. Sun L, Wang LY, Xu JC, Zhang SG (2019) A neighborhood rough set-based attribute reduction method using Lebesgue and entropy measures. Entropy 21(2):138

    MathSciNet  PubMed  PubMed Central  ADS  Google Scholar 

  45. Paul A, Sil J, Mukhopadhyay CD (2017) Gene selection for designing optimal fuzzy rule base classifier by estimating missing value. Appl Soft Comput 55:276–288

    Google Scholar 

  46. Wang GY (2003) Rough reduction in algebra view and information view. Int J Intell Syst 18:679–688

    Google Scholar 

  47. Hu Q, Zhao H, Xie Z, Yu D (2007) Consistency based attribute reduction. In: Proceedings of Pacific-Asia conference on knowledge discovery and data mining, pp 96–107

  48. Dash M, Liu H (2003) Consistency-based search in feature selection. Artif Intell 151:155–176

    MathSciNet  Google Scholar 

Download references

Acknowledgements

The work was partly supported by the National Natural Science Foundation of China (Nos. 12161082, 61861039).

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization: SW; Formal analysis and investigation: SW, LW; Methodology: SW, LW; Writing -original draft preparation: LW, SG; Experimentation: LW, SG, ZH; Writing-review and editing: SW, LW, ZH, YL.

Corresponding author

Correspondence to Shangzhi Wu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Consent to participate

Informed consent was not required, as no humans or animals were involved.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, S., Wang, L., Ge, S. et al. Neighborhood rough set with neighborhood equivalence relation for feature selection. Knowl Inf Syst 66, 1833–1859 (2024). https://doi.org/10.1007/s10115-023-01999-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-023-01999-z

Keywords

Navigation