Abstract
Feature selection of the neighborhood rough set is an important step in preprocessing the data and improving classification performance. Neighborhood granules form the basis for neighborhood rough set learning and reasoning, but granules typically have overlap, which will cause sample classification uncertainty or repeatability. For this reason, a new notion of neighborhood equivalence relation is used in this paper. Neighborhood equivalence granules solve the above problems, those granules are usually finer than that of the classical neighborhood rough set. In this paper, the neighborhood relation in traditional neighborhood rough set is replaced by neighborhood equivalence relation. Based on neighborhood equivalence relation, a Neighborhood rough set Model based on neighborhood Equivalence Relation (NMER) is proposed. We also introduce the properties of NMER and explain the significance of features. Based on the proposed NMER, a feature selection algorithm is also designed. The reduction results on twelve datasets show that the proposed feature selection algorithm can select main and useful features, confirming the effectiveness of the algorithm.
Similar content being viewed by others
Data availability
The algorithm proposed in this paper can provide source code and experimental results of the algorithm.
References
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction. Springer, Berlin
Xie XJ, Qin XL (2017) A novel incremental attribute reduction approach for dynamic incomplete decision systems. Int J Approx Reason 93:443–462
Sun L, Zhang XY, Qian YH, Xu JC, Zhang SG (2019) Feature selection using neighborhood entropy-based uncertainty measures for gene expression data classification. Inform Sci 502:18–41
Pawlak Z (1982) Rough set. Int J Comput Inf Sci 11(5):341–356
Pawlak Z, Skowron A (2006) Rudiments of rough set. Inf Sci 177:3–27
Greco S, Matarazzo B, Slowinski R (2002) Rough approximation by dominance relations. Int J Intell Syst 17:153–171
Sang B, Chen H, Yang L, Li T, Xu W, Luo C (2021) Feature selection for dynamic interval-valued ordered data based on fuzzy dominance neighborhood rough set. Knowl Based Syst 227:107223
Palangetić M, Cornelis C, Greco S, Słowiński R (2021) Fuzzy extensions of the dominance-based rough set approach. Int J Approx Reason 129:1–19
Yang S, Yang H, De Baets B, Jah M, Shi G (2021) Quantitative dominance-based neighborhood rough sets via fuzzy preference relations. IEEE Trans Fuzzy Syst 29(3):515–529
Dai J, Zou X, Wu WZ (2022) Novel fuzzy β-covering rough set models and their applications. Inform Sci 608:286–312
Huang Z, Li J (2021) Multi-scale covering rough sets with applications to data classification. Appl Soft Comput 110(107736):1–12
Zhou J, Xu F, Guan Y, Wang H (2021) Three types of fuzzy covering-based rough set models. Fuzzy Sets Syst 423:122–148
Huang Z, Li J, Qian Y (2022) Noise-tolerant fuzzy β covering based multi-granulation rough sets and feature subset selection. IEEE Trans Fuzzy Syst 30(7):2721–2735
An S, Zhao E, Wang C, Guo G, Zhao S, Li P (2023) Relative fuzzy rough approximations for feature selection and classification. IEEE Trans Cybern 53(4):2200–2210
An S, Hu Q, Wang C (2021) Probability granular distance-based fuzzy rough set model. Appl Soft Comput 102:107064
Wang C, Qian Y, Ding W, Feng X (2022) Feature selection with fuzzy-rough minimum classification error criterion. IEEE Trans Fuzzy Syst 30(8):2930–2942
Hu QH, Zhang L, Zhang D, Pan W, An S, Pedrycz W (2011) Measuring relevance between discrete and continuous features based on neighborhood mutual information. Expert Syst Appl 38:10737–10750
Hu Q, Liu J, Yu D (2008) Mixed feature selection based on granulation and approximation. Knowl Based Syst 21(4):294–304
Hu Q, Yu D, Liu J, Wu C (2008) Neighborhood rough set based heterogeneous feature subset selection. Inform Sci 178(18):3577–3594
Yang X, Liang S, Yu H, Gao S, Qian Y (2019) Pseudo-label neighborhood rough set: measures and attribute reductions. Int J Approx Reason 105:112–129
Wang Q, Qian Y, Liang X, Guo Q, Liang J (2018) Local neighborhood rough set. Knowl Based Syst 153:53–64
Guo Y, Tsang ECC, Xu W, Chen D (2019) Local logical disjunction double-quantitative rough sets. Inform Sci 500:87–112
Li W, Huang Z, Jia X, Cai X (2016) Neighborhood based decision-theoretic rough set models. Int J Approx Reason 69:1–17
Lin G, Qian Y, Li J (2012) NMGRS: Neighborhood-based multigranulation rough set. Int J Approx Reason 53(7):1080–1093
Sun L, Wang L, Ding W, Qian Y, Xu J (2020) Neighborhood multi-granulation rough set-based attribute reduction using lebesgue and entropy measures in incomplete neighborhood decision systems. Knowl Based Syst 192:10537
Hu C, Zhang L, Wang B, Zhang Z, Li F (2019) Incremental updating knowledge in neighborhood multigranulation rough set under dynamic granular structures. Knowl Based Syst 163:811–829
Guo Y, Tsang ECC, Xu W, Chen D (2020) Adaptive weighted generalized multi-granulation interval-valued decision-theoretic rough sets. Knowl Based Syst 187:104804
Sang B, Yang L, Chen H, Xu W, Guo Y, Yuan Z (2019) Generalized multi-granulation double-quantitative decision-theoretic rough set of multi-source information system. Int J Approx Reason 115:157–179
Zhang HD, Zhan JM, He YP (2019) Multi-granulation hesitant fuzzy rough sets and corresponding applications. Soft Comput 23(24):13085–13103
Luo S, Miao D, Zhang Z, Zhang Y, Hu S (2020) A neighborhood rough set model with nominal metric embedding. Inform Sci 520:373–388
Yu Y, Pedrycz W, Miao D (2013) Neighborhood rough set based multi-label classification for automatic image annotation. Int J Approx Reason 54(9):1373–1387
Liu D, Li J (2019) Safety monitoring data classification method based on wireless rough network of neighborhood rough set. Saf Sci 118:103–108
Chu X, Sun B, Li X, Han K, Zhang Y, Huang Q (2020) Neighborhood rough setbased three-way clustering considering attribute correlations: an approach to classification of potential gout groups. Inform Sci 535:28–41
Chen Y, Zhang Z, Zheng J, Ma Y, Xue Y (2017) Gene selection for tumor classification using neighborhood rough set and entropy measures. J Biomed Inform 67:59–68
Behera B, Kumaravelan G (2021) Text document classification using fuzzy rough set based on robust nearest neighbor (FRS-RNN). Soft Comput 25(15):9915–9923
Swiniarski RW, Skowron A (2003) Rough set methods in feature selection and recognition. Pattern Recogn Lett 24:833–849
Wang C, Shi Y, Fan X, Shao M (2019) Attribute reduction based on k-nearest neighborhood rough set. Int J Approx Reason 106:18–31
Shu W, Qian W, Xie Y (2020) Incremental feature selection for dynamic hybrid data using neighborhood rough set. Knowl Based Syst 194:105516
Yu W, Zhang M, Shen Y (2019) Learning a local manifold representation based on improved neighborhood rough set and LLE for hyperspectral dimensionality reduction. Signal Process 164:20–29
Liu J, Lin Y, Li Y, Weng W, Wu S (2018) Online multi-label streaming feature selection based on neighborhood rough set. Pattern Recognit 84:273–287
Chen H, Li T, Fan X, Luo C (2019) Feature selection for imbalanced data based on neighborhood rough set. Inform Sci 483:1–20
Xie X, Zhang XY, Yang QL (2022) Improved ID3 decision tree algorithm induced by neighborhood equivalence relation. Appl Res Comput 39:1
Sun L, Wang LY, Qian YH, Xu JC, Zhang SG (2019) Feature selection using Lebesgue and entropy measures for incomplete neighborhood decision systems. Knowl Based Syst 186:104942
Sun L, Wang LY, Xu JC, Zhang SG (2019) A neighborhood rough set-based attribute reduction method using Lebesgue and entropy measures. Entropy 21(2):138
Paul A, Sil J, Mukhopadhyay CD (2017) Gene selection for designing optimal fuzzy rule base classifier by estimating missing value. Appl Soft Comput 55:276–288
Wang GY (2003) Rough reduction in algebra view and information view. Int J Intell Syst 18:679–688
Hu Q, Zhao H, Xie Z, Yu D (2007) Consistency based attribute reduction. In: Proceedings of Pacific-Asia conference on knowledge discovery and data mining, pp 96–107
Dash M, Liu H (2003) Consistency-based search in feature selection. Artif Intell 151:155–176
Acknowledgements
The work was partly supported by the National Natural Science Foundation of China (Nos. 12161082, 61861039).
Author information
Authors and Affiliations
Contributions
Conceptualization: SW; Formal analysis and investigation: SW, LW; Methodology: SW, LW; Writing -original draft preparation: LW, SG; Experimentation: LW, SG, ZH; Writing-review and editing: SW, LW, ZH, YL.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Consent to participate
Informed consent was not required, as no humans or animals were involved.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wu, S., Wang, L., Ge, S. et al. Neighborhood rough set with neighborhood equivalence relation for feature selection. Knowl Inf Syst 66, 1833–1859 (2024). https://doi.org/10.1007/s10115-023-01999-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-023-01999-z