Skip to content
Licensed Unlicensed Requires Authentication Published by De Gruyter June 28, 2022

Clustering algorithms to increase fairness in collegiate wrestling

  • Nathan Carter ORCID logo EMAIL logo , Andrew Harrison , Amar Iyengar , Matthew Lanham , Scott Nestler , Dave Schrader and Amir Zadeh ORCID logo

Abstract

In NCAA Division III Wrestling, the question arose how to assign schools to regions in a way that optimizes fairness for individual wrestlers aspiring to the national tournament. The problem fell within cluster analysis but no known clustering algorithms supported its complex and interrelated set of needs. We created several bespoke clustering algorithms based on various heuristics (balanced optimization, weighted spatial clustering, and weighted optimization rectangles) for finding an optimal assignment, and tested each against the generic technique of genetic algorithms. While each of our algorithms had different strengths, the genetic algorithm achieved the highest value on our objective function, including when comparing it to the region assignments that preceded our work. This paper therefore demonstrates a technique that can be used to solve a broad category of clustering problems that arise in athletics, particularly any sport in which athletes compete individually but are assigned to regions as a team.


Corresponding author: Nathan Carter, Bentley University, Waltham, USA, E-mail:

Acknowledgment

Three anonymous reviewers provided suggestions that improved the clarity of this article, connected it better to the existing literature, and improved how we check the robustness of our methods.

  1. Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.

  2. Research funding: None declared.

  3. Conflict of interest statement: The authors declare no conflicts of interest regarding this article.

References

Andreopoulos, B., A. An, X. Wang, and M. Schroeder. 2009. “A Roadmap of Clustering Algorithms: Finding a Match for a Biomedical Application.” Briefings in Bioinformatics 10 (3): 297–314. https://doi.org/10.1093/bib/bbn058.Search in Google Scholar PubMed

Bigsby, K., and J. Ohlmann. 2017. “Ranking and Prediction of Collegiate Wrestling.” Journal of Sports Analytics 3 (1): 1–19. https://doi.org/10.3233/jsa-160024.Search in Google Scholar

Bliese, P. D. 2000. “Within-group Agreement, Non-independence, and Reliability: Implications for Data Aggregation and Analysis.” In Chapter in Multilevel Theory, Research, and Methods in Organizations: Foundations, Extensions, and New Directions, 349–81. Jossey-Bass.Search in Google Scholar

Bradley, P., U. Fayyad, and C. Reina. 1998. “Scaling Clustering Algorithms to Large Databases.” In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining: 9–15.Search in Google Scholar

Carter, N. C. 2020. Python Code Applying Genetic Clustering Algorithms to NCAA Division III Wrestling. Online Also available at https://github.com/nathancarter/clustering-for-ncaa.Search in Google Scholar

Cowgill, M., R. Harvey, and L. Watson. 1999. “A Genetic Algorithm Approach to Cluster Analysis.” Computers and Mathematics with Applications 37: 99–108. https://doi.org/10.1016/s0898-1221(99)00090-5.Search in Google Scholar

Derringer, G., and R. Suich. 1980. “Simultaneous Optimization of Several Response Variables.” Journal of Quality Technology 12 (4): 214–9. https://doi.org/10.1080/00224065.1980.11980968.Search in Google Scholar

Duda, R. O., and P. E. Hart. 1973. Pattern classification and scene analysis. New York: John Willey & Sons.Search in Google Scholar

Estivill-Castro, V. E. 2002. “Why So Many Clustering Algorithms: A Position Paper.” SIGKDD Explor. Newsl. 4 (1): 65–75. https://doi.org/10.1145/568574.568575.Search in Google Scholar

Gan, G., C. Ma, and J. Wu. 2007. “Data Clustering: Theory, Algorithms, and Applications.” In Society for Industrial and Applied Mathematics. Philadelphia, Pennsylvania: SIAM.10.1137/1.9780898718348Search in Google Scholar

Hall, L. O., I. B. Ozyurt, and J. C. Bezdek. 1999. “Clustering with a Genetically Optimized Approach.” Trans. Evol. Comp 3 (2): 103–12. https://doi.org/10.1109/4235.771164.Search in Google Scholar

Han, J., M. Kamber, and J. Pei. 2012. Data mining concepts and techniques, 3rd ed. India: Elsevier Ltd.Search in Google Scholar

Hruschka, E., R. Campello, and A. Freitas. 2009a. “A Survey of Evolutionary Algorithms for Clustering.” IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 39 (2): 1133–155. https://doi.org/10.1109/tsmcc.2008.2007252.Search in Google Scholar

Hruschka, E. R., R. J. G. B. Campello, A. A. Freitas, and A. C. F. Ponce Leon. 2009b. “A Survey of Evolutionary Algorithms for Clustering.” IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 39 (2): 133–55. https://doi.org/10.1109/tsmcc.2008.2007252.Search in Google Scholar

Malinen, M. I., and P. Fränti. 2014. “Balanced K-Means for Clustering”. In Structural, Syntactic, and Statistical Pattern Recognition, edited by P. Fränti, G. Brown, M. Loog, F. Escolano, and M. Pelillo, 32–41. Berlin: Springer Berlin Heidelberg.10.1007/978-3-662-44415-3_4Search in Google Scholar

NCAA. 2015. Regional Alignment and the Growth of Division III Wrestling. Online Also available at https://www.d3wrestle.com/regional-alignment-and-the-growth-of-division-iii-wrestling/.Search in Google Scholar

NCAA. 2020. Division III Wrestling Website. Online Also available at https://www.ncaa.com/sports/wrestling/d3.Search in Google Scholar

Wagstaff, K., C. Cardie, S. Rogers, and S. Schrödl. 2001. “Constrained K-Means Clustering with Background Knowledge.” ICML ’01 Proceedings of the Eighteenth International Conference on Machine Learning 1: 577–84.Search in Google Scholar

Xu, R., and D. Wunsch. 2008. Clustering: Wiley-IEEE Press.10.1002/9780470382776Search in Google Scholar

Zhou, A., B.-Y. Qu, H. Li, S.-Z. Zhao, P. N. Suganthan, and Q. Zhang. 2011. “Multiobjective Evolutionary Algorithms: A Survey of the State of the Art.” Swarm and Evolutionary Computation 1 (1): 32–49. https://doi.org/10.1016/j.swevo.2011.03.001.Search in Google Scholar

Zhu, S., D. Wang, and T. Li. 2010. “Data Clustering with Size Constraints.” Knowledge-Based Systems 23 (8): 883–9. https://doi.org/10.1016/j.knosys.2010.06.003.Search in Google Scholar

Received: 2020-09-04
Accepted: 2022-05-04
Published Online: 2022-06-28
Published in Print: 2022-06-25

© 2022 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 4.5.2024 from https://www.degruyter.com/document/doi/10.1515/jqas-2020-0101/html
Scroll to top button