Abstract
In NCAA Division III Wrestling, the question arose how to assign schools to regions in a way that optimizes fairness for individual wrestlers aspiring to the national tournament. The problem fell within cluster analysis but no known clustering algorithms supported its complex and interrelated set of needs. We created several bespoke clustering algorithms based on various heuristics (balanced optimization, weighted spatial clustering, and weighted optimization rectangles) for finding an optimal assignment, and tested each against the generic technique of genetic algorithms. While each of our algorithms had different strengths, the genetic algorithm achieved the highest value on our objective function, including when comparing it to the region assignments that preceded our work. This paper therefore demonstrates a technique that can be used to solve a broad category of clustering problems that arise in athletics, particularly any sport in which athletes compete individually but are assigned to regions as a team.
Acknowledgment
Three anonymous reviewers provided suggestions that improved the clarity of this article, connected it better to the existing literature, and improved how we check the robustness of our methods.
-
Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.
-
Research funding: None declared.
-
Conflict of interest statement: The authors declare no conflicts of interest regarding this article.
References
Andreopoulos, B., A. An, X. Wang, and M. Schroeder. 2009. “A Roadmap of Clustering Algorithms: Finding a Match for a Biomedical Application.” Briefings in Bioinformatics 10 (3): 297–314. https://doi.org/10.1093/bib/bbn058.Search in Google Scholar PubMed
Bigsby, K., and J. Ohlmann. 2017. “Ranking and Prediction of Collegiate Wrestling.” Journal of Sports Analytics 3 (1): 1–19. https://doi.org/10.3233/jsa-160024.Search in Google Scholar
Bliese, P. D. 2000. “Within-group Agreement, Non-independence, and Reliability: Implications for Data Aggregation and Analysis.” In Chapter in Multilevel Theory, Research, and Methods in Organizations: Foundations, Extensions, and New Directions, 349–81. Jossey-Bass.Search in Google Scholar
Bradley, P., U. Fayyad, and C. Reina. 1998. “Scaling Clustering Algorithms to Large Databases.” In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining: 9–15.Search in Google Scholar
Carter, N. C. 2020. Python Code Applying Genetic Clustering Algorithms to NCAA Division III Wrestling. Online Also available at https://github.com/nathancarter/clustering-for-ncaa.Search in Google Scholar
Cowgill, M., R. Harvey, and L. Watson. 1999. “A Genetic Algorithm Approach to Cluster Analysis.” Computers and Mathematics with Applications 37: 99–108. https://doi.org/10.1016/s0898-1221(99)00090-5.Search in Google Scholar
Derringer, G., and R. Suich. 1980. “Simultaneous Optimization of Several Response Variables.” Journal of Quality Technology 12 (4): 214–9. https://doi.org/10.1080/00224065.1980.11980968.Search in Google Scholar
Duda, R. O., and P. E. Hart. 1973. Pattern classification and scene analysis. New York: John Willey & Sons.Search in Google Scholar
Estivill-Castro, V. E. 2002. “Why So Many Clustering Algorithms: A Position Paper.” SIGKDD Explor. Newsl. 4 (1): 65–75. https://doi.org/10.1145/568574.568575.Search in Google Scholar
Gan, G., C. Ma, and J. Wu. 2007. “Data Clustering: Theory, Algorithms, and Applications.” In Society for Industrial and Applied Mathematics. Philadelphia, Pennsylvania: SIAM.10.1137/1.9780898718348Search in Google Scholar
Hall, L. O., I. B. Ozyurt, and J. C. Bezdek. 1999. “Clustering with a Genetically Optimized Approach.” Trans. Evol. Comp 3 (2): 103–12. https://doi.org/10.1109/4235.771164.Search in Google Scholar
Han, J., M. Kamber, and J. Pei. 2012. Data mining concepts and techniques, 3rd ed. India: Elsevier Ltd.Search in Google Scholar
Hruschka, E., R. Campello, and A. Freitas. 2009a. “A Survey of Evolutionary Algorithms for Clustering.” IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 39 (2): 1133–155. https://doi.org/10.1109/tsmcc.2008.2007252.Search in Google Scholar
Hruschka, E. R., R. J. G. B. Campello, A. A. Freitas, and A. C. F. Ponce Leon. 2009b. “A Survey of Evolutionary Algorithms for Clustering.” IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 39 (2): 133–55. https://doi.org/10.1109/tsmcc.2008.2007252.Search in Google Scholar
Malinen, M. I., and P. Fränti. 2014. “Balanced K-Means for Clustering”. In Structural, Syntactic, and Statistical Pattern Recognition, edited by P. Fränti, G. Brown, M. Loog, F. Escolano, and M. Pelillo, 32–41. Berlin: Springer Berlin Heidelberg.10.1007/978-3-662-44415-3_4Search in Google Scholar
NCAA. 2015. Regional Alignment and the Growth of Division III Wrestling. Online Also available at https://www.d3wrestle.com/regional-alignment-and-the-growth-of-division-iii-wrestling/.Search in Google Scholar
NCAA. 2020. Division III Wrestling Website. Online Also available at https://www.ncaa.com/sports/wrestling/d3.Search in Google Scholar
Wagstaff, K., C. Cardie, S. Rogers, and S. Schrödl. 2001. “Constrained K-Means Clustering with Background Knowledge.” ICML ’01 Proceedings of the Eighteenth International Conference on Machine Learning 1: 577–84.Search in Google Scholar
Xu, R., and D. Wunsch. 2008. Clustering: Wiley-IEEE Press.10.1002/9780470382776Search in Google Scholar
Zhou, A., B.-Y. Qu, H. Li, S.-Z. Zhao, P. N. Suganthan, and Q. Zhang. 2011. “Multiobjective Evolutionary Algorithms: A Survey of the State of the Art.” Swarm and Evolutionary Computation 1 (1): 32–49. https://doi.org/10.1016/j.swevo.2011.03.001.Search in Google Scholar
Zhu, S., D. Wang, and T. Li. 2010. “Data Clustering with Size Constraints.” Knowledge-Based Systems 23 (8): 883–9. https://doi.org/10.1016/j.knosys.2010.06.003.Search in Google Scholar
© 2022 Walter de Gruyter GmbH, Berlin/Boston