Abstract
As the number of articles on postsecondary topics expands, new methods are required to quantitatively understand the literature. Previous scholars looking at the higher education literature use manual coding, which limits the number of years that can be studied, or network analysis of citations and words, which does not yield groupings of articles by topic area. Instead, we use topic modeling to understand the subject areas that scholars investigate, as well as changes in these subject areas over time. Topic modeling assumes that a group of abstracts contains a mix of topics that are hidden (or latent) because we can only observe abstracts and the words that appear within abstracts, but not the underlying topics. Each abstract and word are then viewed as having a probability of belonging to a topic or subject area. Our data consist of abstracts from the set of articles published in The Journal of Higher Education, Research in Higher Education, and Review of Higher Education between 1991 and 2020. We find 24 main topics in the postsecondary literature in the past three decades. The most common topics in the literature during the past three decades are research usage and research methodology (18%), followed by college access (9%), identities and experiences (9%), student engagement (9%), and academic careers (8%). The research topics that became more popular over time are all student related: identities and experiences, college access, financial aid, student experiences with diversity, and student success. Topics that became less popular over time include academic misconduct, research usage and research methodology, and academic careers.
Similar content being viewed by others
References
Arun, R. V., Suresh, C. E., Madhavan, V., & Narasimha Murthy, N. M. (2010). On finding the natural number of topics with latent dirichlet allocation: Some observations. In M. J. Zaki, J. X. Yu, B. Ravindran, & V. Pudi (Eds.), Advances in knowledge discovery and data mining (pp. 391–402). Springer.
Biglan, A. (1973). The characteristics of subject matter in different academic areas. Journal of Applied Psychology, 57(3), 195–203.
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.
Bray, N. J., & Major, C. H. (2011). Status of journals in the field of higher education. Journal of Higher Education, 82(4), 479–503.
Budd, J. M. (1990). Higher education literature: Characteristics of citation patterns. Journal of Higher Education, 61(1), 84–97.
Budd, J. M., & Magnuson, L. (2010). Higher education literature revisited: Citation patterns examined. Research in Higher Education, 51, 294–304.
Calma, A., & Davies, M. (2015). Studies in Higher Education 1976–2013: A retrospective using citation network analysis. Studies in Higher Education, 40(1), 4–21.
Calma, A., & Davies, M. (2017). Geographies of influence: A citation network analysis of Higher Education 1972–2014. Scientometrics, 110(3), 1579–1599.
Cao, J., Tian, X., Jintao, L. Yongdong, Z., & Sheng, T. (2009). A density-based method for adaptive lda model selection. In Neurocomputing—16th European Symposium on Artificial Neural Networks 2008 (vol. 72, pp. 1775–1781).
Chow, N. L., Tateishi, N., & Goldhar, A. (2023). Does knowledge have a half-life? An observational study analyzing the use of older citations in medical and scientific publications. British Medical Journal Open, 23, e072374.
Creamer, E. G. (1994). Gender and publications in core higher education journals. Journal of College Student Development, 35(1), 35–39.
Crisp, G., Carales, V. D., & Núñez, A. (2016). Where is the research on community college students? Community College Journal of Research and Practice, 40(9), 767–778.
Davis, T. L., & Liddell, D. L. (1997). Publication trends in the Journal of College Student Development: 1987–1995. Journal of College Student Development, 38(4), 325–332.
De Battisti, F., Ferrara, A., & Salini, S. (2015). A decade of research in statistics: A topic model approach. Scientometrics, 103, 413–433.
Deveaud, R., San Juan, E., & Bellot, P. (2014). Accurate and effective latent concept modeling for ad hoc information retrieval. Document Numérique, 17(1), 61–84.
Donaldson, J. F., & Townsend, B. K. (2007). Higher education journals’ discourse about adult undergraduate students. Journal of Higher Education, 78(1), 27–50.
Earp, V. J. (2010). A bibliometric snapshot of The Journal of Higher Education and Its Impact on the Field. Behavioral & Social Sciences Librarian, 29(4), 283–295.
Gatti, C. J., Brooks, J. D., Nurre, S. G. (2015). A historical analysis of the field of OR/MS using topic models. http://arxiv.org/abs/1510.05154.
Gentzkow, M., Kelly, B., & Taddy, M. (2019). Text as data. Journal of Economic Literature, 57(3), 535–574.
Griffiths, T. L., & Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Sciences, 101(supplement 1), 5228–5235.
Grimmer, J., & Stewart, B. M. (2017). Text as data: The promise and pitfalls of automatic content analysis methods for political texts. Political Analysis, 21(3), 267–297.
Güneş, E., Üstündağ, M. T., Yalçın, H., & Safran, M. (2017). Investigating educational research articles (1980–2014) in terms of bibliometric indicators. International Online Journal of Educational Sciences, 9(1), 101–117.
Hahn A., Mohanty S. D., Manda P. (2017). What’s hot and what’s not?—Exploring trends in bioinformatics literature using topic modeling and keyword analysis. In: Cai Z., Daescu O., Li M. (eds.) Bioinformatics Research and Applications. ISBRA 2017. Lecture Notes in Computer Science, 10330, 279–290.
Harper, S. R. (2012). Race without Racism: How higher education researchers minimize racist institutional norms. Review of Higher Education, 36(1), 9–29.
Hollibaugh, G. E. (2019). The use of text as data methods in public administration: A review and an application to agency priorities. Journal of Public Administration Research and Theory, 29(3), 474–490.
Hutchinson, S., & Lovell, C. (2004). A review of methodological characteristics of research published in key journals in higher education: Implications for graduate research training. Research in Higher Education, 45(4), 383–403.
Kuh, G. D., Bean, J. P., Bradley, R. K., Coomes, M. D., & Hunter, D. E. (1986). Changes in research on college students published in selected journals between 1969 and 1983. Review of Higher Education, 9(2), 177–192.
Kwiek, M. (2021). The prestige economy of higher education journals: A quantitative approach. Higher Education, 81, 493–519.
Libkind, A. N., Markusova, V. A., & Libkind, I. A. (2020). Approach for using journal citation reports in determining the dynamics of half-life indicators of journals. Automatic Documentation and Mathematical Linguistics, 54, 174–183.
Milam, J. H. (1991). The presence of paradigms in the core higher education journal literature. Research in Higher Education, 32(6), 651–668.
Okamura, K. (2022). Scientometric engineering: Exploring citation dynamics via arXiv eprints. Quantitative Science Studies, 3(1), 122–146.
Peña, E. V. (2014). Marginalization of published scholarship on students with disabilities in higher education journals. Journal of College Student Development, 55(1), 30–40.
Perna, L. W., & Titus, M. A. (2004). Understanding differences in the choice of college attended: The role of state public policies. Review of Higher Education, 27(4), 501–525.
Silverman, R. J. (1985). Higher education as a maturing field. Research in Higher Education, 23(2), 150–183.
Silverman, R. J. (1987). How we know what we know: A study of higher education journal articles. Review of Higher Education, 11(1), 39–59.
Smart, J. C., & Elton, C. F. (1981). Structural characteristics and citation rates of education journals. American Education Research Journal, 18(4), 399–414.
Smith, R. A. (2019). Structuring the conversations: Using co-citation networks to trace 60 years of The Journal of College Student Development. Journal of College Student Development, 60(6), 695–717.
Smith, R. A., & Brown, M. G. (2020). Far beyond postsecondary: Longitudinal analyses of topical and citation networks in the field of higher education studies. Review of Higher Education, 44(2), 237–264.
Steinhardt, I., Schneijderberg, C., Götze, N., Baumann, J., & Krücken, G. (2017). Mapping the quality assurance of teaching and learning in higher education: the emergence of a specialty? Higher Education, 74, 221–237.
Tight, M. (2017). Higher education journals: Their characteristics and contribution. Higher Education Research & Development, 37(3), 607–619.
Wang, Y., Bowers, A. J., & Fikis, W. J. (2017). Automated text data mining analysis of five decades of educational leadership research literature: Probabilistic topic modeling of EAQ articles from 1965 to 2014. Education Administration Quarterly, 53(2), 289–323.
Wells, R. S., Kolek, E. A., Williams, E. A., & Saunders, D. B. (2015). How do we know what we know: A systematic comparison of research methods employed in higher education journals, 1996–2000 v. 2006–2010. Journal of Higher Education, 86, 171–198.
Zou, C. (2018). Analyzing research trends on drug safety using topic modeling. Expert Opinion on Drug Safety, 17(6), 629–636.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Takei, M., Porter, S.R., Umbach, P.D. et al. Understanding Themes in Postsecondary Research Using Topic Modeling and Journal Abstracts. Res High Educ (2023). https://doi.org/10.1007/s11162-023-09761-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11162-023-09761-8