Skip to main content
Log in

Understanding Themes in Postsecondary Research Using Topic Modeling and Journal Abstracts

  • Published:
Research in Higher Education Aims and scope Submit manuscript

Abstract

As the number of articles on postsecondary topics expands, new methods are required to quantitatively understand the literature. Previous scholars looking at the higher education literature use manual coding, which limits the number of years that can be studied, or network analysis of citations and words, which does not yield groupings of articles by topic area. Instead, we use topic modeling to understand the subject areas that scholars investigate, as well as changes in these subject areas over time. Topic modeling assumes that a group of abstracts contains a mix of topics that are hidden (or latent) because we can only observe abstracts and the words that appear within abstracts, but not the underlying topics. Each abstract and word are then viewed as having a probability of belonging to a topic or subject area. Our data consist of abstracts from the set of articles published in The Journal of Higher Education, Research in Higher Education, and Review of Higher Education between 1991 and 2020. We find 24 main topics in the postsecondary literature in the past three decades. The most common topics in the literature during the past three decades are research usage and research methodology (18%), followed by college access (9%), identities and experiences (9%), student engagement (9%), and academic careers (8%). The research topics that became more popular over time are all student related: identities and experiences, college access, financial aid, student experiences with diversity, and student success. Topics that became less popular over time include academic misconduct, research usage and research methodology, and academic careers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  • Arun, R. V., Suresh, C. E., Madhavan, V., & Narasimha Murthy, N. M. (2010). On finding the natural number of topics with latent dirichlet allocation: Some observations. In M. J. Zaki, J. X. Yu, B. Ravindran, & V. Pudi (Eds.), Advances in knowledge discovery and data mining (pp. 391–402). Springer.

    Chapter  Google Scholar 

  • Biglan, A. (1973). The characteristics of subject matter in different academic areas. Journal of Applied Psychology, 57(3), 195–203.

    Article  Google Scholar 

  • Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.

    Google Scholar 

  • Bray, N. J., & Major, C. H. (2011). Status of journals in the field of higher education. Journal of Higher Education, 82(4), 479–503.

    Article  Google Scholar 

  • Budd, J. M. (1990). Higher education literature: Characteristics of citation patterns. Journal of Higher Education, 61(1), 84–97.

    Google Scholar 

  • Budd, J. M., & Magnuson, L. (2010). Higher education literature revisited: Citation patterns examined. Research in Higher Education, 51, 294–304.

    Article  Google Scholar 

  • Calma, A., & Davies, M. (2015). Studies in Higher Education 1976–2013: A retrospective using citation network analysis. Studies in Higher Education, 40(1), 4–21.

    Article  Google Scholar 

  • Calma, A., & Davies, M. (2017). Geographies of influence: A citation network analysis of Higher Education 1972–2014. Scientometrics, 110(3), 1579–1599.

    Article  Google Scholar 

  • Cao, J., Tian, X., Jintao, L. Yongdong, Z., & Sheng, T. (2009). A density-based method for adaptive lda model selection. In Neurocomputing—16th European Symposium on Artificial Neural Networks 2008 (vol. 72, pp. 1775–1781).

  • Chow, N. L., Tateishi, N., & Goldhar, A. (2023). Does knowledge have a half-life? An observational study analyzing the use of older citations in medical and scientific publications. British Medical Journal Open, 23, e072374.

    Google Scholar 

  • Creamer, E. G. (1994). Gender and publications in core higher education journals. Journal of College Student Development, 35(1), 35–39.

    Google Scholar 

  • Crisp, G., Carales, V. D., & Núñez, A. (2016). Where is the research on community college students? Community College Journal of Research and Practice, 40(9), 767–778.

    Article  Google Scholar 

  • Davis, T. L., & Liddell, D. L. (1997). Publication trends in the Journal of College Student Development: 1987–1995. Journal of College Student Development, 38(4), 325–332.

    Google Scholar 

  • De Battisti, F., Ferrara, A., & Salini, S. (2015). A decade of research in statistics: A topic model approach. Scientometrics, 103, 413–433.

    Article  Google Scholar 

  • Deveaud, R., San Juan, E., & Bellot, P. (2014). Accurate and effective latent concept modeling for ad hoc information retrieval. Document Numérique, 17(1), 61–84.

    Article  Google Scholar 

  • Donaldson, J. F., & Townsend, B. K. (2007). Higher education journals’ discourse about adult undergraduate students. Journal of Higher Education, 78(1), 27–50.

    Article  Google Scholar 

  • Earp, V. J. (2010). A bibliometric snapshot of The Journal of Higher Education and Its Impact on the Field. Behavioral & Social Sciences Librarian, 29(4), 283–295.

    Article  Google Scholar 

  • Gatti, C. J., Brooks, J. D., Nurre, S. G. (2015). A historical analysis of the field of OR/MS using topic models. http://arxiv.org/abs/1510.05154.

  • Gentzkow, M., Kelly, B., & Taddy, M. (2019). Text as data. Journal of Economic Literature, 57(3), 535–574.

    Article  Google Scholar 

  • Griffiths, T. L., & Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Sciences, 101(supplement 1), 5228–5235.

    Article  Google Scholar 

  • Grimmer, J., & Stewart, B. M. (2017). Text as data: The promise and pitfalls of automatic content analysis methods for political texts. Political Analysis, 21(3), 267–297.

    Article  Google Scholar 

  • GüneÅŸ, E., ÃœstündaÄŸ, M. T., Yalçın, H., & Safran, M. (2017). Investigating educational research articles (1980–2014) in terms of bibliometric indicators. International Online Journal of Educational Sciences, 9(1), 101–117.

    Google Scholar 

  • Hahn A., Mohanty S. D., Manda P. (2017). What’s hot and what’s not?—Exploring trends in bioinformatics literature using topic modeling and keyword analysis. In: Cai Z., Daescu O., Li M. (eds.) Bioinformatics Research and Applications. ISBRA 2017. Lecture Notes in Computer Science, 10330, 279–290.

  • Harper, S. R. (2012). Race without Racism: How higher education researchers minimize racist institutional norms. Review of Higher Education, 36(1), 9–29.

    Article  Google Scholar 

  • Hollibaugh, G. E. (2019). The use of text as data methods in public administration: A review and an application to agency priorities. Journal of Public Administration Research and Theory, 29(3), 474–490.

    Article  Google Scholar 

  • Hutchinson, S., & Lovell, C. (2004). A review of methodological characteristics of research published in key journals in higher education: Implications for graduate research training. Research in Higher Education, 45(4), 383–403.

    Article  Google Scholar 

  • Kuh, G. D., Bean, J. P., Bradley, R. K., Coomes, M. D., & Hunter, D. E. (1986). Changes in research on college students published in selected journals between 1969 and 1983. Review of Higher Education, 9(2), 177–192.

    Article  Google Scholar 

  • Kwiek, M. (2021). The prestige economy of higher education journals: A quantitative approach. Higher Education, 81, 493–519.

    Article  Google Scholar 

  • Libkind, A. N., Markusova, V. A., & Libkind, I. A. (2020). Approach for using journal citation reports in determining the dynamics of half-life indicators of journals. Automatic Documentation and Mathematical Linguistics, 54, 174–183.

    Article  Google Scholar 

  • Milam, J. H. (1991). The presence of paradigms in the core higher education journal literature. Research in Higher Education, 32(6), 651–668.

    Article  Google Scholar 

  • Okamura, K. (2022). Scientometric engineering: Exploring citation dynamics via arXiv eprints. Quantitative Science Studies, 3(1), 122–146.

    Article  Google Scholar 

  • Peña, E. V. (2014). Marginalization of published scholarship on students with disabilities in higher education journals. Journal of College Student Development, 55(1), 30–40.

    Article  Google Scholar 

  • Perna, L. W., & Titus, M. A. (2004). Understanding differences in the choice of college attended: The role of state public policies. Review of Higher Education, 27(4), 501–525.

    Article  Google Scholar 

  • Silverman, R. J. (1985). Higher education as a maturing field. Research in Higher Education, 23(2), 150–183.

    Article  Google Scholar 

  • Silverman, R. J. (1987). How we know what we know: A study of higher education journal articles. Review of Higher Education, 11(1), 39–59.

    Article  Google Scholar 

  • Smart, J. C., & Elton, C. F. (1981). Structural characteristics and citation rates of education journals. American Education Research Journal, 18(4), 399–414.

    Article  Google Scholar 

  • Smith, R. A. (2019). Structuring the conversations: Using co-citation networks to trace 60 years of The Journal of College Student Development. Journal of College Student Development, 60(6), 695–717.

    Article  Google Scholar 

  • Smith, R. A., & Brown, M. G. (2020). Far beyond postsecondary: Longitudinal analyses of topical and citation networks in the field of higher education studies. Review of Higher Education, 44(2), 237–264.

    Article  Google Scholar 

  • Steinhardt, I., Schneijderberg, C., Götze, N., Baumann, J., & Krücken, G. (2017). Mapping the quality assurance of teaching and learning in higher education: the emergence of a specialty? Higher Education, 74, 221–237.

    Article  Google Scholar 

  • Tight, M. (2017). Higher education journals: Their characteristics and contribution. Higher Education Research & Development, 37(3), 607–619.

    Article  Google Scholar 

  • Wang, Y., Bowers, A. J., & Fikis, W. J. (2017). Automated text data mining analysis of five decades of educational leadership research literature: Probabilistic topic modeling of EAQ articles from 1965 to 2014. Education Administration Quarterly, 53(2), 289–323.

    Article  Google Scholar 

  • Wells, R. S., Kolek, E. A., Williams, E. A., & Saunders, D. B. (2015). How do we know what we know: A systematic comparison of research methods employed in higher education journals, 1996–2000 v. 2006–2010. Journal of Higher Education, 86, 171–198.

    Google Scholar 

  • Zou, C. (2018). Analyzing research trends on drug safety using topic modeling. Expert Opinion on Drug Safety, 17(6), 629–636.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Stephen R. Porter.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Co-occurrence Networks for Topics

Appendix: Co-occurrence Networks for Topics

See Figs. 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27 and 28.

Fig. 5
figure 5

Topic 1, epistemological beliefs of students

Fig. 6
figure 6

Topic 2, identities and experiences

Fig. 7
figure 7

Topic 3, financial aid

Fig. 8
figure 8

Topic 4, college access

Fig. 9
figure 9

Topic 5, quantitative methodological approaches

Fig. 10
figure 10

Topic 6, women in STEM

Fig. 11
figure 11

Topic 7, Law and Policy

Fig. 12
figure 12

Topic 8, careers and labor markets

Fig. 13
figure 13

Topic 9, minority students

Fig. 14
figure 14

Topic 10, academic careers

Fig. 15
figure 15

Topic 11, management and leadership

Fig. 16
figure 16

Topic 12, evolution of academic work

Fig. 17
figure 17

Topic 13, teaching and learning

Fig. 18
figure 18

Topic 14, institutional identity

Fig. 19
figure 19

Topic 15, student engagement

Fig. 20
figure 20

Topic 16, institutional revenues

Fig. 21
figure 21

Topic 17, institutional powerbrokers

Fig. 22
figure 22

Topic 18, institutional expenditures

Fig. 23
figure 23

Topic 19, student success

Fig. 24
figure 24

Topic 20, research usage and research methodology

Fig. 25
figure 25

Topic 21, faculty governance

Fig. 26
figure 26

Topic 22, student experiences with diversity

Fig. 27
figure 27

Topic 23, academic misconduct

Fig. 28
figure 28

Topic 24, persistence

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Takei, M., Porter, S.R., Umbach, P.D. et al. Understanding Themes in Postsecondary Research Using Topic Modeling and Journal Abstracts. Res High Educ (2023). https://doi.org/10.1007/s11162-023-09761-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11162-023-09761-8

Keywords

Navigation