Medical Question Summarization with Entity-driven Contrastive Learning

Published: 15 April 2024

Abstract

By summarizing long consumer health questions into shorter, essential ones, medical question-answering systems can more accurately understand consumer intent and retrieve suitable answers. However, medical question summarization (MQS) is very challenging because patients and doctors describe health problems in markedly different ways. Although deep learning has been applied successfully to the MQS task, two challenges remain: how to correctly capture the question focus so as to model its semantic intent, and how to obtain reliable datasets for fair performance evaluation. To address these challenges, this article proposes a novel medical question summarization framework based on entity-driven contrastive learning (ECL). ECL employs the medical entities present in frequently asked questions (FAQs) as focuses and devises an effective mechanism to generate hard negative samples. This compels models to attend to essential information and consequently generate more accurate question summaries. Furthermore, we have discovered that some MQS datasets suffer from significant data leakage; for example, the iCliniq dataset has a 33% duplicate rate. To ensure an impartial evaluation of the related methods, this article carefully examines the leaked samples and reorganizes the datasets on a more reasonable basis. Extensive experiments demonstrate that our ECL method outperforms existing methods and achieves new state-of-the-art ROUGE-1 scores of 52.85, 43.16, 41.31, and 43.52 on the MeQSum, CHQ-Summ, iCliniq, and HealthCareMagic datasets, respectively. The code and datasets are available at https://github.com/yrbobo/MQS-ECL
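The abstract describes the hard-negative mechanism only at a high level. The sketch below illustrates the general idea of entity-driven hard negatives paired with an InfoNCE-style contrastive loss: a negative is built by swapping a medical entity in the question for a different one, yielding a sentence that is lexically close but semantically wrong, and the loss pulls the anchor embedding toward the positive while pushing it away from such negatives. All function names, the entity-swap heuristic, and the use of cosine similarity here are illustrative assumptions, not details taken from the ECL paper.

```python
import math
import random

def swap_entity(question, entities, vocabulary):
    # Hypothetical hard-negative generator: replace one medical entity
    # mentioned in the question with a different entity from the vocabulary,
    # producing a lexically similar but semantically incorrect question.
    present = [e for e in entities if e in question]
    if not present:
        return question
    old = random.choice(present)
    new = random.choice([e for e in vocabulary if e != old])
    return question.replace(old, new)

def info_nce(anchor, positive, negatives, temperature=0.1):
    # Standard InfoNCE contrastive loss over cosine similarities:
    # maximize agreement between anchor and positive embeddings while
    # minimizing agreement with the hard-negative embeddings.
    def cos(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        nu = math.sqrt(sum(a * a for a in u))
        nv = math.sqrt(sum(b * b for b in v))
        return dot / (nu * nv)

    logits = [cos(anchor, positive) / temperature]
    logits += [cos(anchor, n) / temperature for n in negatives]
    m = max(logits)  # subtract max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    return -math.log(exps[0] / sum(exps))
```

For example, swapping the entity in "what causes migraine headaches" might yield "what causes asthma headaches" as a hard negative; the loss is then small when the anchor embedding is close to the positive and far from the negatives, and large otherwise. In the actual framework, the embeddings would come from a pretrained encoder rather than the toy vectors used here.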



Published in

ACM Transactions on Asian and Low-Resource Language Information Processing, Volume 23, Issue 4
April 2024, 221 pages
ISSN: 2375-4699
EISSN: 2375-4702
DOI: 10.1145/3613577
Editor: Imed Zitouni


Publisher

Association for Computing Machinery, New York, NY, United States

Publication History

• Received: 21 August 2023
• Revised: 27 December 2023
• Accepted: 24 February 2024
• Online AM: 11 March 2024
• Published: 15 April 2024
