Abstract
Cross-lingual summarization (CLS) is the task of generating a summary in a target language from a document in a source language. Recently, end-to-end CLS models have achieved impressive results on large-scale, high-quality datasets that are typically constructed by translating monolingual summarization corpora into CLS corpora. However, because translation models for low-resource languages perform poorly, translation noise can seriously degrade the performance of these models. In this paper, we propose a fine-grained reinforcement learning approach to low-resource CLS with noisy data. We introduce the source-language summary as a gold signal to alleviate the impact of the noisy translated target summary. Specifically, we design a reinforcement reward by calculating the word correlation and word missing degree between the source-language summary and the generated target-language summary, and combine it with the cross-entropy loss to optimize the CLS model. To validate the proposed model, we construct Chinese-Vietnamese and Vietnamese-Chinese CLS datasets. Experimental results show that the proposed model outperforms the baselines in terms of both ROUGE and BERTScore.
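To make the training objective concrete, below is a minimal Python sketch of one way such a reward could be combined with the cross-entropy loss. Everything in it is an assumption rather than the paper's actual formulation: the toy bilingual similarity table, the particular readings of "word correlation" and "word missing degree", the self-critical baseline, and all weights are illustrative stand-ins.

```python
import torch

# Toy bilingual word-similarity table standing in for a real
# cross-lingual embedding or alignment model; the pairs and scores
# are invented purely for illustration (Chinese -> Vietnamese).
SIM = {("经济", "kinh tế"): 0.90, ("增长", "tăng trưởng"): 0.85}

def sim(src_w: str, tgt_w: str) -> float:
    return SIM.get((src_w, tgt_w), 0.0)

def word_correlation(src_sum: list, gen_sum: list) -> float:
    # One plausible reading of "word correlation": average best-match
    # similarity of each generated word against the source summary.
    if not src_sum or not gen_sum:
        return 0.0
    return sum(max(sim(s, g) for s in src_sum) for g in gen_sum) / len(gen_sum)

def word_missing_degree(src_sum: list, gen_sum: list, thresh: float = 0.5) -> float:
    # One plausible reading of "word missing degree": fraction of
    # source-summary words with no sufficiently similar counterpart
    # in the generated summary. The threshold is a guess.
    if not src_sum:
        return 0.0
    missing = sum(all(sim(s, g) < thresh for g in gen_sum) for s in src_sum)
    return missing / len(src_sum)

def reward(src_sum: list, gen_sum: list, beta: float = 0.5) -> float:
    # Reward word correlation, penalize missing content; beta is illustrative.
    return word_correlation(src_sum, gen_sum) - beta * word_missing_degree(src_sum, gen_sum)

def mixed_loss(ce_loss: torch.Tensor, log_prob: torch.Tensor,
               sample_r: float, baseline_r: float, lam: float = 0.7) -> torch.Tensor:
    # A common self-critical policy-gradient term mixed with
    # cross-entropy; the mixing weight lam is illustrative.
    rl_loss = -(sample_r - baseline_r) * log_prob
    return lam * rl_loss + (1.0 - lam) * ce_loss

# Example: a sampled Vietnamese summary scored against the Chinese
# source-language summary used as the gold signal.
r = reward(["经济", "增长"], ["kinh tế", "tăng trưởng"])      # 0.875
loss = mixed_loss(torch.tensor(2.3), torch.tensor(-4.1), r, baseline_r=0.6)
```

Using the source-language summary as the reference here avoids scoring against the machine-translated target summary, which is where the translation noise enters.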
Data availability
Owing to the nature of this research, the participants of this study did not consent to their data being shared publicly, so the supporting data are not available.
Author information
Contributions
Yuxin HUANG designed the research. Yumeng GAO processed the data. Huailing GU drafted the paper. Tong PAN and Zhengtao YU helped organize the paper. Huailing GU and Jialong XU revised and finalized the paper.
Ethics declarations
Yuxin HUANG, Huailing GU, Zhengtao YU, Yumeng GAO, Tong PAN, and Jialong XU declare that they have no conflict of interest.
Additional information
Project supported by the National Natural Science Foundation of China (Nos. U21B2027, 62266027, 61972186, and 62241604), the Yunnan Provincial Major Science and Technology Special Plan Projects, China (Nos. 202302AD080003, 202103AA080015, and 202202AD080003), the General Projects of Basic Research in Yunnan Province, China (Nos. 202301AT070471 and 202301AT070393), and the Kunming University of Science and Technology “Double First-Class” Joint Project, China (No. 202201BE070001-021).
About this article
Cite this article
Huang, Y., Gu, H., Yu, Z. et al. Enhancing low-resource cross-lingual summarization from noisy data with fine-grained reinforcement learning. Front Inform Technol Electron Eng 25, 121–134 (2024). https://doi.org/10.1631/FITEE.2300296
Key words
- Cross-lingual summarization
- Low-resource language
- Noisy data
- Fine-grained reinforcement learning
- Word correlation
- Word missing degree