Abstract
News story segmentation is a challenging task mainly due to the dynamic range of topics, smooth story transitions, and varied duration of each story. This paper presents a technique to segment stories from Urdu news bulletins. The technique relies on a Long Short-Term Memory-based Siamese neural network that is trained on positive (belonging to the same story) and negative (belonging to different stories) pairs of sentences. The model, once trained, identifies the transition between stories by detecting the dissimilarity between the adjacent sentences of a given text. For algorithmic development and experimental study, we employ two datasets, a dataset of Urdu news as well as transcriptions of news bulletins from multiple news channels. Experiments report promising results in identifying story boundaries validating the ideas put forward in this study.
Similar content being viewed by others
References
Ahlers, D.: News consumption and the new electronic media. Harv. Int. J. Press-Polit. 11, 29–52 (2006). https://doi.org/10.1177/1081180X05284317
Ahmad, R., Afzal, MZ., Rashid, SF., et al.: Space anomalies in ocrs for arabic like scripts. In: 2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR), IEEE, pp 67–71 (2018)
Ahmed, S.B., Naz, S., Swati, S., et al.: Ucom offline dataset-an urdu handwritten dataset generation. Int. Arab J. Inf. Technol. 14, 239–245 (2017)
Bromley, J., Guyon, I., LeCun, Y., et al.: Signature verification using a" siamese" time delay neural network. Adv. Neural Inf. Process. Syst. 6 (1993)
Chaisorn, L., Chua, TS., Koh, CK., et al.: A two-level multi-modal approach for story segmentation of large news video corpus. In: TRECVID (2003)
Chen, H., Wang, Z., Pei, Y., et al.: Story segmentation for news broadcast based on primary caption. In: Proceedings of the 2nd ACM International Conference on Multimedia in Asia. Association for Computing Machinery, New York, NY, USA, MMAsia ’20, 10.1145/3444685.3446298, (2021) https://doi.org/10.1145/3444685.3446298
Chi, Z., Zhang, B.: A sentence similarity estimation method based on improved siamese network. J. Intell. Learn. Syst. Appl. 10(4), 121–134 (2018)
Chua, T.S., Chang, S.F., Chaisorn, L., et al.: Story boundary detection in large broadcast news video archives: techniques, experience and trends. In: MULTIMEDIA ’04 (2004)
Dey, S., Dutta, A., Toledo, J.I., et al.: Signet: Convolutional siamese network for writer independent offline signature verification. (2017) arXiv preprint arXiv:1707.02131
Farooq, M.U., Adeeba, F., Rauf, S., et al.: Improving large vocabulary urdu speech recognition system using deep neural networks. In: INTERSPEECH, pp 2978–2982 (2019)
Feng, B., Chen, Z., Zheng, R., et al.: Multiple style exploration for story unit segmentation of broadcast news video. Multimed. Syst. 20(4), 347–361 (2014)
Fiscus, J.G., Doddington, G.R.: Topic detection and tracking evaluation overview. In: Topic Detection and Tracking. Springer, p 17–31 (2002)
Haloi, P., Bhuyan, M., Chatterjee, D., et al.: Unsupervised story segmentation and indexing of broadcast news video. Multimedia Tools and Applications pp 1–20 (2021)
Hashimoto, I., Wang, Y., Kawai, Y., et al.: Topic detection for video stream based on geographical relationships and its interactive viewing system. In: 2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR), IEEE, pp 28–34 (2021)
Hauptmann, A., Witbrock, M.: Story segmentation and detection of commercials in broadcast news video. In: Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL’98-, pp 168–179, 10.1109/ADL.1998.670392 (1998)
Hussain, K., Mughal, N., Ali, I., et al.: Urdu news dataset 1m. Mendeley Data 3 (2021)
Kannao, R., Guha, P.: Story segmentation in tv news broadcast. In: 2016 23rd International Conference on Pattern Recognition (ICPR), IEEE, pp 2948–2953 (2016)
Kannao, R., Guha, P.: Segmenting with style: detecting program and story boundaries in tv news broadcast videos. Multimed. Tools Appl. 78(22), 31925–31957 (2019)
Kechaou, Z., Wali, A., Ben Ammar, M., Karray, H., Alimi, A.M.: A novel system for video news’ sentiment analysis. J. Syst. Inf. Technol. 15(1), 24–44 (2013)
Këpuska, V., Bohouta, G.: Comparing speech recognition systems (microsoft api, google api and cmu sphinx). Int. J. Eng. Res. Appl. 7(03), 20–24 (2017)
Khan, N.H., Adnan, A.: Urdu optical character recognition systems: present contributions and future directions. IEEE Access 6, 46019–46046 (2018)
Kumar, M., Jindal, M.K., Sharma, R.K., et al.: Character and numeral recognition for non-indic and indic scripts: a survey. Artif. Intell. Rev. 52, 2235–2261 (2019)
Kumar, V., Sundaram, S.: Siamese based neural network for offline writer identification on word level data. (2022) arXiv preprint arXiv:2211.14443
Liu, Z., Wang, Y.: Tv news story segmentation using deep neural network. pp 1–4 (2018)
Lu, X., Leung, C.C., Xie, L., et al.: Broadcast news story segmentation using latent topics on data manifold. In: 2013 IEEE International Conference on Acoustics, pp. 8465–8469. Speech and Signal Processing, IEEE (2013)
Mansouri, S., Charhad, M., Rekik, A., et al.: A framework for semantic video content indexing using textual information. In: 2018 IEEE second international conference on data stream mining & processing (DSMP), IEEE, pp 107–110 (2018)
Mirza, A., Siddiqi, I.: Recognition of cursive video text using a deep learning framework. IET Image Proc. 14(14), 3444–3455 (2020)
Mirza, A., Zeshan, O., Atif, M., et al.: (2020) Detection and recognition of cursive text from video frames. EURASIP J. Image Video Process. 1, 1–19 (2020)
Misra, H., Hopfgartner, F., Goyal, A., et al.: Tv news story segmentation based on semantic coherence and content similarity. In: International Conference on Multimedia Modeling, Springer, pp 347–357 (2010)
Mushtaq, F., Misgar, M.M., Kumar, M., et al.: Urdudeepnet: offline handwritten Urdu character recognition using deep neural network. Neural Comput. Appl. 33(22), 15229–15252 (2021)
Ruiz, V., Linares, I., Sanchez, A., et al.: Off-line handwritten signature verification using compositional synthetic generation of signatures and siamese neural networks. Neurocomputing 374, 30–41 (2020)
Saedi, C., Dras, M.: Siamese networks for large-scale author identification. Comput. Speech Lang. 70(101), 241 (2021)
Smeaton, A., Over, P.: Trecvid 2006: Shot boundary detection task overview. In: Proceedings of the TRECVID Workshop (2006)
Spolaor, N., Lee, H.D., Takaki, W.S.R., et al.: A systematic review on content-based video retrieval. Eng. Appl. Artif. Intell. 90(103), 557 (2020)
Urala Kota, B., Davila, K., Stone, A., et al.: Generalized framework for summarization of fixed-camera lecture videos by detecting and binarizing handwritten content. Int. J. Doc. Anal. Recognit. 22(3), 221–233 (2019)
Vinciarelli, A., Favre, S.: Broadcast news story segmentation using social network analysis and hidden markov models. In: Proceedings of the 15th ACM International Conference on Multimedia. Association for Computing Machinery, New York, NY, USA, MM ’07, p 261-264 (2007)
Wang, Y., Yao, Q., Kwok, J.T., et al.: Generalizing from a few examples: a survey on few-shot learning. ACM Comput. Surv. 53(3), 1–34 (2020)
Xie, L., Yang, Y.L., Liu, Z.Q.: On the effectiveness of subwords for lexical cohesion based story segmentation of chinese broadcast news. Inf. Sci. 181(13), 2873–2891 (2011)
Yasin, D., Sohail, A., Siddiqi, I.: Semantic video retrieval using deep learning techniques. In: 2020 17th International Bhurban Conference on Applied Sciences and Technology (IBCAST), IEEE, pp 338–343 (2020)
Yu, J., Shao, H.: Broadcast news story segmentation using sticky hierarchical dirichlet process. Appl. Intell. 52(11), 12788–12800 (2022)
Yu, J., Xie, L., Xiao, X., et al.: A hybrid neural network hidden markov model approach for automatic story segmentation. J. Ambient. Intell. Humaniz. Comput. 8(6), 925–936 (2017)
Zhou, M., Fan, Z., Wang, R., et al.: News video story segmentation with multi-modality features. In: 2020 8th International Conference on Digital Home (ICDH), pp 271–275 (2020)
Author information
Authors and Affiliations
Contributions
All authors contributed equally to the preparation of the manuscript from idea conceptualization to experimentation and write-up.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Bhatti, M.N.A., Siddiqi, I. & Moetesum, M. LSTM-based Siamese neural network for Urdu news story segmentation. IJDAR 26, 363–373 (2023). https://doi.org/10.1007/s10032-023-00441-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-023-00441-y