Abstract
In this paper, we present a framework for segmenting the news programs into different story topics. The proposed method utilizes both visual and text information of the video. We represent the news video by a Shot Connectivity Graph (SCG), where the nodes in the graph represent the shots in the video, and the edges between nodes represent the transitions between shots. The cycles in the graph correspond to the story segments in the news program. We first detect the cycles in the graph by finding the anchor persons in the video. This provides us with the coarse segmentation of the news video. The initial segmentation is later refined by the detections of the weather and sporting news, and the merging of similar stories. For the weather detection, the global color information of the images and the motion of the shots are considered. We have used the text obtained from automatic speech recognition (ASR) for detecting the potential sporting shots to form the sport stories. Adjacent stories with similar semantic meanings are further merged based on the visual and text similarities. The proposed framework has been tested on a widely used data set provided by NIST, which contains the ground truth of the story boundaries, and competitive evaluation results have been obtained.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Chaisorn, L., Chua, T.-S., Lee, C.-H.: The Segmentation of News Video Into Story Units. International Conference on Multimedia and Expo (2002)
Gauvain, J.L., Lamel, L., Adda, G.: The LIMSI Broadcast News Transcription System. Speech Communication 37(1-2), 89–108 (2002)
Hoashi, K., Sugano, M., Naito, M., Matsumoto, K., Sugaya, F., Nakajima, Y.: Shot Boundary Determination on MPEG Compressed Domain and Story Segmentation Experiments for TRECVID 2004. In: TREC Video Retrieval Evaluation Forum (2004)
Hanjalic, A., Lagendijk, R.L., Biemond, J.: Automated High-Level Movie Segmentation for Advanced Video-Retrieval Systems. IEEE Transaction on Circuits and System for Video Technology 9(4) (1999)
Hsu, W., Chang, S.F.: Generative, Discriminative, and Ensemble Learning on Multi-Model Perceptual Fusion Toward News Video Story Segmentation. In: International Conference on Multimedia and Expo (2004)
Kender, J.R., Yeo, B.L.: Video Scene Segmentation Via Continuous Video Coherence. Computer Vision and Pattern Recognition (1998)
Lienhart, R., Pfeiffer, S., Effelsberg, W.: Scene Determination Based on Video and Audio Features. In: IEEE Conference on Multimedia Computing and Systems (1999)
Ngo, C.W., Zhang, H.J., Chin, R.T., Pong, T.C.: Motion-Based Video Representation for Scene Change Detection. International Journal of Computer Vision (2001)
Sundaram, H., Chang, S.F.: Video Scene Segmentation Using Video and Audio Features. In: International Conference on Multimedia and Expo (2000)
Viola, P., Jones, M.: Robust Real-Time Object Detection. International Journal of Computer Vision (2001)
Yeung, M., Yeo, B., Liu, B.: Segmentation of Videos by Clustering and Graph Analysis. Computer Vision and Image Understanding 71(1) (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhai, Y., Yilmaz, A., Shah, M. (2005). Story Segmentation in News Videos Using Visual and Text Cues. In: Leow, WK., Lew, M.S., Chua, TS., Ma, WY., Chaisorn, L., Bakker, E.M. (eds) Image and Video Retrieval. CIVR 2005. Lecture Notes in Computer Science, vol 3568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11526346_13
Download citation
DOI: https://doi.org/10.1007/11526346_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27858-0
Online ISBN: 978-3-540-31678-7
eBook Packages: Computer ScienceComputer Science (R0)