Story Segmentation in News Videos Using Visual and Text Cues

Zhai, Yun; Yilmaz, Alper; Shah, Mubarak

doi:10.1007/11526346_13

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3568)

Included in the following conference series:

International Conference on Image and Video Retrieval

Abstract

In this paper, we present a framework for segmenting news programs into different story topics. The proposed method utilizes both the visual and the text information of the video. We represent the news video by a Shot Connectivity Graph (SCG), where the nodes in the graph represent the shots in the video, and the edges between nodes represent the transitions between shots. The cycles in the graph correspond to the story segments in the news program. We first detect the cycles in the graph by finding the anchor persons in the video. This provides a coarse segmentation of the news video. The initial segmentation is later refined by detecting weather and sports news and by merging similar stories. For the weather detection, the global color information of the images and the motion of the shots are considered. We use the text obtained from automatic speech recognition (ASR) to detect potential sports shots, which are grouped to form the sports stories. Adjacent stories with similar semantic meanings are further merged based on their visual and text similarities. The proposed framework has been tested on a widely used data set provided by NIST, which contains the ground truth of the story boundaries, and competitive evaluation results have been obtained.
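The graph idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each shot has already been assigned a cluster label (so visually similar shots share a node) and that the anchor-person node is already known. The names `build_scg`, `coarse_stories`, and the label `"anchor"` are hypothetical. Each return to the anchor node closes one cycle, which is treated as one coarse story.

```python
from collections import defaultdict


def build_scg(shot_labels):
    """Build a directed graph: nodes are shot labels, edges are observed transitions."""
    graph = defaultdict(set)
    for src, dst in zip(shot_labels, shot_labels[1:]):
        if src != dst:  # ignore transitions that stay on the same node
            graph[src].add(dst)
    return dict(graph)


def coarse_stories(shot_labels, anchor_label):
    """Split the shot sequence each time it returns to the anchor node.

    Every segment starts at an anchor shot and runs until just before the
    next return to the anchor, i.e. one cycle through the anchor node.
    Consecutive anchor shots are kept in the same segment.
    """
    stories, current = [], []
    for idx, label in enumerate(shot_labels):
        returned_to_anchor = (
            label == anchor_label
            and current
            and shot_labels[idx - 1] != anchor_label
        )
        if returned_to_anchor:
            stories.append(current)
            current = []
        current.append(idx)
    if current:
        stories.append(current)
    return stories


# Hypothetical labelled shot sequence (assumed input, not data from the paper).
shots = ["anchor", "A", "B", "anchor", "C", "anchor", "anchor", "D"]
scg = build_scg(shots)
stories = coarse_stories(shots, "anchor")
```

The refinement steps described in the abstract (weather and sports detection, merging of similar adjacent stories) would operate on the segments returned by `coarse_stories`; they are left out here because the abstract does not give enough detail to sketch them faithfully.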

Author information

Authors and Affiliations

School of Computer Science, University of Central Florida, Orlando, Florida, 32816
Yun Zhai, Alper Yilmaz & Mubarak Shah

Editor information

Editors and Affiliations

Dept. of Computer Science, National University of Singapore, Computing 1, 117590, Singapore
Wee-Kheng Leow
LIACS Media Lab, Leiden University
Michael S. Lew & Erwin M. Bakker
National University of Singapore, 3 Science Dr, 117543, Singapore
Tat-Seng Chua
Microsoft Research Asia, 4F, Sigma Center, No.49, Zhichun Road, 100080, Beijing, P.R.China
Wei-Ying Ma
School of Computing, National University of Singapore, 3 Science Drive 2, 117543, Singapore
Lekha Chaisorn

About this paper

Cite this paper

Zhai, Y., Yilmaz, A., Shah, M. (2005). Story Segmentation in News Videos Using Visual and Text Cues. In: Leow, WK., Lew, M.S., Chua, TS., Ma, WY., Chaisorn, L., Bakker, E.M. (eds) Image and Video Retrieval. CIVR 2005. Lecture Notes in Computer Science, vol 3568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11526346_13

DOI: https://doi.org/10.1007/11526346_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27858-0
Online ISBN: 978-3-540-31678-7
eBook Packages: Computer Science, Computer Science (R0)
