Submodular Pseudo-3D Network for Micro-Video Summarization

Authors

  • Xiaowei Gu South China University of Technology
  • Lu Lu School of Computer Science & Engineering; South China University of Technology; Guangzhou Higher Education Mega Centre, Panyu District, Guangzhou, China, 510006

DOI:

https://doi.org/10.5755/j01.itc.54.4.41063

Keywords:

video summarization, pseudo-3D network, deep diversity layers

Abstract

With the explosion of micro-videos, it is essential to develop video summariza-tion algorithms that simultaneously capture the diverse video content and represent theoriginal video. Many methods employ deep neural networks (DNNs). However, DNNsummarization models do not directly consider summary diversity. On the other hand,submodular functions can be seen as a form of diversity. However, the shallow structureprevents the data representation at a more abstract level. This paper proposes a novelsubmodular pseudo-3D network (SP3D), which equips submodular functions with a multi-layered network for micro-video summarization. Unlike standard DNNs, the proposedSP3D network and the correspondingoptimization method consider the interlock depen-dency among the selected frames, thus improving the diversity. The experimental resultsindicate that the proposed model and optimization method are effective in micro-videosummarization.

Downloads

Published

2025-12-19

Issue

Section

Articles