跳到主要导航 跳到搜索 跳到主要内容

Weakly-Supervised Movie Trailer Generation Driven by Multi-Modal Semantic Consistency

  • Sidan Zhu
  • , Yutong Wang
  • , Hongteng Xu
  • , Dixin Luo*
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

As an essential movie promotional tool, trailers are designed to capture the audience's interest through the skillful editing of key movie shots. Although some attempts have been made for automatic trailer generation, existing methods often rely on predefined rules or manual fine-grained annotations and fail to fully leverage the multi-modal information of movies, resulting in unsatisfactory trailer generation results. In this study, we introduce a weakly-supervised trailer generation method driven by multi-modal semantic consistency. Specifically, we design a multi-modal trailer generation framework that selects and sorts key movie shots based on input music and movie metadata (e.g., category tags and plot keywords) and adds narration to the generated trailer based on movie subtitles. We utilize two pseudo-scores derived from the proposed framework as labels and thus train the model under a weakly-supervised learning paradigm, ensuring trailerness consistency for key shot selection and emotion consistency for key shot sorting, respectively. As a result, we can learn the proposed model solely based on movie-trailer pairs without any fine-grained annotations. Both objective experimental results and subjective user studies demonstrate the superior performance of our method over previous works. The code is available at https://github.com/Dixin-Lab/MMSC.

源语言英语
主期刊名Proceedings of the 34th International Joint Conference on Artificial Intelligence, IJCAI 2025
编辑James Kwok
出版商International Joint Conferences on Artificial Intelligence
10234-10242
页数9
ISBN(电子版)9781956792065
DOI
出版状态已出版 - 2025
活动34th Internationa Joint Conference on Artificial Intelligence, IJCAI 2025 - Montreal, 加拿大
期限: 16 8月 202522 8月 2025

出版系列

姓名IJCAI International Joint Conference on Artificial Intelligence
ISSN(印刷版)1045-0823

会议

会议34th Internationa Joint Conference on Artificial Intelligence, IJCAI 2025
国家/地区加拿大
Montreal
时期16/08/2522/08/25

指纹

探究 'Weakly-Supervised Movie Trailer Generation Driven by Multi-Modal Semantic Consistency' 的科研主题。它们共同构成独一无二的指纹。

引用此