跳到主要导航 跳到搜索 跳到主要内容

Multi-Track Music Generation with WGAN-GP and Attention Mechanisms

  • Luyu Chen
  • , Lin Shen
  • , Dan Yu
  • , Zhihua Wang
  • , Kun Qian*
  • , Bin Hu*
  • , Björn W. Schuller
  • , Yoshiharu Yamamoto
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • The University of Tokyo
  • Imperial College London

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Music generation with artificial intelligence is a complex and captivating task. The utilisation of generative adversarial networks (GANs) has exhibited promising outcomes in producing realistic and diverse music compositions. In this paper, we propose a model based on Wasserstein GAN with gradient penalty (WGAN-GP) for multi-track music generation. This model incorporates self-attention and introduces a novel cross-attention mechanism in the generator to enhance its expressive capability. Additionally, we transpose all music to C major in training to ensure data consistency and quality. Experimental results demonstrate that our model can produce multi-track music with enhanced rhythm and sound characteristics, accelerate convergence, and improve generation quality.

源语言英语
主期刊名GCCE 2023 - 2023 IEEE 12th Global Conference on Consumer Electronics
出版商Institute of Electrical and Electronics Engineers Inc.
606-607
页数2
ISBN(电子版)9798350340181
DOI
出版状态已出版 - 2023
活动12th IEEE Global Conference on Consumer Electronics, GCCE 2023 - Nara, 日本
期限: 10 10月 202313 10月 2023

出版系列

姓名GCCE 2023 - 2023 IEEE 12th Global Conference on Consumer Electronics

会议

会议12th IEEE Global Conference on Consumer Electronics, GCCE 2023
国家/地区日本
Nara
时期10/10/2313/10/23

指纹

探究 'Multi-Track Music Generation with WGAN-GP and Attention Mechanisms' 的科研主题。它们共同构成独一无二的指纹。

引用此