MoonShot: Towards Controllable Video Generation and Editing with Motion-Aware Multimodal Conditions
Research Background and Problem Statement

In recent years, text-to-video diffusion models (video diffusion models, VDMs) have made significant progress, enabling the generation of high-quality, visually appealing videos. However, most existing VDMs r...