Moonshot: Towards Controllable Video Generation and Editing with Motion-Aware Multimodal Conditions

MoonShot——Towards Controllable Video Generation and Editing with Motion-Aware Multimodal Conditions Research Background and Problem Statement In recent years, text-to-video diffusion models (Video Diffusion Models, VDMs) have made significant progress, enabling the generation of high-quality, visually appealing videos. However, most existing VDMs r...

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

High-Quality Video Generation with Cascaded Latent Diffusion Models: LaVie Academic Background In recent years, with the breakthrough progress of Diffusion Models (DMs) in the field of image generation, Text-to-Image (T2I) generation technology has achieved significant success. However, extending this technology to Text-to-Video (T2V) generation st...