huggingface/diffusers

[Looking for community contribution] support Wan 2.2 S2V: an audio-driven cinematic video generation model

Open

#12257 opened on Aug 29, 2025

View on GitHub
 (4 comments) (1 reaction) (0 assignees)Python (22,190 stars) (4,562 forks)batch import
Good second issuecontributions-welcomehelp wanted

Description

We're super excited about the Wan 2.2 S2V (Speech-to-Video) model and want to get it integrated into Diffusers! This would be an amazing addition, and we're looking for experienced community contributors to help make this happen.

This is a priority for us, so we will try review fast and actively collabrate with you throughout the process :)

Contributor guide