huggingface/diffusers
[Proposals Welcome] Fal Flashpack integration for faster model loading
Open
#12564 opened on Oct 31, 2025
contributions-welcome, help wanted, stale
Description
Hey! 👋
We've had a request to explore integrating Fal's Flashpack for faster DiT and Text Encoder loading (https://github.com/huggingface/diffusers/issues/12550). Before we jump into implementation, we wanted to open this up to the community to gather ideas and hear from anyone who's experimented with this.
We'd love your input on:
- Performance: Has anyone tried it? What kind of speedups did you see? Are there any performance trade-offs?
- Integration Design: How would you approach integrating this into Diffusers? Describe your design at a high level: how would we support this in our existing framework, and what would the API look like?
We're looking for proposals and ideas rather than PRs at this stage. We're genuinely interested in hearing different approaches and perspectives from the community on this.
Feel free to share your thoughts!