huggingface/diffusers

[Proposals Welcome] Fal Flashpack integration for faster model loading

Open

#12564, opened on Oct 31, 2025

Labels: contributions-welcome, help wanted, stale

Description

Hey! 👋

We've had a request to explore integrating Fal's Flashpack for faster DiT and Text Encoder loading (https://github.com/huggingface/diffusers/issues/12550). Before we jump into implementation, we wanted to open this up to the community to gather ideas and hear from anyone who's experimented with this.

We'd love your input on:

  1. Performance: Has anyone tried it? What kind of speedups did you see? Are there any performance trade-offs?
  2. Integration Design: How would you approach integrating this into Diffusers? Describe your design at a high level: how would we support this in our existing framework, and what would the API look like?
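
If you plan to report speedups, a small timing harness makes the numbers easier to compare across replies. The following is only a sketch: `time_loader` and `speedup` are hypothetical helper names, not part of Diffusers or Flashpack, and the `time.sleep` loaders are stand-ins for whatever loading calls you are actually comparing.

```python
import statistics
import time
from typing import Callable, List


def time_loader(load_fn: Callable[[], object], repeats: int = 3) -> List[float]:
    """Return the wall-clock seconds taken by each call to load_fn."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        load_fn()  # the loaded object is discarded; only load time is measured
        timings.append(time.perf_counter() - start)
    return timings


def speedup(baseline: List[float], candidate: List[float]) -> float:
    """Median baseline time divided by median candidate time (>1 means faster)."""
    return statistics.median(baseline) / statistics.median(candidate)


if __name__ == "__main__":
    # Stand-in loaders (assumption): swap in your real baseline and
    # Flashpack-based loading calls here.
    baseline = time_loader(lambda: time.sleep(0.02))
    candidate = time_loader(lambda: time.sleep(0.01))
    print(f"speedup: {speedup(baseline, candidate):.2f}x")
```

When sharing results, it helps to say whether the first run was cold, since later runs may be served from the OS page cache, and to mention the model, dtype, storage type, and device.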

We're looking for proposals and ideas rather than PRs at this stage. We're genuinely interested in hearing different approaches and perspectives from the community on this.

Feel free to share your thoughts!
