huggingface/diffusers

[Proposals Welcome] Fal Flashpack integration for faster model loading

Open

#12564 opened on Oct 31, 2025

contributions-welcome, help wanted, stale

Description

Hey! 👋

We've had a request to explore integrating Fal's Flashpack for faster DiT and Text Encoder loading (https://github.com/huggingface/diffusers/issues/12550). Before we jump into implementation, we wanted to open this up to the community to gather ideas and hear from anyone who's experimented with this.

We'd love your input on:

  1. Performance: Has anyone tried it? What kind of speedups did you see? Are there any performance trade-offs?
  2. Integration Design: How would you approach integrating this into Diffusers? Describe your design at a high level: how would we support it in our existing framework, and what would the API look like?
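
For anyone sharing numbers under point 1, a small timing harness along these lines may help keep results comparable. It is only a sketch using the Python standard library: `time_loader` and `summarize` are hypothetical helper names, and the two loaders in the demo are `time.sleep` stand-ins that you would replace with your own baseline and Flashpack-based loading calls. Nothing here reflects an actual Flashpack or Diffusers API.

```python
import statistics
import time
from typing import Callable, Dict, List


def time_loader(load_fn: Callable[[], object], repeats: int = 3) -> List[float]:
    """Call load_fn `repeats` times and return the wall-clock seconds of each call."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        load_fn()  # the return value is discarded; only the elapsed time is kept
        timings.append(time.perf_counter() - start)
    return timings


def summarize(baseline: List[float], candidate: List[float]) -> Dict[str, float]:
    """Compare median load times; speedup > 1 means the candidate loads faster."""
    base = statistics.median(baseline)
    cand = statistics.median(candidate)
    return {"baseline_s": base, "candidate_s": cand, "speedup": base / cand}


if __name__ == "__main__":
    # Stand-in loaders (assumptions): swap in your real loading calls here.
    def baseline_load() -> None:
        time.sleep(0.02)

    def candidate_load() -> None:
        time.sleep(0.01)

    result = summarize(time_loader(baseline_load), time_loader(candidate_load))
    print(result)
```

When reporting, it would also help to say whether the runs were cold or warm with respect to the OS file cache, which storage the weights were read from, and the model size, since each of these can change load times considerably.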

We're looking for proposals and ideas rather than PRs at this stage. We're genuinely interested in hearing different approaches and perspectives from the community on this.

Feel free to share your thoughts!
