[Community Contributions] examples on distributed inference using đ€ Accelerate
#3.078 geöffnet am 4. Sept. 2024
Repository-Metriken
- Stars
- Â (5.805 Stars)
- PR-Merge-Metriken
- Â (Durchschn. Merge 13T 14h)Â (24 gemergte PRs in 30 T)
Beschreibung
The inference/distributed directory houses examples on running distributed inference with accelerate:
- Phi2 for language generation
- Stable Diffusion for image generation
The strategy followed there is to load an entire model onto each GPU and sending chunks of a batch through each GPUâs model copy at a time. Synthetic data generation has become an essential toolkit for every ML Engineer. So, it'd be beneficial to extend these examples to include some more use cases:
- Image captioning
- Speech data generation
Some nice to haves:
- Include artifact serialization as done in this
- Keep the artifact serialization code under a thread to not block GPU execution
How can you help?
You could help us contribute an example on any of the above-mentioned use cases or you can come up with your own đ€ Help us make the art of synthetic data generation scalable, easy, and accessible.