[Community Contributions] examples on distributed inference using 🤗 Accelerate · huggingface/accelerate#3078

(6 评论) (2 反应) (0 负责人)Python (626 fork)batch import

contributions-welcomegood first issuewip

仓库指标

Star: (5,805 star)
PR 合并指标: (平均合并 16天 17小时) (30 天内合并 23 个 PR)

描述

The inference/distributed directory houses examples on running distributed inference with accelerate:

Phi2 for language generation
Stable Diffusion for image generation

The strategy followed there is to load an entire model onto each GPU and sending chunks of a batch through each GPU’s model copy at a time. Synthetic data generation has become an essential toolkit for every ML Engineer. So, it'd be beneficial to extend these examples to include some more use cases:

Image captioning
Speech data generation

Some nice to haves:

Include artifact serialization as done in this
Keep the artifact serialization code under a thread to not block GPU execution

How can you help?

You could help us contribute an example on any of the above-mentioned use cases or you can come up with your own 🤗 Help us make the art of synthetic data generation scalable, easy, and accessible.

贡献者指南

研究方向: 研究 `examples/inference/distributed` 目录中现有的分布式推理示例。了解如何将模型加载到每个 GPU 上并处理批次。选择一个用于图像描述（例如 BLIP）或语音生成（例如 Whisper）的模型，并调整模式。可选地，添加使用线程的工件序列化以避免阻塞 GPU 执行。
技术栈: pythonpytorch
领域: machine learningbackend
议题类型: 功能
难度: 3
预计时间: 半天
活动状态: 活跃
清晰度: 清晰
前置要求: PythonPyTorchHugging Face Accelerate
新手友好度: 65

仓库指标

描述

How can you help?

贡献者指南

每天在邮箱收到新鲜 Easy issues。