[Community Contributions] examples on distributed inference using 🤗 Accelerate · huggingface/accelerate#3078

(4 comments) (2 reactions) (0 assignees)Python (626 forks)batch import

contributions-welcomegood first issuewip

Repository metrics

Stars: (5,805 stars)
PR merge metrics: (平均マージ 13d 14h) (30d で 24 merged PRs)

説明

The inference/distributed directory houses examples on running distributed inference with accelerate:

Phi2 for language generation
Stable Diffusion for image generation

The strategy followed there is to load an entire model onto each GPU and sending chunks of a batch through each GPU’s model copy at a time. Synthetic data generation has become an essential toolkit for every ML Engineer. So, it'd be beneficial to extend these examples to include some more use cases:

Image captioning
Speech data generation

Some nice to haves:

Include artifact serialization as done in this
Keep the artifact serialization code under a thread to not block GPU execution

How can you help?

You could help us contribute an example on any of the above-mentioned use cases or you can come up with your own 🤗 Help us make the art of synthetic data generation scalable, easy, and accessible.

コントリビューターガイド

調査方針: 既存の例は `inference/distributed` ディレクトリにあります。この課題は、画像キャプション生成と音声データ生成の例を追加することを提案しています。貢献者は、既存の Phi2 と Stable Diffusion の例を参照してパターンを理解し、そのパターンに従って新しい例を実装する必要があります。オプションとして、リンクされた gist で示されているアーティファクトのシリアライゼーションを含めることもできます。
技術スタック: pythonpytorch
領域: machine learningai
Issue 種別: 機能
難度: 3
推定時間: 1-3時間
活動状況: アクティブ
明確さ: 明確
前提条件: Basic PyTorchFamiliarity with Accelerate
初心者向け度: 60

Repository metrics

説明

How can you help?

コントリビューターガイド

新着 Easy issues をメールで受け取る。