Pytorchnn only uses one gpu. It's very inefficient when training a large amount of text.
贡献者指南
技术栈
pythonpytorch
领域
machine learningbackend
议题类型
feature
难度面向新贡献者的预计实现难度,1 表示很小改动,5 表示专家级工作。
5
预计时间有经验贡献者完成调查、实现、测试并准备 pull request 的粗略时间范围。
over 1 week
活动状态议题当前的可参与程度:新鲜、活跃、陈旧、阻塞或等待维护者输入。
stale
清晰度议题是否清楚说明期望改动、验收标准和下一步。
mostly clear
前置要求
Familiarity with PyTorchUnderstanding of multi GPU trainingExperience with Kaldi's codebaseKnowledge of CUDA
新手友好度1-100 的估计分数,表示该议题对首次贡献者的友好程度。
15
研究方向
The issue requests extending Kaldi's Pytorchnn module to utilize multiple GPUs. Start by examining the current single GPU implementation in the source code (likely under src/pytorchnn/). Research PyTorch's DataParallel and DistributedDataParallel modules. Check the issue comments for any suggested approaches or pointers from maintainers. Then propose a design that integrates multi GPU support without breaking existing functionality.