Lightning-AI/pytorch-lightning

Update deepspeed activation checkpointing docs

Open

#17,621 建立於 2023年5月12日

在 GitHub 查看
 (1 留言) (0 反應) (0 負責人)Python (26,687 star) (3,233 fork)batch import
docshelp wantedstrategy: deepspeed

描述

📚 Documentation

In your documentation, you refer to the function deepspeed.checkpointing.checkpoint, but it looks like it does not exist (anymore?). Can you update that section?

While we're at it, can you provide a more common use-case as an example? The guide warns against wrapping an entire model, but having a pretrained language model from transformers is probably the most common use case. What if someone is just using GPT-2 or T5 with no further mathematical layers? What should get wrapped then?

cc @borda @awaelchli

貢獻者指南