pytorch/ignite

Support for TorchSnapshot for efficient checkpoint saving and loading

Open

Aperta il 24 ott 2022

Vedi su GitHub
 (3 commenti) (0 reazioni) (0 assegnatari)Python (4313 star) (602 fork)batch import
enhancementhelp wanted

Descrizione

🚀 Feature

TorchSnapshot is a performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind. It includes many optimizations to control for memory usage and optimize checkpoint writing for DDP-style workloads over torch.save/torch.load. For more information, please check out the readme: https://github.com/pytorch/torchsnapshot#why-torchsnapshot

This could be a nice addition to Ignite, similar to the existing Checkpoint handler

cc @yifuwang

Guida contributor