help wanted
Description
I would like to request a new feature in the code: the ability to resume training from a checkpoint.
Currently, the code can save a checkpoint of the model's state at any point during training. However, there is no way to resume training from a checkpoint.
The code can save two things along with the model state_dict: 1)the optimizer, 2)the id of the last example it has seen (assuming the data is fed sequentially to the model not randomly)