Training still OOM on 8GB gpu. · lllyasviel/ControlNet#21

(2 comments) (3 reactions) (0 assignees)Python (2.500 forks)batch import

help wanted

Métricas do repositório

Stars: (27.526 stars)
Métricas de merge de PR: (Nenhuma PRs mesclada em 30d)

Description

It seems that I already used many tricks but training still OOM for 8GB gpu. But inference is good now.

This is strange because I know some textural inversion or dreambooth can be trained on 8GB.

What is the secrect of Automatic1111's optimization? Although xformers may help a bit, the currect sliced attention should require even smaller mem than xformers.

Does it make sence to move text encoder and vae outside gpu when training?

Guia do colaborador

Direção de pesquisa: Investigue o uso de memória durante o treinamento no ControlNet. Foque no mecanismo de atenção e compare com xformers. Faça um perfil de memória usando as ferramentas de memória do PyTorch. Considere mover o codificador de texto e o VAE para a CPU conforme sugerido. Verifique se há otimizações conhecidas no repositório do Automatic1111.
Pilha de tecnologia: pythonpytorch
Domain: machine learning
Tipo Issue: Bug
Difficulty: 3
Tempo estimado: 1-2 dias
Status da atividade: Antigo
Clarity: Precisa de investigação
Prerequisites: PyTorch basicsGPU memory managementdiffusion modelsattention mechanisms
Simpatia para novatos: 20

Métricas do repositório

Description

Guia do colaborador

Receba issues Easy novas por email.