FluxML/Flux.jl

Benchmark optimizer latency on GPU

Open

#1.699 aberto em 24 de ago. de 2021

Ver no GitHub
 (0 comments) (0 reactions) (0 assignees)Julia (619 forks)batch import
cudahelp wantedperformance

Métricas do repositório

Stars
 (4.725 stars)
Métricas de merge de PR
 (Mesclagem média 4h 27m) (2 fundiu PRs em 30d)

Description

Ref. https://github.com/JuliaGPU/GPUCompiler.jl/issues/65#issuecomment-903155469. We should do this for both the current mutating optimizers and for Optimisers.jl. It may be that TTFG is only part of the problem.

Guia do colaborador