FluxML/Flux.jl

Benchmark optimizer latency on GPU

Open

#1,699 opened on Aug 24, 2021

View on GitHub
 (0 comments) (0 reactions) (0 assignees)Julia (4,725 stars) (619 forks)batch import
cudahelp wantedperformance

Description

Ref. https://github.com/JuliaGPU/GPUCompiler.jl/issues/65#issuecomment-903155469. We should do this for both the current mutating optimizers and for Optimisers.jl. It may be that TTFG is only part of the problem.

Contributor guide