cudahelp wantedperformance
描述
Ref. https://github.com/JuliaGPU/GPUCompiler.jl/issues/65#issuecomment-903155469. We should do this for both the current mutating optimizers and for Optimisers.jl. It may be that TTFG is only part of the problem.
Ref. https://github.com/JuliaGPU/GPUCompiler.jl/issues/65#issuecomment-903155469. We should do this for both the current mutating optimizers and for Optimisers.jl. It may be that TTFG is only part of the problem.