alexrosen45/IpsumGPT
A modular implementation of a causal transformer with scaled dot-product attention, layer normalization, and residual connections.
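As a rough illustration of how the three pieces named in the description fit together, here is a minimal pure-Python sketch. It is not taken from the repository: the function names (`causal_attention`, `layer_norm`, `attention_block`), the single-head setup without learned projections, and the pre-norm ordering are all assumptions made for the example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def causal_attention(q, k, v):
    """Single-head scaled dot-product attention with a causal mask.

    q, k, v are lists of vectors, one per position.
    Position i attends only to positions 0..i.
    """
    d_k = len(k[0])
    out = []
    for i, q_i in enumerate(q):
        # Score position i against positions 0..i only; later ones are masked out.
        scores = [
            sum(a * b for a, b in zip(q_i, k[j])) / math.sqrt(d_k)
            for j in range(i + 1)
        ]
        weights = softmax(scores)
        # Weighted sum of the visible value vectors.
        out.append([
            sum(w * v[j][c] for j, w in enumerate(weights))
            for c in range(len(v[0]))
        ])
    return out


def layer_norm(x, eps=1e-5):
    """Normalize each vector to zero mean, unit variance (no learned scale or bias)."""
    normed = []
    for row in x:
        mean = sum(row) / len(row)
        var = sum((t - mean) ** 2 for t in row) / len(row)
        normed.append([(t - mean) / math.sqrt(var + eps) for t in row])
    return normed


def attention_block(x):
    """Hypothetical pre-norm residual block: x + attention(layer_norm(x))."""
    h = layer_norm(x)
    # Simplification: the normalized input serves as queries, keys, and values.
    attn = causal_attention(h, h, h)
    return [[a + b for a, b in zip(row, delta)] for row, delta in zip(x, attn)]
```

Because of the causal mask, changing a later position leaves the outputs at earlier positions unchanged. A real model would add learned query, key, and value projections, multiple heads, and a feed-forward sublayer.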
Details
Repository information