An auxiliary project analyzing the characteristics of KV in DiT attention.
(34 stars) (2 forks) (0 indexed issues) (0 open good first issues)
Repository
An auxiliary project analyzing the characteristics of KV in DiT attention.
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism