Repository Issues

Lightning-AI/lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.

Python (5,533 stars) (473 forks) (2 indexed issues) (PR metrics pending) Last commit: Jan 5, 2024

Issues

2 open indexed issues