A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..
Repositories
zepingyu0512 repositories
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
Must-read Papers on Knowledge Editing for Large Language Models.
code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
awesome SAE papers
awesome papers in LLM interpretability
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
sliced-rnn
AcadHomepage: A Modern and Responsive Academic Personal Homepage