[Transformers v5] Base model and LoRA used in test has incorrect `tokenizer_config.json` · vllm-project/vllm#38386

(9 commenti) (0 reazioni) (1 assegnatario)Python (16.816 fork)batch import

good first issuehelp wanted

Metriche repository

Star: (80.034 star)
Metriche merge PR: (Merge medio 9g 2h) (921 PR mergiate in 30 g)

Descrizione

This is a sub-issue forming part of the work in https://github.com/vllm-project/vllm/issues/38379, please read the description of this issue before beginning to work on this one.

Which test is failing?

The tokenizer_config.json is incorrect for both the base model and the adapter. If we duplicated these checkpoints and stored them inside https://huggingface.co/vllm-project, then we could own them and update the tokenizer class to be PreTrainedTokenizerFast which will almost always work.

$ pytest tests/lora/test_quant_model.py::test_quant_model_lora[model0]
...
AssertionError: assert ['#f07733: A ...#f08800: A v'] == ['#f07700: A ...#f00000: A v']
[2026-03-27T01:15:06Z]
[2026-03-27T01:15:06Z]   At index 0 diff: '#f07733: A v' != '#f07700: A v'
[2026-03-27T01:15:06Z]
[2026-03-27T01:15:06Z]   Full diff:
[2026-03-27T01:15:06Z]     [
[2026-03-27T01:15:06Z]   -     '#f07700: A v',
[2026-03-27T01:15:06Z]   ?           ^^
[2026-03-27T01:15:06Z]   +     '#f07733: A v',
[2026-03-27T01:15:06Z]   ?           ^^
[2026-03-27T01:15:06Z]   -     '#f00000: A v',
[2026-03-27T01:15:06Z]   ?         ^^
[2026-03-27T01:15:06Z]   +     '#f08800: A v',
[2026-03-27T01:15:06Z]   ?         ^^
[2026-03-27T01:15:06Z]     ]

How to configure my environment?

It's very important that you install both vLLM and Transformers from source so that your test results reflect the current state of both libraries.

# Or your fork
git clone https://github.com/huggingface/transformers.git
git clone https://github.com/vllm-project/vllm.git

cd vllm
VLLM_USE_PRECOMPILED=1 uv pip install -e .
uv pip install -e ../transformers

Guida contributor

Direzione di ricerca: Ispeziona il test fallito e traccia la causa della mancata corrispondenza della configurazione del tokenizer. Verifica i file tokenizer config.json del modello base e dell'adattatore, quindi considera di duplicare i checkpoint nell'organizzazione Hugging Face di vllm project e aggiornare la classe del tokenizer a PreTrainedTokenizerFast.
Tech stack: python
Dominio: backend
Tipo issue: Bug
Difficoltà: 2
Tempo stimato: 1-3 ore
Stato attività: Attiva
Chiarezza: Chiara
Prerequisiti: PythonGitHugging Face TransformersvLLM
Adatta ai principianti: 65

Metriche repository

Descrizione

Which test is failing?

How to configure my environment?

Guida contributor

Ricevi issue Easy fresche nella tua inbox.