[Transformers v5] Base model and LoRA used in test has incorrect `tokenizer_config.json` · vllm-project/vllm#38386

Métricas do repositório

Stars: (80.034 stars)
Métricas de merge de PR: (Mesclagem média 9d 2h) (921 fundiu PRs em 30d)

Description

This is a sub-issue forming part of the work in https://github.com/vllm-project/vllm/issues/38379, please read the description of this issue before beginning to work on this one.

Which test is failing?

The tokenizer_config.json is incorrect for both the base model and the adapter. If we duplicated these checkpoints and stored them inside https://huggingface.co/vllm-project, then we could own them and update the tokenizer class to be PreTrainedTokenizerFast which will almost always work.

$ pytest tests/lora/test_quant_model.py::test_quant_model_lora[model0]
...
AssertionError: assert ['#f07733: A ...#f08800: A v'] == ['#f07700: A ...#f00000: A v']
[2026-03-27T01:15:06Z]
[2026-03-27T01:15:06Z]   At index 0 diff: '#f07733: A v' != '#f07700: A v'
[2026-03-27T01:15:06Z]
[2026-03-27T01:15:06Z]   Full diff:
[2026-03-27T01:15:06Z]     [
[2026-03-27T01:15:06Z]   -     '#f07700: A v',
[2026-03-27T01:15:06Z]   ?           ^^
[2026-03-27T01:15:06Z]   +     '#f07733: A v',
[2026-03-27T01:15:06Z]   ?           ^^
[2026-03-27T01:15:06Z]   -     '#f00000: A v',
[2026-03-27T01:15:06Z]   ?         ^^
[2026-03-27T01:15:06Z]   +     '#f08800: A v',
[2026-03-27T01:15:06Z]   ?         ^^
[2026-03-27T01:15:06Z]     ]

How to configure my environment?

It's very important that you install both vLLM and Transformers from source so that your test results reflect the current state of both libraries.

# Or your fork
git clone https://github.com/huggingface/transformers.git
git clone https://github.com/vllm-project/vllm.git

cd vllm
VLLM_USE_PRECOMPILED=1 uv pip install -e .
uv pip install -e ../transformers

Guia do colaborador

Direção de pesquisa: Inspecione o teste com falha e rastreie a causa da incompatibilidade da configuração do tokenizer. Verifique os arquivos tokenizer config.json do modelo base e do adaptador, e considere duplicar os checkpoints na organização Hugging Face do vllm project e atualizar a classe do tokenizer para PreTrainedTokenizerFast.
Pilha de tecnologia: python
Domain: backend
Tipo Issue: Bug
Difficulty: 2
Tempo estimado: 1-3 horas
Status da atividade: Ativo
Clarity: Claro
Prerequisites: PythonGitHugging Face TransformersvLLM
Simpatia para novatos: 65

Métricas do repositório

Description

Which test is failing?

How to configure my environment?

Guia do colaborador

Receba issues Easy novas por email.