[Transformers v5] InternVL2 · vllm-project/vllm#38425

Repository metrics

Stars: (80,034 stars)
PR merge metrics: (平均マージ 3d 17h) (30d で 993 merged PRs)

説明

This is a sub-issue forming part of the work in https://github.com/vllm-project/vllm/issues/38379, please read the description of this issue before beginning to work on this one.

Which test is failing?

Transformers v5 creates the model on the meta device first, then loads the weights, similarly to what vLLM does. The issue here is that the custom model code in the checkpoint tries to use real tensors as part of model structure construction.

Since the issue here is with the HF reference generation, this cannot be fixed in vLLM (other than skipping the tests until the model works with Transformers v5). The proper solution to this issue is to upstream this architecture, which shouldn't be too hard using Modular Transformers as the text backbone is Qwen2 so that can be reused.

$ pytest tests/models/multimodal/generation/test_common.py::test_single_image_models[intern_vl-test_case25]
...
RuntimeError: Tensor.item() cannot be called on meta tensors

How to configure my environment?

It's very important that you install both vLLM and Transformers from source so that your test results reflect the current state of both libraries.

# Or your fork
git clone https://github.com/huggingface/transformers.git
git clone https://github.com/vllm-project/vllm.git

cd vllm
VLLM_USE_PRECOMPILED=1 uv pip install -e .
uv pip install -e ../transformers

コントリビューターガイド

調査方針: 失敗しているテストとカスタムモデルコードを調査し、「Tensor.item()をmetaテンソルで呼び出せない」エラーを解決します。おそらく、Modular Transformersを使用してアーキテクチャをアップストリームし、Qwen2テキストバックボーンを再利用する必要があります。
技術スタック: python
領域: backendinfrastructure
Issue 種別: バグ
難度: 3
推定時間: 1-2日
活動状況: アクティブ
明確さ: 明確
前提条件: GitPythonPyTorch
初心者向け度: 40

Repository metrics

説明

Which test is failing?

How to configure my environment?

コントリビューターガイド

新着 Easy issues をメールで受け取る。