Repositórios

Repositórios de vllm-project

This repo hosts code for vLLM CI & Performance Benchmark infrastructure.

Último commit 6 de jun. de 2026

 (42 stars) (68 forks) (0 issues indexadas) (0 good first issues abertas)

Fast and memory-efficient exact attention

Último commit 30 de mai. de 2026

 (124 stars) (148 forks) (0 issues indexadas) (0 good first issues abertas)

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Último commit 22 de mai. de 2026

 (1.166 stars) (156 forks) (0 issues indexadas) (0 good first issues abertas)

Common recipes to run vLLM

Último commit 7 de jun. de 2026

 (833 stars) (292 forks) (0 issues indexadas) (0 good first issues abertas)

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Último commit 8 de jun. de 2026

 (4.293 stars) (699 forks) (0 issues indexadas) (0 good first issues abertas)

TPU inference for vLLM, with unified JAX and PyTorch support.

Último commit 7 de jun. de 2026

 (348 stars) (205 forks) (0 issues indexadas) (0 good first issues abertas)

A high-throughput and memory-efficient inference and serving engine for LLMs

Último commit 15 de mai. de 2026

 (80.034 stars) (16.816 forks) (61 issues indexadas) (55 good first issues abertas)

Community maintained hardware plugin for vLLM on Ascend

Último commit 2 de jun. de 2026

 (2.180 stars) (1.318 forks) (5 issues indexadas) (5 good first issues abertas)

Manages vllm-nccl dependency

Último commit 3 de jun. de 2024

 (18 stars) (3 forks) (0 issues indexadas) (0 good first issues abertas)

A framework for efficient model inference with omni-modality models

Último commit 8 de jun. de 2026

 (4.990 stars) (1.067 forks) (0 issues indexadas) (0 good first issues abertas)

Último commit 5 de jun. de 2026

 (43 stars) (101 forks) (0 issues indexadas) (0 good first issues abertas)