ggml-org/llama.cpp

Eval bug: wrong default threads in llama-bench

Open

#17611 opened on Nov 30, 2025

(6 comments) (0 reactions) (0 assignees) C++ (110,169 stars) (18,202 forks)
Labels: good first issue, low severity, performance

Description

Name and Version

latest commit: https://github.com/ggml-org/llama.cpp/commit/fa0465954faef9d7170b967ad89f8bc5303a32f3

llama-bench uses a wrong default for threads.

The machine is a Linux server with NUMA nodes.

Operating systems

Linux

GGML backends

CPU

Hardware

Supermicro X10DRi (dual CPU)
