Eval bug: wrong default threads in llama-bench · ggml-org/llama.cpp#17611

(11 comments) (0 reactions) (0 assignees)C++ (18,202 forks)batch import

good first issuelow severityperformance

Repository metrics

llama-bench using by default wrong threads.

Linux server with NUMA nodes

Linux

CPU

Supermicro X10dri Dual CPU

Research direction: Investigate the default thread count selection logic in llama bench, focusing on how it detects CPU topology and handles NUMA systems.
Tech stack: cpp
Domain: backend
Issue type: Bug
Difficulty: 2
Estimated time: 1-3 hours
Activity status: Active
Clarity: Mostly clear
Prerequisites: C++threading concepts
Newbie friendliness: 70