good first issuelow severityperformance
Description
Name and Version
latest commit: https://github.com/ggml-org/llama.cpp/commit/fa0465954faef9d7170b967ad89f8bc5303a32f3
llama-bench using by default wrong threads.
Linux server with NUMA nodes
Operating systems
Linux
GGML backends
CPU
Hardware
Supermicro X10dri Dual CPU