ggml-org/whisper.cpp

NPU support in whisper.cpp

Open

#1 557 ouverte le 27 nov. 2023

Voir sur GitHub
 (18 commentaires) (12 réactions) (0 assignés)C++ (49 693 stars) (5 535 forks)batch import
good first issueperformanceresearch🔬

Description

Christmas is coming soon, and I want to take some time to research something interesting, such as edge low-power inference. Although current whisper.cpp can run on Raspberry Pi, the inference performance cannot achieve real-time transcription. Fortunately, there are now some development boards that use processors with NPUs, which can be used to achieve real-time transcription of large models. My primary goal is to first support RK3566 and RK3588.

Roadmap:

  • MatMul offloading
  • Conv-Gelu offloading
  • LayerNorm offloading ...

Reference:

https://github.com/rockchip-linux/rknpu2

Guide contributeur