ggml-org/whisper.cpp

NPU support in whisper.cpp

Open

#1,557 opened on 2023年11月27日

GitHub で見る
 (18 comments) (12 reactions) (0 assignees)C++ (49,693 stars) (5,535 forks)batch import
good first issueperformanceresearch🔬

説明

Christmas is coming soon, and I want to take some time to research something interesting, such as edge low-power inference. Although current whisper.cpp can run on Raspberry Pi, the inference performance cannot achieve real-time transcription. Fortunately, there are now some development boards that use processors with NPUs, which can be used to achieve real-time transcription of large models. My primary goal is to first support RK3566 and RK3588.

Roadmap:

  • MatMul offloading
  • Conv-Gelu offloading
  • LayerNorm offloading ...

Reference:

https://github.com/rockchip-linux/rknpu2

コントリビューターガイド