ggml-org/whisper.cpp

NPU support in whisper.cpp

Open

#1,557 建立於 2023年11月27日

在 GitHub 查看
 (18 留言) (12 反應) (0 負責人)C++ (49,693 star) (5,535 fork)batch import
good first issueperformanceresearch🔬

描述

Christmas is coming soon, and I want to take some time to research something interesting, such as edge low-power inference. Although current whisper.cpp can run on Raspberry Pi, the inference performance cannot achieve real-time transcription. Fortunately, there are now some development boards that use processors with NPUs, which can be used to achieve real-time transcription of large models. My primary goal is to first support RK3566 and RK3588.

Roadmap:

  • MatMul offloading
  • Conv-Gelu offloading
  • LayerNorm offloading ...

Reference:

https://github.com/rockchip-linux/rknpu2

貢獻者指南