ggml-org/whisper.cpp

NPU support in whisper.cpp

Open

Opened on Nov 27, 2023

Labels: good first issue, performance, research🔬

Description

Christmas is coming soon, and I want to spend some time researching something interesting: low-power inference on edge devices. Although whisper.cpp currently runs on a Raspberry Pi, inference is not fast enough for real-time transcription. Fortunately, some development boards now ship with processors that include an NPU, which could make real-time transcription possible even with the large models. My primary goal is to support the RK3566 and RK3588 first.

Roadmap:

  • MatMul offloading
  • Conv-Gelu offloading
  • LayerNorm offloading ...
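
To make the first roadmap item concrete, here is a minimal sketch of what a MatMul offload path could look like: try an accelerator hook first, and fall back to a plain CPU loop when the hook is missing or declines the shape. Everything in it is an assumption for illustration. `MatMulOffloadFn`, `matmul_cpu`, and `matmul_dispatch` are hypothetical names, not part of whisper.cpp, ggml, or rknpu2, and the hook is a placeholder for wherever a real NPU call would go.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical offload hook (not an rknpu2 or ggml API).
// It returns true if the accelerator computed C, false if it declined.
using MatMulOffloadFn = std::function<bool(const float* a, const float* b, float* c,
                                           std::size_t m, std::size_t k, std::size_t n)>;

// Reference CPU path: C (m x n) = A (m x k) * B (k x n), all row-major.
void matmul_cpu(const float* a, const float* b, float* c,
                std::size_t m, std::size_t k, std::size_t n) {
    for (std::size_t i = 0; i < m * n; ++i) {
        c[i] = 0.0f;
    }
    for (std::size_t i = 0; i < m; ++i) {
        for (std::size_t p = 0; p < k; ++p) {
            const float a_ip = a[i * k + p];
            for (std::size_t j = 0; j < n; ++j) {
                c[i * n + j] += a_ip * b[p * n + j];
            }
        }
    }
}

// Try the offload hook first; fall back to the CPU loop if the hook is
// empty or reports that it could not handle this call.
// Returns true when the result came from the hook.
bool matmul_dispatch(const MatMulOffloadFn& npu,
                     const float* a, const float* b, float* c,
                     std::size_t m, std::size_t k, std::size_t n) {
    assert(a != nullptr && b != nullptr && c != nullptr);
    if (npu && npu(a, b, c, m, k, n)) {
        return true;
    }
    matmul_cpu(a, b, c, m, k, n);
    return false;
}
```

Keeping the CPU fallback behind the same entry point means unsupported shapes or data types would still produce a result, and the same pattern could be repeated for the Conv-Gelu and LayerNorm items.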

Reference:

https://github.com/rockchip-linux/rknpu2
