[TASK][CHALLENGE] Offline GPT backend for Chat engine · apache/kyuubi#4555

Métricas do repositório

Currently, the Kyuubi supports Chat engine by invoking the ChatGPT online open API

Client => Kyuubi Server => Kyuubi Chat engine (invoke ChatGPT REST API)

But, is there any chance we can add a built-in offline ChatGPT engine? The answer is YES.

Client => Kyuubi Server => Kyuubi Chat engine (do prediction using local GPT model)

There is a project https://github.com/karpathy/nanoGPT which can train the GPT-2 model.

... currently the file train.py reproduces GPT-2 (124M) on OpenWebText, running on a single 8XA100 40GB node in about 4 days of training.

So, the basic idea is, training a GPT-like model, and Kyuubi Chat engine invoking this model to answer the question.

There are some specific questions:

Given that Kyuubi is a kind of SQL gateway, we may want it to be smart in SQL area, then how to choose the training DataSet?
karpathy/nanoGPT written in PyTorch, how can Kyuubi Chat engine invoke it? The options may be
- use PyTorch Java SDK(does such thing exist?) to load model and Kyuubi Chat engine simply calls function to do prediction
- launch a PyTorch serving service which expose RESTful/gRPC/thrift api, and Kyuubi Chat engine using RPC call for prediction

Also, there is another interesting project https://github.com/ggerganov/llama.cpp

Yes. I would be willing to submit a PR with guidance from the Kyuubi community to improve.
No. I cannot submit a PR at this time.

Direção de pesquisa: Pesquise opções para integração GPT offline: 1) Escolha um conjunto de dados focado em SQL para treinar um modelo similar ao GPT (por exemplo, usando nanoGPT). 2) Avalie métodos de integração: PyTorch Java SDK ou implantação de um serviço PyTorch serving. 3) Prototipe uma integração simples para entender viabilidade e desempenho.
Pilha de tecnologia: pytorchpythonscala
Domain: backendaimachine learning
Tipo Issue: Funcionalidade
Difficulty: 4
Tempo estimado: Mais de 1 semana
Status da atividade: Ativo
Clarity: Precisa de investigação
Prerequisites: PyTorchPythonScala
Simpatia para novatos: 25