Instruct GPT-J · ggml-org/ggml#50

(1 commento) (2 reazioni) (0 assegnatari)C++ (14.741 star) (1646 fork)auto 404

enhancementgood first issue

Descrizione

Someone fine-tuned GPT-J on the Alpaca instruction dataset using PETF:

peft_model_id = "crumb/Instruct-GPT-J"
config = PeftConfig.from_pretrained(peft_model_id)
model = AutoModelForCausalLM.from_pretrained(config.base_model_name_or_path, return_dict=True, load_in_8bit=True, device_map='auto', revision='sharded')
tokenizer = AutoTokenizer.from_pretrained(config.base_model_name_or_path)
# Load the Lora model
model = PeftModel.from_pretrained(model, peft_model_id)

# This example is in the alpaca training set
batch = tokenizer("Below is an instruction that describes a task. Write a response that appropriately completes the request. ### Instruction: How can we reduce air pollution? ### Response:", return_tensors='pt')

Recently I have successfully tried GPT-J model itself on GGML, using converted binary provided, so I suppose InstructGPT-J it should work off the shelf converting the checkpoint and then doing quantization.

Model adapter is here

Guida contributor

Tech stack: pythoncpp
Dominio: machine learning
Tipo issue: feature
Difficoltà: 3
Tempo stimato: 1-2 days
Stato attività: active
Chiarezza: clear
Prerequisiti: GitPythonC++
Adatta ai principianti: 40
Direzione di ricerca: Examine the adapter model.bin file, understand the PEFT and GGML conversion scripts, and implement a conversion or loading path for Instruct GPT J.