Instruct GPT-J · ggml-org/ggml#50

(1 comment) (2 reactions) (0 assignees) C++ (1,646 forks)

enhancement, good first issue

Repository metrics

Stars: (14,741 stars)
PR merge metrics: (Avg merge 33m) (12 merged PRs in 30d)

Description

Someone fine-tuned GPT-J on the Alpaca instruction dataset using PEFT:

from peft import PeftConfig, PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

peft_model_id = "crumb/Instruct-GPT-J"
config = PeftConfig.from_pretrained(peft_model_id)
# Load the base GPT-J model in 8-bit from the 'sharded' revision
model = AutoModelForCausalLM.from_pretrained(config.base_model_name_or_path, return_dict=True, load_in_8bit=True, device_map='auto', revision='sharded')
tokenizer = AutoTokenizer.from_pretrained(config.base_model_name_or_path)
# Load the LoRA model
model = PeftModel.from_pretrained(model, peft_model_id)

# This example is in the alpaca training set
batch = tokenizer("Below is an instruction that describes a task. Write a response that appropriately completes the request. ### Instruction: How can we reduce air pollution? ### Response:", return_tensors='pt')

I recently ran the GPT-J model itself successfully on GGML using the provided converted binary, so I suppose Instruct GPT-J should work off the shelf after converting the checkpoint and then quantizing it.

Model adapter is here
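
One possible route, assuming the adapter is a standard LoRA adapter (the "Load the Lora model" comment above suggests so), is to fold the low-rank update into the base weights first and then run the existing GPT-J conversion and quantization on the merged checkpoint. The toy sketch below only illustrates the merge arithmetic, W' = W + (lora_alpha / r) * B @ A, on small pure-Python matrices. The function names and the example values are hypothetical and are not taken from the PEFT or GGML code.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(weight, lora_a, lora_b, lora_alpha, r):
    """Return W + (lora_alpha / r) * (B @ A) for one linear layer.

    weight: out_features x in_features
    lora_a: r x in_features
    lora_b: out_features x r
    """
    scale = lora_alpha / r
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Toy example (assumed values): W is 2x3, B is 2x1, A is 1x3, so rank r = 1.
weight = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
lora_a = [[1.0, 2.0, 3.0]]
lora_b = [[0.5], [-0.5]]
merged = merge_lora(weight, lora_a, lora_b, lora_alpha=2, r=1)
print(merged)
```

Once every adapted layer has been merged this way, the result has the same tensor shapes as plain GPT-J, which is why the existing conversion path might apply unchanged. Whether the 8-bit base weights need to be dequantized before merging is an open question here.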

Contributor guide

Research direction: Examine the adapter_model.bin file, understand the PEFT and GGML conversion scripts, and implement a conversion or loading path for Instruct GPT-J.
Tech stack: Python, C++
Domain: machine learning
Issue type: Feature
Difficulty: 3
Estimated time: 1-2 days
Activity status: Active
Clarity: Clear
Prerequisites: Git, Python, C++
Newbie friendliness: 40
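
As a starting point for the research direction above, a first step could be to list the tensor names in the adapter file and pair each module's low-rank factors. The sketch below assumes PEFT-style key names containing ".lora_A." and ".lora_B."; the example keys are illustrative assumptions, not the verified contents of this adapter, and `pair_lora_keys` is a hypothetical helper.

```python
def pair_lora_keys(keys):
    """Group adapter tensor names by the module they modify.

    Returns {module_name: {"lora_A": key, "lora_B": key}}.
    """
    pairs = {}
    for key in keys:
        for tag in ("lora_A", "lora_B"):
            marker = "." + tag + "."
            if marker in key:
                module = key.split(marker)[0]
                pairs.setdefault(module, {})[tag] = key
    return pairs


# Assumed key names, for illustration only.
example_keys = [
    "base_model.model.transformer.h.0.attn.q_proj.lora_A.weight",
    "base_model.model.transformer.h.0.attn.q_proj.lora_B.weight",
    "base_model.model.transformer.h.0.attn.v_proj.lora_A.weight",
    "base_model.model.transformer.h.0.attn.v_proj.lora_B.weight",
]
pairs = pair_lora_keys(example_keys)
for module, factors in sorted(pairs.items()):
    print(module, sorted(factors))
```

With the real file, the keys would come from the loaded state dict instead of a hard-coded list, and each pair could then be merged into the matching base weight before conversion.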
