Instructions to use robinsmits/Schaapje-2B-Chat-V1.0 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use robinsmits/Schaapje-2B-Chat-V1.0 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="robinsmits/Schaapje-2B-Chat-V1.0") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("robinsmits/Schaapje-2B-Chat-V1.0") model = AutoModelForCausalLM.from_pretrained("robinsmits/Schaapje-2B-Chat-V1.0") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use robinsmits/Schaapje-2B-Chat-V1.0 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "robinsmits/Schaapje-2B-Chat-V1.0" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "robinsmits/Schaapje-2B-Chat-V1.0", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/robinsmits/Schaapje-2B-Chat-V1.0
- SGLang
How to use robinsmits/Schaapje-2B-Chat-V1.0 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "robinsmits/Schaapje-2B-Chat-V1.0" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "robinsmits/Schaapje-2B-Chat-V1.0", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "robinsmits/Schaapje-2B-Chat-V1.0" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "robinsmits/Schaapje-2B-Chat-V1.0", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use robinsmits/Schaapje-2B-Chat-V1.0 with Docker Model Runner:
docker model run hf.co/robinsmits/Schaapje-2B-Chat-V1.0
Schaapje-2B-Chat-V1.0
Model description
This is the DPO aligned model based on the SFT trained model Schaapje-2B-Chat-SFT-V1.0.
General Dutch Chat and/or Instruction following works quitte well with this model.
Model usage
A basic example of how to use this DPO aligned model for Chat or Instruction following.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
device = 'cuda'
model_name = 'robinsmits/Schaapje-2B-Chat-V1.0'
model = AutoModelForCausalLM.from_pretrained(model_name,
device_map = "auto",
torch_dtype = torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained(model_name)
messages = [{"role": "user", "content": "Hoi hoe gaat het ermee?"}]
chat = tokenizer.apply_chat_template(messages,
tokenize = False,
add_generation_prompt = True)
input_tokens = tokenizer(chat, return_tensors = "pt").to('cuda')
output = model.generate(**input_tokens,
max_new_tokens = 512,
do_sample = True)
output = tokenizer.decode(output[0], skip_special_tokens = False)
print(output)
Intended uses & limitations
As with all LLM's this model can also experience bias and hallucinations. Regardless of how you use this model always perform the necessary testing and validation.
Datasets and Licenses
The following dataset was used for DPO alignment:
- BramVanroy/ultra_feedback_dutch_cleaned: apache-2.0
Model Training
The notebook used to train this DPO aligned model is available at the following link: Schaapje-2B-Chat-DPO-V1.0
- Downloads last month
- 5
Model tree for robinsmits/Schaapje-2B-Chat-V1.0
Base model
ibm-granite/granite-3.0-2b-base