Text Generation
Transformers
English
vortex
science
physics
chemistry
biology
mathematics
ssm
mamba
hybrid-architecture
custom-tokenizer
from-scratch
matrix-corp
Instructions to use Matrix-Corp/Vortex-7b-V1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Matrix-Corp/Vortex-7b-V1 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Matrix-Corp/Vortex-7b-V1")

# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("Matrix-Corp/Vortex-7b-V1", dtype="auto")

- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use Matrix-Corp/Vortex-7b-V1 with vLLM:
Install from pip and serve model
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "Matrix-Corp/Vortex-7b-V1"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "Matrix-Corp/Vortex-7b-V1",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'

Use Docker
docker model run hf.co/Matrix-Corp/Vortex-7b-V1
- SGLang
How to use Matrix-Corp/Vortex-7b-V1 with SGLang:
Install from pip and serve model
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "Matrix-Corp/Vortex-7b-V1" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "Matrix-Corp/Vortex-7b-V1",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'

Use Docker images
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "Matrix-Corp/Vortex-7b-V1" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "Matrix-Corp/Vortex-7b-V1",
    "prompt": "Once upon a time,",
    "max_tokens": 512,
    "temperature": 0.5
  }'

- Docker Model Runner
How to use Matrix-Corp/Vortex-7b-V1 with Docker Model Runner:
docker model run hf.co/Matrix-Corp/Vortex-7b-V1
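The curl calls above can also be issued from Python. The following is a minimal sketch using only the standard library, assuming one of the servers above is already running locally on the port shown (8000 for vLLM, 30000 for SGLang). The helper names `build_completion_request` and `complete` are hypothetical and introduced here for illustration; the endpoint path and JSON fields are copied from the curl examples.

```python
import json
import urllib.request

MODEL_ID = "Matrix-Corp/Vortex-7b-V1"


def build_completion_request(base_url, prompt, max_tokens=512, temperature=0.5):
    """Build (but do not send) a POST request for the /v1/completions endpoint.

    Hypothetical helper: it mirrors the JSON body used in the curl examples.
    """
    payload = {
        "model": MODEL_ID,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(base_url, prompt, **kwargs):
    """Send the request and return the generated text of the first choice.

    Assumes the server answers in the OpenAI completions format, where the
    text lives under choices[0]["text"].
    """
    request = build_completion_request(base_url, prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=120) as response:
        body = json.load(response)
    return body["choices"][0]["text"]
```

With a vLLM server running, `complete("http://localhost:8000", "Once upon a time,")` sends the same request as the vLLM curl example; pass `"http://localhost:30000"` to target SGLang instead. Keeping request construction separate from sending makes the payload easy to inspect without a running server.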
# Core dependencies
torch>=2.2.0
transformers>=4.40.0
accelerate>=0.30.0
datasets>=2.18.0
tokenizers>=0.19.0

# Quantization
bitsandbytes>=0.43.0

# Flash Attention (CUDA only)
flash-attn>=2.5.0

# Scientific computing
numpy>=1.26.0
scipy>=1.12.0
scikit-learn>=1.4.0

# Chemistry/Biology
rdkit>=2023.9.0
pubchempy>=1.0.4

# Web scraping
arxiv>=2.1.0
beautifulsoup4>=4.12.0
requests>=2.31.0

# Data processing
pandas>=2.0.0
pyarrow>=14.0.0

# LaTeX parsing
pylatexenc>=2.10

# Deduplication
minhash>=0.1.0

# Utilities
tqdm>=4.65.0
psutil>=5.9.0
jsonlines>=3.1.0

# Optional: wandb for logging
# wandb>=0.16.0

# Development/testing
pytest>=7.0.0
black>=23.0.0
flake8>=6.0.0
mypy>=1.0.0
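To see which of the dependencies listed above are already satisfied in the current environment, a small check along these lines may help. It is a sketch using only the standard library; the function names are hypothetical, and the version comparison is deliberately naive (it compares the first three numeric components and ignores pre-release or local suffixes), so treat its output as a hint rather than what pip would decide. Only `name>=version` lines are handled, which matches the list above.

```python
import re
from importlib import metadata

# A short excerpt of the dependency list; paste the full list here as needed.
REQUIREMENTS = """
# Core dependencies
torch>=2.2.0
transformers>=4.40.0
# wandb>=0.16.0
"""


def parse_requirements(text):
    """Return (name, minimum_version) pairs, skipping blank and comment lines."""
    pairs = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, _, minimum = line.partition(">=")
        pairs.append((name.strip(), minimum.strip()))
    return pairs


def version_key(version):
    """Naive key: first three numeric components, padded with zeros."""
    numbers = [int(n) for n in re.findall(r"\d+", version)[:3]]
    return tuple(numbers + [0] * (3 - len(numbers)))


def check_requirements(text):
    """Map each package name to 'ok', 'missing', or a 'too old' message."""
    report = {}
    for name, minimum in parse_requirements(text):
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        if version_key(installed) >= version_key(minimum):
            report[name] = "ok"
        else:
            report[name] = f"too old ({installed} < {minimum})"
    return report
```

Calling `check_requirements(REQUIREMENTS)` returns a dictionary keyed by package name, which can be printed or filtered for entries that are not `"ok"`.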