Inference Providers
Active filters: torchao
medmekk/Llama-3.2-1B-ao-autoquant-1
Text Generation
• Updated • 7
medmekk/Llama-3.2-1B-ao-float8wo-2
Text Generation
• Updated • 8
medmekk/Llama-3.2-1B-ao-float8wo-3
Text Generation
• Updated • 5
medmekk/Llama-3.2-1B-ao-int8wo-gs256
Text Generation
• Updated • 7
medmekk/Llama-3.2-1B-ao-int4wo-gs128
Text Generation
• Updated • 6
medmekk/Qwen2.5-0.5B-Instruct-ao-float8wo
Text Generation
• Updated • 5
medmekk/Llama-3.2-1B-ao-int4wo-gs256
Text Generation
• Updated • 5
medmekk/Qwen2.5-VL-7B-Instruct-ao-float8wo
medmekk/Qwen2.5-VL-7B-Instruct-ao-int8wo
medmekk/Llama-3.1-8B-Instruct-ao-int8wo
Text Generation
• Updated • 2
medmekk/Qwen2.5-VL-7B-Instruct-ao-int8da8w8
medmekk/Llama-3.1-8B-Instruct-ao-autoquant
Text Generation
• Updated • 2
medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs128
Text Generation
• Updated • 2
medmekk/Llama-3.1-8B-Instruct-ao-float8wo
Text Generation
• Updated • 5
medmekk/Llama-3.1-8B-Instruct-ao-float8da8w8
Text Generation
• Updated • 5
medmekk/Llama-3.1-8B-Instruct-ao-int8da8w8
Text Generation
• Updated • 4
medmekk/Llama-3.1-8B-Instruct-ao-float8da8w8-2
Text Generation
• Updated • 2
medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs32
Text Generation
• Updated • 6
medmekk/Llama-3.1-8B-Instruct-ao-int4wo-gs16
Text Generation
• Updated • 2
Erland/vanilla-340M-4096-model-AO-W4
Text Generation
• Updated • 4
irresistiblegrace97/TinyLlama-1.1B-Chat-v1.0-torchao-int4_weight_only-gs_4096
Erland/softpick-340M-4096-model-AO-W4
Text Generation
• Updated • 6
Erland/softpick-340M-4096-model-AO-W4A4
Text Generation
• Updated • 5
Erland/vanilla-340M-4096-model-AO-W4A4
Text Generation
• Updated • 4
irresistiblegrace97/tinyllama.gguf
jerryzh168/opt-125m-int4wo
Text Generation
• Updated • 5
Text Generation
• Updated • 89
• 2
Text Generation
• Updated • 2.32k
jerryzh168/opt-125m-int4wo-per-module
Text Generation
• Updated • 25
pytorch/Qwen3-4B-INT8-INT4
Text Generation
• Updated • 3.81k
• 2