-
-
-
-
-
-
Inference Providers
Active filters:
fp_quant
ISTA-DASLab/Qwen3-8B-FPQuant-RTN-MXFP4
Text Generation
•
5B
•
Updated
•
10
ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-GPTQ-MXFP4
5B
•
Updated
•
6
ISTA-DASLab/Qwen3-8B-FPQuant-RTNv2-MXFP4
5B
•
Updated
•
6
ISTA-DASLab/Qwen3-8B-FPQuant-GPTQ-MXFP4
5B
•
Updated
•
7
ISTA-DASLab/Qwen3-14B-FPQuant-GPTQ-MXFP4
9B
•
Updated
•
8
ISTA-DASLab/Qwen3-32B-FPQuant-GPTQ-MXFP4
18B
•
Updated
•
4
ISTA-DASLab/Qwen3-0.6B-FPQuant-RTN-MXFP4
Text Generation
•
0.4B
•
Updated
•
38
•
1
ISTA-DASLab/Qwen3-0.6B-FPQuant-RTN-NVFP4
Text Generation
•
0.4B
•
Updated
•
29
ISTA-DASLab/Qwen3-4B-FPQuant-RTN-MXFP4
Text Generation
•
2B
•
Updated
•
5
ISTA-DASLab/Qwen3-4B-FPQuant-RTN-NVFP4
Text Generation
•
2B
•
Updated
•
6
ISTA-DASLab/Qwen3-1.7B-FPQuant-RTN-NVFP4
Text Generation
•
1B
•
Updated
•
7
ISTA-DASLab/Qwen3-1.7B-FPQuant-RTN-MXFP4
Text Generation
•
1B
•
Updated
•
6
ISTA-DASLab/Qwen3-8B-FPQuant-RTN-NVFP4
Text Generation
•
5B
•
Updated
•
5
ISTA-DASLab/Qwen3-1.7B-FPQuant-QAT-NVFP4-200steps
Text Generation
•
1B
•
Updated
•
5
ISTA-DASLab/Qwen3-1.7B-FPQuant-QAT-NVFP4-600steps
Text Generation
•
1B
•
Updated
•
4
ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4-200steps
Text Generation
•
5B
•
Updated
•
9
ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4-600steps
Text Generation
•
5B
•
Updated
•
8
ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4-1400steps
Text Generation
•
5B
•
Updated
•
5
ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4-1000steps
Text Generation
•
5B
•
Updated
•
6
ISTA-DASLab/Llama-3.1-8B-Instruct-MR-GPTQ-nvfp
Image-Text-to-Text
•
5B
•
Updated
•
38
ISTA-DASLab/Llama-3.1-8B-Instruct-MR-GPTQ-mxfp
Image-Text-to-Text
•
5B
•
Updated
•
7
ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-NVFP4
5B
•
Updated
•
17
ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-NVFP4
0.8B
•
Updated
•
11
ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-MXFP4
0.8B
•
Updated
•
15
ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-NVFP4
2B
•
Updated
•
9
ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-MXFP4
2B
•
Updated
•
12
ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-MXFP4
5B
•
Updated
•
40
ISTA-DASLab/Qwen3-0.6B-FPQuant-QAT-NVFP4
Text Generation
•
0.4B
•
Updated
•
31
ISTA-DASLab/Qwen3-1.7B-FPQuant-QAT-NVFP4
Text Generation
•
1B
•
Updated
•
12
ISTA-DASLab/Qwen3-4B-FPQuant-QAT-NVFP4
Text Generation
•
2B
•
Updated
•
7