Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
447
20
404
John Leimgruber III
PRO
ubergarm
Follow
kzoltan's profile picture
Kiruya's profile picture
setianke's profile picture
428 followers
Β·
67 following
https://blog.aifoundry.org/p/adventures-in-model-quantization
ubergarm
john-leimgruber
AI & ML interests
Open LLMs and Astrophotography image processing.
Recent Activity
new
activity
about 21 hours ago
ubergarm/GLM-5.1-GGUF:
Slow prompt processing?
new
activity
1 day ago
turboderp/Qwen3.6-27B-DFlash-exl3:
Nice Work!
new
activity
2 days ago
unsloth/Qwen3.6-27B-MTP-GGUF:
These shoudl work on ik_llama.cpp too
View all activity
Organizations
ubergarm
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
ubergarm/GLM-5.1-GGUF
about 21 hours ago
Slow prompt processing?
4
#9 opened 4 days ago by
malamen4
New activity in
turboderp/Qwen3.6-27B-DFlash-exl3
1 day ago
Nice Work!
β€οΈ
1
1
#1 opened 2 days ago by
ubergarm
New activity in
unsloth/Qwen3.6-27B-MTP-GGUF
2 days ago
These shoudl work on ik_llama.cpp too
π
π
6
#3 opened 2 days ago by
ubergarm
New activity in
ubergarm/Qwen3-Coder-Next-GGUF
2 days ago
Quant requests?
13
#1 opened 3 months ago by
ubergarm
New activity in
ubergarm/GLM-5.1-GGUF
2 days ago
Draft llama.cpp PR for DSA (Deepseek Sparse Attention)
π
1
1
#8 opened 6 days ago by
whoisjeremylam
liked
a model
2 days ago
turboderp/Qwen3.6-27B-DFlash-exl3
Updated
7 days ago
β’
75
β’
11
New activity in
RDson/Qwen3.6-27B-MTP-IQ4_KS-GGUF
7 days ago
Yay more ik quants!
β€οΈ
1
#1 opened 7 days ago by
ubergarm
liked
a model
7 days ago
RDson/Qwen3.6-27B-MTP-IQ4_KS-GGUF
Text Generation
β’
27B
β’
Updated
11 days ago
β’
21.8k
β’
6
New activity in
RDson/Qwen3.6-27B-MTP-Q4_K_M-GGUF
8 days ago
ζιζζδΈηζ³
7
#1 opened 13 days ago by
androidli
liked
a model
8 days ago
Lorbus/Qwen3.6-27B-int4-AutoRound
Image-Text-to-Text
β’
6B
β’
Updated
21 days ago
β’
575k
β’
94
New activity in
Lorbus/Qwen3.6-27B-int4-AutoRound
8 days ago
How does this fair against other quants without MTP like unsloth?
π
1
1
#4 opened 19 days ago by
Crigges
New activity in
ubergarm/Qwen3.6-27B-GGUF
8 days ago
How to use MTP in GGUF?
18
#2 opened 13 days ago by
Friedland
liked
2 models
8 days ago
google/gemma-4-31B-it-assistant
Any-to-Any
β’
0.5B
β’
Updated
2 days ago
β’
93.2k
β’
226
RDson/Qwen3.6-27B-MTP-Q4_K_M-GGUF
27B
β’
Updated
14 days ago
β’
6.72k
β’
29
updated
a model
9 days ago
ubergarm/Qwen3.6-27B-GGUF
Text Generation
β’
27B
β’
Updated
9 days ago
β’
11.6k
β’
18
New activity in
ubergarm/Qwen3.6-27B-GGUF
12 days ago
Q6_0 use over Q6_K?
2
#3 opened 12 days ago by
resynth
New activity in
ubergarm/Qwen3.6-27B-GGUF
14 days ago
Great model for single GPU use cases.
π₯
4
16
#1 opened 18 days ago by
phakio
liked
a model
16 days ago
XiaomiMiMo/MiMo-V2.5-Pro
Text Generation
β’
1T
β’
Updated
5 days ago
β’
48.2k
β’
512
New activity in
ubergarm/Qwen3.5-122B-A10B-GGUF
16 days ago
How to split this model between 2 (3) GPUs and CPU/RAM ?
30
#12 opened about 2 months ago by
mancub
updated
a model
16 days ago
ubergarm/Kimi-K2.6-GGUF
Text Generation
β’
1T
β’
Updated
16 days ago
β’
6.04k
β’
35
Load more