Prabhat Kumar Gupta

pkghf

https://prabhat-gupta-tech.netlify.app/

AI & ML interests

LLM, AI Agents, NLP, Visual LM

Recent Activity

upvoted an article about 4 hours ago

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

upvoted a collection about 7 hours ago

Embedding Model Datasets

upvoted an article 3 days ago

Train 400x faster Static Embedding Models with Sentence Transformers

View all activity

Organizations

upvoted an article about 4 hours ago

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

aamirshakir, tomaarsen, SeanLee97

•

Mar 22, 2024

• 134

upvoted a collection about 7 hours ago

Embedding Model Datasets

Collection

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 171

upvoted an article 3 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

tomaarsen

•

Jan 15, 2025

• 230

upvoted a paper 3 days ago

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 26

upvoted an article 4 days ago

Article

🪆 Introduction to Matryoshka Embedding Models

tomaarsen, Xenova, osanseviero

•

Feb 23, 2024

• 208

upvoted 2 articles 7 days ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 16

• 71

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 9

• 59

upvoted an article 10 days ago

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers

tomaarsen, arthurbresnu

•

Jul 1, 2025

• 138

upvoted an article 14 days ago

Article

Training and Finetuning Embedding Models with Sentence Transformers

tomaarsen

•

May 28, 2024

• 274

upvoted 2 articles 3 months ago

Article

SetFit: Efficient Few-Shot Learning Without Prompts

Unso, lewtun, luketheduke, danielkorat, orenpereg, moshew

•

Sep 26, 2022

• 40

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 160

commented on Assisted Generation: a new direction toward low-latency text generation 3 months ago

Great Blog. The accuracy is totally dependent on few factors like use of Greedy decoding and high quality of Assistant models.

Also the latency gains is significant with smaller assistant models which creates a tradeoff between its accuracy vs speed

upvoted 2 articles 3 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

joaogante

•

May 11, 2023

• 78

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 156

updated a dataset 6 months ago

pkghf/ecom-product-catalog

Viewer • Updated Nov 17, 2025 • 266 • 22

published a dataset 6 months ago

pkghf/ecom-product-catalog

Viewer • Updated Nov 17, 2025 • 266 • 22

commented on Supercharge your OCR Pipelines with Open Models 7 months ago

Superb insights and breaking down the benchmarks. I will be using the datasets here to do the evaluations.

Prabhat Kumar Gupta

AI & ML interests

Recent Activity

Organizations

pkghf's activity

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Train 400x faster Static Embedding Models with Sentence Transformers

🪆 Introduction to Matryoshka Embedding Models

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Multimodal Embedding & Reranker Models with Sentence Transformers

Training and Finetuning Sparse Embedding Models with Sentence Transformers

Training and Finetuning Embedding Models with Sentence Transformers

SetFit: Efficient Few-Shot Learning Without Prompts

Mixture of Experts (MoEs) in Transformers

Assisted Generation: a new direction toward low-latency text generation

We Got Claude to Build CUDA Kernels and teach open models!