Eric Bezzam's picture

Eric Bezzam PRO

bezzam

huggingface

·

AI & ML interests

speech, audio, imaging

Recent Activity

updated a dataset about 21 hours ago

hf-audio/asr-leaderboard-longform

updated a model 2 days ago

liked a model 3 days ago

ibm-granite/granite-speech-4.1-2b

View all activity

Organizations

upvoted an article 5 days ago

Article

Safetensors is Joining the PyTorch Foundation

25 days ago

•

37

upvoted a paper 8 days ago

Qwen3-ASR Technical Report

Paper • 2601.21337 • Published Jan 29 • 37

upvoted an article 9 days ago

Article

mlinter: a linter for Transformers modeling files

10 days ago

•

8

upvoted a collection 12 days ago

MOSS-Audio

An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 7 items • Updated about 3 hours ago • 52

upvoted 2 collections 16 days ago

Canary ASR/AST

A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 6 items • Updated 12 days ago • 34

Parakeet ASR

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated 12 days ago • 70

upvoted an article 16 days ago

Article

The PR you would have opened yourself

17 days ago

•

67

upvoted an article 29 days ago

Article

Liberate your OpenClaw

+6

Mar 27

•

45

upvoted a collection 30 days ago

Gemma 4

8 items • Updated about 1 month ago • 709

upvoted 3 articles about 1 month ago

Article

TRL v1.0: Post-Training Library Built to Move with the Field

+2

Mar 31

•

50

Article

How I contributed a new model to the Transformers library using Codex

Mar 30

•

50

Article

Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets

Mar 21

•

17

upvoted an article about 2 months ago

Article

LLM based Audio models

Dec 18, 2025

•

58

upvoted a collection about 2 months ago

ALARM

Official checkpoints and data for "ALARM: Audio–Language Alignment for Reasoning Models" • 8 items • Updated Mar 9 • 1

upvoted an article about 2 months ago

Article

Introducing Storage Buckets on the Hugging Face Hub

+10

Mar 10

•

194

upvoted a paper about 2 months ago

Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 19

upvoted an article about 2 months ago

Article

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

+2

Mar 5

•

51

upvoted 3 articles 2 months ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

Feb 20

•

505

Article

Did GPT 5.2 make a breakthrough discovery in theoretical physics?

Feb 19

•

62

Article

Compute and Competition in AI: Different FlOPs for Different Folks

Feb 12

•

14