Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
audio-flamingo-next-hf
like
38
Follow
NVIDIA
55.5k
Audio-Text-to-Text
Transformers
Safetensors
4 datasets
English
audioflamingonext
text-generation
audio
speech
sound
music
reasoning
audio understanding
ASR
audio captioning
long-context
audio-language-model
long-audio
timestamp-grounding
instruction-tuned
arxiv:
2604.10905
License:
other
Model card
Files
Files and versions
xet
Community
3
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Sort: Recently created
KeyError: 'audioflamingonext'
1
#3 opened 8 days ago by
lby01
Poor word-level audio analysis performance
#2 opened 9 days ago by
empeza
FP8 version
#1 opened 10 days ago by
JermemyHaschal