Hoang Nguyen

hnguy7

12 5

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago

ServiceNow-AI/asr_codeswitched

upvoted an article about 2 months ago

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

liked a dataset about 2 months ago

ServiceNow-AI/eva-bench

Organizations

liked a dataset about 2 months ago

ServiceNow-AI/asr_codeswitched

Viewer • Updated 11 days ago • 1.21k • 157 • 5

upvoted an article about 2 months ago

Article

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

ServiceNow-AI • Jun 9 • 45

liked a dataset about 2 months ago

ServiceNow-AI/eva-bench

Viewer • Updated May 14 • 213 • 91 • 24

upvoted an article about 2 months ago

Article

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

ServiceNow-AI • Jun 4 • 42

published an article about 2 months ago

Article

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

ServiceNow-AI • Jun 4 • 42

authored 5 papers 3 months ago

M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models

Paper • 2406.16783 • Published Jun 24, 2024 • 4

Prompting with Phonemes: Enhancing LLM Multilinguality for non-Latin Script Languages

Paper • 2411.02398 • Published Nov 4, 2024 • 1

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback

Paper • 2510.06186 • Published Oct 7, 2025 • 1

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 76

upvoted 2 papers 3 months ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 76

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 65

upvoted an article 4 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 918

upvoted a paper 4 months ago

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 98

liked a dataset 4 months ago

ServiceNow-AI/eva

Viewer • Updated Mar 24 • 50 • 59 • 71

upvoted an article 4 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI • Mar 24 • 96

published an article 4 months ago

Article

A New Framework for Evaluating Voice Agents (EVA)

ServiceNow-AI • Mar 24 • 96

upvoted a paper 5 months ago

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Paper • 2603.13594 • Published Mar 13 • 150

liked a dataset 5 months ago

ServiceNow-AI/EnterpriseOps-Gym

Viewer • Updated Apr 30 • 2.56k • 11.8k • 98

upvoted a collection 11 months ago

AU-Harness datasets

Collection

3 items • Updated Sep 12, 2025 • 6
