Xiaozhe Yao

xzyao

AI & ML interests

None yet

Recent Activity

updated a bucket 12 days ago
xiaozheyao/datasets
published a bucket 12 days ago
xiaozheyao/datasets
published a bucket 22 days ago
researchcomputer/kernels

Organizations

Research Computer, DS3Lab, Aurora-M/MDEL, AutoAI, eth-easl, ICML2023, Compressed LMs, Xiaozhe Yao and Friends, DeltaZip, vagents, Scaling VIT, Benchmaker, Apertus Community

authored a paper 10 months ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Paper • 2509.14233 • Published Sep 17, 2025 • 21
authored a paper about 1 year ago

DataPerf: Benchmarks for Data-Centric AI Development

Paper • 2207.10062 • Published Jul 20, 2022 • 1
authored a paper over 1 year ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 59
authored 2 papers about 2 years ago

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30, 2024 • 42

DMLR: Data-centric Machine Learning Research -- Past, Present and Future

Paper • 2311.13028 • Published Nov 21, 2023 • 2
authored a paper over 2 years ago

DeltaZip: Multi-Tenant Language Model Serving via Delta Compression

Paper • 2312.05215 • Published Dec 8, 2023 • 1