Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
gabrielbo 's Collections
SmartRouter
SPaRK-RL

SPaRK-RL

updated Jun 17, 2025

combines reinforcement learning (RL) and large language models (LLMs) to improve exploration using diverse tool generation during inference

Upvote
1

  • gabrielbo/explore-rl-hotpota-trajectories

    Updated May 9, 2025 • 3

  • gabrielbo/swirl-trajectories-mmlu-pro

    Viewer • Updated May 20, 2025 • 24.8k • 9 • 2

  • gabrielbo/spark-model-QLoRA

    Text Generation • Updated May 24, 2025 • 1
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs