15 33

Dominique Mariko PRO

tiptales

Recent Activity

liked a Space about 1 month ago

burtenshaw/karpathy-llm-council

Karpathy Llm Council

Ask a question to get a consensus answer from multiple models

updated a collection 3 months ago

agens

5 items • Updated Sep 30, 2025

upvoted 2 papers 3 months ago

Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8, 2025 • 86

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28, 2025 • 46

updated a collection 3 months ago

agens

5 items • Updated Sep 30, 2025

upvoted 4 papers 3 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 124

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 228

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25, 2025 • 347

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 661

upvoted a paper 4 months ago

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 48

updated a collection 4 months ago

data open access

Open and clean datasets • 7 items • Updated Sep 7, 2025

liked a dataset 5 months ago

promptfoo/political-questions

Preview • Updated Jul 25, 2025 • 31 • 3

liked a model 5 months ago

pytorch/SmolLM3-3B-INT8-INT4

Text Generation • Updated Sep 11, 2025 • 13 • 37

upvoted a collection 6 months ago

Releases July 4

25 items • Updated Jul 7, 2025 • 7

updated 2 collections 6 months ago

data open access

Open and clean datasets • 7 items • Updated Sep 7, 2025

slm

4 items • Updated Jul 4, 2025