BOUKOUFFALLAH Abdallah's picture

31 4

BOUKOUFFALLAH Abdallah

iBado

·

Abdellahbado

AI & ML interests

None yet

Organizations

None yet

upvoted 5 papers 7 months ago

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities

Paper • 2507.13158 • Published Jul 17, 2025 • 24

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 261

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14, 2025 • 90

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Paper • 2507.05687 • Published Jul 8, 2025 • 30

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

Paper • 2507.01925 • Published Jul 2, 2025 • 39

upvoted 7 papers 8 months ago

From Bytes to Ideas: Language Modeling with Autoregressive U-Nets

Paper • 2506.14761 • Published Jun 17, 2025 • 17

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17, 2025 • 45

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15, 2025 • 63

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22, 2025 • 120

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Paper • 2505.17894 • Published May 23, 2025 • 220

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30, 2025 • 14

upvoted 8 papers 9 months ago

Text Generation Beyond Discrete Token Sampling

Paper • 2505.14827 • Published May 20, 2025 • 10

Reward Reasoning Model

Paper • 2505.14674 • Published May 20, 2025 • 37

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17, 2025 • 121

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19, 2025 • 83

Transformer Interpretability Beyond Attention Visualization

Paper • 2012.09838 • Published Dec 17, 2020 • 1

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15, 2025 • 120

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8, 2025 • 185

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 189