Sean McLeish PRO

smcleish
https://mcleish7.github.io/
  • SeanMcleish
  • mcleish7

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs
upvoted a paper 18 days ago
How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models
updated a model 24 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-4-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-2e-5

Organizations

  • Tom Goldstein's Lab at University of Maryland, College Park
  • Leon Sean Dev
  • University of Maryland
  • Gemstones 💎: A Model Suite for Multi-Faceted Scaling Laws
  • Gemstones 💎: A Model Suite for Multi-Faceted Scaling Laws (Cooldowns)
  • Gemstones 💎: A Model Suite for Multi-Faceted Scaling Laws (LR Ablation)
  • Latent Context Language Model

authored a paper 6 months ago

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 19
authored a paper over 1 year ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 155
authored 3 papers almost 2 years ago

Benchmarking ChatGPT on Algorithmic Reasoning

Paper • 2404.03441 • Published Apr 4, 2024

The CLRS-Text Algorithmic Reasoning Language Benchmark

Paper • 2406.04229 • Published Jun 6, 2024 • 4

Transformers Can Do Arithmetic with the Right Embeddings

Paper • 2405.17399 • Published May 27, 2024 • 54