taesiri's picture

Open to Collab

taesiri PRO

taesiri

·

https://taesiri.ai/

AI & ML interests

AGI ... one linear layer at a time

Recent Activity

upvoted a paper about 8 hours ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

commented on a paper about 12 hours ago

Learning Situated Awareness in the Real World

upvoted a paper about 12 hours ago

Learning Situated Awareness in the Real World

View all activity

Organizations

upvoted a paper about 8 hours ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published 6 days ago • 35

upvoted 2 papers about 12 hours ago

Learning Situated Awareness in the Real World

Paper • 2602.16682 • Published about 20 hours ago • 3

World Action Models are Zero-shot Policies

Paper • 2602.15922 • Published 2 days ago • 6

upvoted a paper about 22 hours ago

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

Paper • 2602.12670 • Published 6 days ago • 43

upvoted 5 papers 1 day ago

Revisiting the Platonic Representation Hypothesis: An Aristotelian View

Paper • 2602.14486 • Published 3 days ago • 9

ResearchGym: Evaluating Language Model Agents on Real-World AI Research

Paper • 2602.15112 • Published 3 days ago • 16

Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook

Paper • 2602.14299 • Published 4 days ago • 24

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 2 days ago • 48

Recursive Language Models

Paper • 2512.24601 • Published Dec 31, 2025 • 89

upvoted 5 papers 2 days ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published 5 days ago • 53

VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval

Paper • 2602.08099 • Published 11 days ago • 120

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Paper • 2602.14041 • Published 4 days ago • 37

UniWeTok: An Unified Binary Tokenizer with Codebook Size 2^{128} for Unified Multimodal Large Language Model

Paper • 2602.14178 • Published 4 days ago • 11

FireRed-Image-Edit-1.0 Techinical Report

Paper • 2602.13344 • Published 7 days ago • 4

upvoted 6 papers 3 days ago

CoPE-VideoLM: Codec Primitives For Efficient Video Language Models

Paper • 2602.13191 • Published 6 days ago • 29

Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution

Paper • 2602.12684 • Published 6 days ago • 5

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published 9 days ago • 215

MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

Paper • 2602.12705 • Published 6 days ago • 57

What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis

Paper • 2602.12395 • Published 7 days ago • 14

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published 10 days ago • 45