quinn
jwhe
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
New activity (13 days ago) in harborframework/parity-experiments: [Parity] CL-bench: codex/gpt-5.2 vs infer_codex.py (50 tasks, 3 trials, MATCHING)
New activity (23 days ago) in harborframework/parity-experiments: [Parity] CL-bench: codex/gpt-5.1 vs original pipeline (50 tasks, 3 trials)
Authored a paper (2 months ago): SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
Organizations
jwhe's activity
New activity in harborframework/parity-experiments (13 days ago)
[Parity] CL-bench: codex/gpt-5.2 vs infer_codex.py (50 tasks, 3 trials, MATCHING)
#230 opened 13 days ago by jwhe
New activity in harborframework/parity-experiments (23 days ago)
[Parity] CL-bench: codex/gpt-5.1 vs original pipeline (50 tasks, 3 trials)
#210 opened 23 days ago by jwhe