DSAEval: Evaluating Data Science Agents on a Wide Range of Real-World Data Science Problems
Paper
•
2601.13591
•
Published
•
2
None defined yet.
Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm
DSAEval: Evaluating Data Science Agents on a Wide Range of Real-World Data Science Problems