Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 411
Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers Paper • 2507.10787 • Published Jul 14, 2025 • 13
Demystifying Scientific Problem-Solving in LLMs by Probing Knowledge and Reasoning Paper • 2508.19202 • Published Aug 26, 2025 • 7
Evidence-Aware Generative Reranker Collection Backup of SFT reranker checkpoints. NFCorpus, Trec-CDS, Trec-CT, and Trec-PM data were used for training • 0 items • Updated 23 days ago
Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language +4 davidberenstein1957, sdiazlor, Leiyre, dvilasuero, Ameeeee, burtenshaw • Dec 16, 2024 • 158
Article Training and Finetuning Reranker Models with Sentence Transformers tomaarsen • Mar 26, 2025 • 193