CoP: Agentic Red-teaming for Large Language Models using Composition of PrinciplesChen XiongPin-Yu Chenet al.2025NeurIPS 2025
Deferring Concept Bottleneck Models: Learning to Defer Interventions to Inaccurate ExpertsAndrea PugnanaRiccardo Massiddaet al.2025NeurIPS 2025
Fair Continuous Resource Allocation with Equality of ImpactBlossom MetevierDennis Weiet al.2025NeurIPS 2025
BenchmarkCards: Standardized Documentation for Large Language Model BenchmarksAnna SokolElizabeth Dalyet al.2025NeurIPS 2025
Foundation Models Enabling Multi-Scale Battery Materials Discovery: From Molecules To DevicesVidushi SharmaAndy Teket al.2025NeurIPS 2025
The Shepherd Test: How Will Superintelligent Agents Balance Care and Control in Asymmetric Relationships?Djallel BouneffoufMatthew Riemeret al.2025NeurIPS 2025
Toward a Coherent Virtual Cell Model: Probing Biological World-Model Coherence in Transcriptomic Foundation ModelsNoa MorielYishai Shimoniet al.2025NeurIPS 2025
Specifying exact circuit algorithms in universal transformersTaku ItoRuchir Puriet al.2025NeurIPS 2025
Vintage Code, Modern Judges: Meta-Validation in Low Data RegimesGal AmramOra Nova Fandinaet al.2025ASE 2025
Enhancing Study-Level Inference from Clinical Trial Papers via Reinforcement Learning-Based Numeric ReasoningMassimiliano PronestiMichela Lorandiet al.2025EMNLP 2025