Join us to solve the problem of cheating on AI Benchmarking
🎉
Our paper "Benchmarking is Broken - Don't Let AI be its Own Judge" just got accepted to NeurIPS 2025
Read the paper on arXivPeerBench.ai is an open-source, non-profit community implementation of the paper, bringing the research to life for everyone.