Book: The Emerging Science of Machine Learning Benchmarks

loveparade · 2026-03-19T00:53:48 1773881628

Very cool book. I think a reason why ML has seen so much progress despite benchmark overfitting/abuse is that results are "regularized" by real world applications and the Lindy effect. Methods, or research, that abuse benchmarks aren't adopted by follow-up research so they tend not to survive. And they aren't adopted because people try them but then find out that they don't generalize to other/newer benchmarks. So the system works not because of specific benchmarks, but because of how the community as a whole deals with benchmarks.

erdemo · 2026-03-19T11:12:15 1773918735

https://mlbenchmarks.org/

This is the actual link to reach the book. There is no navigation link back to the index on the shared link.

NeutralForest · 2026-03-19T11:55:54 1773921354

Thanks, I had to do the same.

trostaft · 2026-03-19T00:48:43 1773881323

If I'm recall correctly, this was also a keynote at MDS24? That was also a great talk, Hardt is an excellent speaker.

lazrgatr · 2026-03-18T21:39:38 1773869978

A little rule I live by is that if Moritz Hardt writes it, I will read it

TrainedMonkey · 2026-03-19T00:17:32 1773879452

Why is that?

kaycey2022 · 2026-03-19T04:46:54 1773895614

You honestly don't know of Moritz Hardt?

fxwin · 2026-03-19T12:07:32 1773922052

Why so snarky? I also didn't know who he was:

I'm a director at the Max Planck Institute for Intelligent Systems. Prior to joining the institute, I was Associate Professor for Electrical Engineering and Computer Sciences at the University of California, Berkeley. My research contributes to the scientific foundations of machine learning and algorithmic decision making with a focus on social questions.[0]

Also simply knowing of him doesn't answer the question.

[0] https://mrtz.org/

kaycey2022 · 2026-03-20T10:40:28 1774003228

sry just a joke man

khafra · 2026-03-19T12:12:25 1773922345

xkcd 1053, my friend.

pakapica · 2026-03-19T12:00:56 1773921656

added to my reading list :)

salberts · 2026-03-19T05:31:33 1773898293

Read the preface.

1. It sounds like this book can be summarized in a practical blog post or a series of posts

2. Is using the term crisis so many times really necessary?