The AI Evaluation Substack
Subscribe
Sign in
Home
Archive
About
Latest
Top
Discussions
2025 September "AI Evaluation" Digest
What could possibly go wrong?
Sep 26
•
AI Evaluation
5
August 2025
2025 August "AI Evaluation" Digest
Between a rock and a hard place
Aug 29
•
AI Evaluation
8
July 2025
2025 July "AI Evaluation" Digest
Long live OpenML!
Jul 25
•
AI Evaluation
11
3
June 2025
2025 June "AI Evaluation" Digest
Illusion is all you need
Jun 27
•
AI Evaluation
10
May 2025
2025 May "AI Evaluation" Digest
Ethical standards in AI evaluation
May 30
•
AI Evaluation
13
April 2025
2025 April "AI Evaluation" Digest
En attendant Turing: a Tragicomedy in Two Acts
Apr 25
•
AI Evaluation
10
March 2025
2025 March "AI Evaluation" Digest
Overhauling Difficulty in Item Response Theory.
Mar 28
•
AI Evaluation
9
February 2025
2025 February "AI Evaluation" Digest
It’s high time to change the paradigm.
Feb 28
•
AI Evaluation
6
January 2025
2025 January "AI Evaluation" Digest
Distil, baby, distil!
Jan 31
•
AI Evaluation
11
December 2024
2024 December "AI Evaluation" Digest
Think before you act!
Dec 27, 2024
•
AI Evaluation
10
November 2024
2024 November "AI Evaluation" Digest
Mission Impossible: AI Evaluation
Nov 29, 2024
6
October 2024
2024 October "AI Evaluation" Digest
News from ECAI 2024 and more!
Oct 25, 2024
•
AI Evaluation
8
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts