Discussion about this post

User's avatar
Suhrab Khan's avatar

This digest is a masterclass in AI evaluation, especially the emphasis on modality confounders and construct validity. Emerging frameworks like BeTaL and RIDE show how much methodology shapes what we think AI can do.

I talk about the latest AI trends and insights. Do check out my Substack, I am sure you’ll find it very relevant and relatable.

Expand full comment

No posts

Ready for more?