Models evaluations¶
Evaluating a machine learning model consists of computing its performance and behavior on a set of data called the Evaluation set. Model evaluations are the cornerstone of MLOps capabilities. They permit Drift analysis, Model Comparisons and automating retraining of models
Evaluation of LLMs and Agents uses similar concepts, adapted to the specificities of these use cases.