
Model Monitoring

Discover why model monitoring matters for keeping AI accurate, detecting data drift, and maintaining reliability in dynamic, real-world environments.


Model monitoring is the continuous process of tracking and evaluating the performance of machine learning (ML) models once they are deployed into production. It involves observing key metrics related to model accuracy, operational health, and data characteristics to ensure the model behaves as expected over time. This practice is a crucial part of the Machine Learning Operations (MLOps) lifecycle, ensuring that deployed Artificial Intelligence (AI) systems remain reliable, effective, and trustworthy in real-world environments. Without monitoring, model performance can degrade silently, leading to poor predictions and negative business outcomes.

Why Is Model Monitoring Important?

ML models are trained on historical data, but the real world is dynamic. Changes in data patterns, user behavior, or the environment can cause a model's performance to decline after deployment. Key reasons for monitoring include:

  • Detecting Performance Degradation: Models can become less accurate over time. Monitoring helps identify drops in performance metrics like precision, recall, or F1-score (a minimal degradation check is sketched after this list). You can learn more about YOLO performance metrics in our guide.
  • Identifying Data Drift: The statistical properties of the input data can change, a phenomenon known as data drift. This can happen when the data the model sees in production differs significantly from the training data. Monitoring input features helps detect such shifts.
  • Spotting Concept Drift: The relationship between input features and the target variable can change over time (concept drift). For example, customer preferences might evolve, making old prediction patterns obsolete. Concept drift requires model retraining or adaptation.
  • Ensuring Operational Health: Monitoring tracks operational metrics like inference latency, throughput, and error rates to ensure the model serving infrastructure is running smoothly.
  • Maintaining Fairness and Ethics: Monitoring can help detect and mitigate bias in AI by tracking performance across different demographic groups or data segments, promoting AI ethics.
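
To make the performance-degradation point concrete, here is a minimal sketch that recomputes a rolling F1-score over the most recent labeled production predictions and raises an alert when it falls below a baseline established during validation. It assumes binary labels, that ground-truth feedback eventually arrives, and that scikit-learn is available; the window size, threshold, and record_outcome helper are illustrative placeholders, not recommendations.

```python
# Minimal sketch: flag performance degradation on a sliding window of
# production predictions that have received ground-truth labels.
# WINDOW_SIZE and F1_ALERT_THRESHOLD are illustrative values.
from collections import deque

from sklearn.metrics import f1_score

WINDOW_SIZE = 500          # number of recent labeled predictions to evaluate
F1_ALERT_THRESHOLD = 0.80  # baseline established during offline validation

recent_labels = deque(maxlen=WINDOW_SIZE)
recent_preds = deque(maxlen=WINDOW_SIZE)


def record_outcome(y_true: int, y_pred: int) -> None:
    """Store a labeled production prediction and check the rolling F1-score."""
    recent_labels.append(y_true)
    recent_preds.append(y_pred)

    if len(recent_labels) == WINDOW_SIZE:
        rolling_f1 = f1_score(list(recent_labels), list(recent_preds))
        if rolling_f1 < F1_ALERT_THRESHOLD:
            # In practice this would page an on-call engineer or open a ticket.
            print(f"ALERT: rolling F1 {rolling_f1:.3f} fell below {F1_ALERT_THRESHOLD}")
```

In a real pipeline the same idea is usually applied per data segment (region, device type, class), since aggregate metrics can hide localized degradation.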

What Aspects Are Monitored?

Effective model monitoring typically involves tracking several categories of metrics:

  • Prediction Performance: Metrics like accuracy, Mean Average Precision (mAP), AUC, and error rates, often compared against benchmarks established during validation.
  • Data Quality and Integrity: Tracking missing values, data type mismatches, and range violations in input data.
  • Input Data Drift: Statistical measures (e.g., population stability index, Kolmogorov-Smirnov test) to compare the distribution of production input features against the training data distribution; a KS-test sketch follows this list.
  • Prediction/Output Drift: Monitoring the distribution of model predictions to detect significant shifts.
  • Operational Metrics: System-level metrics like CPU/GPU utilization, memory usage, request latency, and throughput. Platforms like Prometheus are often used for this.
  • Fairness and Bias Metrics: Evaluating model performance disparities across sensitive attributes (e.g., age, gender, ethnicity) using metrics like demographic parity or equalized odds.
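
As a concrete illustration of the input-drift check above, the following sketch applies a two-sample Kolmogorov-Smirnov test from SciPy to a single numeric feature. The significance level, sample sizes, and synthetic data are illustrative assumptions; production systems typically test many features at once and account for multiple comparisons.

```python
# Minimal sketch: compare a production feature sample against the training
# distribution with a two-sample Kolmogorov-Smirnov test (SciPy).
# The 0.05 significance level is a common default, not a universal rule.
import numpy as np
from scipy.stats import ks_2samp


def feature_drifted(train_values: np.ndarray, prod_values: np.ndarray, alpha: float = 0.05) -> bool:
    """Return True if the production distribution differs significantly from training."""
    statistic, p_value = ks_2samp(train_values, prod_values)
    return p_value < alpha


# Example with synthetic data: a mean shift in production simulates drift.
rng = np.random.default_rng(0)
train = rng.normal(loc=0.0, scale=1.0, size=5_000)
prod = rng.normal(loc=0.5, scale=1.0, size=1_000)
print(feature_drifted(train, prod))  # True: the shifted sample is flagged
```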

Model Monitoring vs. Related Concepts

It's important to distinguish model monitoring from similar terms:

  • Observability: While monitoring focuses on tracking predefined metrics to assess known failure modes, observability provides the tools (logs, metrics, traces) to explore and understand unknown system states and behaviors. Observability enables deeper investigation when monitoring detects an anomaly.
  • MLOps: MLOps is a broader set of practices covering the entire ML lifecycle, including data management, model training, deployment, governance, and monitoring. Model monitoring is a critical component within the MLOps framework, focusing specifically on post-deployment model health.
  • Model Evaluation: Evaluation is typically performed before deployment using static validation data or test data to assess a model's quality. Monitoring is a continuous process performed on live production data after deployment. Find insights on model evaluation and fine-tuning here.

Real-World Applications

  1. E-commerce Recommendation Systems: An e-commerce platform uses an ML model for product recommendations (recommendation system). Model monitoring tracks click-through rates (CTR) and conversion rates for recommended items. If monitoring detects a sudden drop in CTR (performance degradation) or a shift in the types of products being purchased (concept drift due to a new trend), alerts are triggered. This prompts investigation and potentially retraining the model with newer interaction data. Amazon Personalize includes features for monitoring recommendation effectiveness.
  2. Autonomous Vehicle Perception: Self-driving cars rely heavily on computer vision models like Ultralytics YOLO for object detection. Model monitoring continuously tracks the detection accuracy (YOLO performance metrics) and confidence scores for objects like pedestrians, cyclists, and other vehicles. It also monitors input data characteristics (e.g., image brightness, weather conditions) to detect drift. If the model's performance degrades significantly in specific conditions (e.g., heavy rain, low light), the system might switch to a safer operational mode or flag the need for model updates trained on more diverse data (data augmentation). Companies like Waymo invest heavily in monitoring their perception systems.
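
A minimal sketch of the second scenario is shown below: it runs an Ultralytics YOLO model on one frame, computes the mean detection confidence, and flags frames where confidence is unusually low. The weights file "yolo11n.pt", the input image name, and the 0.5 threshold are illustrative assumptions; a real system would aggregate these statistics over time and route alerts to a monitoring backend rather than print them.

```python
# Minimal sketch: log per-frame detection confidence from an Ultralytics YOLO
# model so that a drop in average confidence can be surfaced by monitoring.
from ultralytics import YOLO

model = YOLO("yolo11n.pt")  # any trained detection checkpoint works here

results = model("street_scene.jpg")  # hypothetical input frame
confidences = results[0].boxes.conf.tolist()

mean_conf = sum(confidences) / len(confidences) if confidences else 0.0
print(f"{len(confidences)} detections, mean confidence {mean_conf:.2f}")

if mean_conf < 0.5:
    # A sustained drop in confidence can indicate input drift (e.g., fog, low light)
    # and would typically be forwarded to an alerting system.
    print("WARNING: low average detection confidence on this frame")
```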

Tools and Implementation

Implementing model monitoring often involves using specialized tools and platforms. Options range from open-source libraries like Evidently AI and NannyML to managed services offered by cloud providers (AWS SageMaker Model Monitor, Google Vertex AI Model Monitoring, Azure Machine Learning data drift detection) and dedicated MLOps platforms like Arize AI or WhyLabs. Platforms like Ultralytics HUB provide infrastructure that supports deploying and managing models, integrating with monitoring solutions to complete the MLOps cycle. Effective model maintenance strategies rely heavily on robust monitoring.
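
As an example of wiring up the operational side, the sketch below exposes prediction counts and inference latency with the prometheus_client library so that a Prometheus server can scrape them. The metric names, the port, and the simulated delay are placeholders, and the predict function stands in for a real model call.

```python
# Minimal sketch: expose operational model-serving metrics to Prometheus
# using the prometheus_client library. Metric names and port are examples.
import random
import time

from prometheus_client import Counter, Histogram, start_http_server

PREDICTIONS = Counter("model_predictions_total", "Total predictions served")
LATENCY = Histogram("model_inference_latency_seconds", "Inference latency in seconds")


@LATENCY.time()  # records the duration of each call in the histogram
def predict(features):
    time.sleep(random.uniform(0.01, 0.05))  # placeholder for real model inference
    PREDICTIONS.inc()
    return 0  # dummy prediction


if __name__ == "__main__":
    start_http_server(8000)  # metrics become scrapeable at http://localhost:8000/metrics
    while True:
        predict([0.1, 0.2, 0.3])
```

Dashboards and alert rules (e.g., on p95 latency or error rate) are then built on top of these scraped metrics, complementing the statistical checks described earlier.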
