Metrics

API Reference

Signature

sp.accuracy_score(y_true, y_pred)               -> float
sp.mean_squared_error(y_true, y_pred)           -> float
sp.mean_absolute_error(y_true, y_pred)          -> float
sp.r2_score(y_true, y_pred)                     -> float
sp.classification_report(y_true, y_pred)        -> str
sp.confusion_matrix(y_true, y_pred)             -> list[list[int]]

Function signatures

| Function | Input | Output | Description |
|---|---|---|---|
| accuracy_score | y_true, y_pred: list[int] | float | Fraction of correct predictions |
| mean_squared_error | y_true, y_pred: list[float] | float | Average squared error |
| mean_absolute_error | y_true, y_pred: list[float] | float | Average absolute error |
| r2_score | y_true, y_pred: list[float] | float | Coefficient of determination |
| classification_report | y_true, y_pred: list[int] | str | Per-class precision / recall / F1 table |
| confusion_matrix | y_true, y_pred: list[int] | list[list[int]] | $K \times K$ confusion matrix |

Classification metrics example
import seraplot as sp

y_true = [0, 0, 1, 1, 2, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0, 2]

print(f"Accuracy: {sp.accuracy_score(y_true, y_pred):.4f}")
print(sp.classification_report(y_true, y_pred))
print(sp.confusion_matrix(y_true, y_pred))
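
Given the definitions in the Algorithmic Functioning section below, these inputs should yield an accuracy of 5/7 ≈ 0.7143 and the confusion matrix [[1, 1, 0], [0, 2, 0], [1, 0, 2]]; for example, the entry in row 2, column 0 records the single sample of true class 2 that was predicted as class 0.
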
Regression metrics example
import seraplot as sp

y_true = [3.0, -0.5, 2.0, 7.0]
y_pred = [2.5, 0.0, 2.1, 7.8]

print(f"MSE: {sp.mean_squared_error(y_true, y_pred):.4f}")
print(f"MAE: {sp.mean_absolute_error(y_true, y_pred):.4f}")
print(f"R²:  {sp.r2_score(y_true, y_pred):.4f}")

Algorithmic Functioning


Classification metrics

Accuracy — fraction of predictions that match the true label:

$$\text{Accuracy} = \frac{1}{n}\sum_{i=1}^n \mathbf{1}[\hat{y}_i = y_i]$$
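
For reference, this definition is only a couple of lines of plain Python. The sketch below implements the formula directly; it is an illustration of the definition, not seraplot's implementation:

def accuracy(y_true, y_pred):
    # Count positions where the predicted label matches the true one.
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)

accuracy([0, 0, 1, 1, 2, 2, 2], [0, 1, 1, 1, 2, 0, 2])  # 5/7 ≈ 0.7143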

Confusion matrix — $K \times K$ matrix where entry $(k, j)$ is the number of samples of true class $k$ predicted as class $j$:

$$C_{kj} = |\{i : y_i = k,\; \hat{y}_i = j\}|$$
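
The construction is straightforward in plain Python. The sketch below assumes the labels are the integers 0..K-1, as in the classification example above; it is an illustration, not seraplot's implementation:

def confusion_matrix(y_true, y_pred):
    # C[k][j] counts samples with true class k predicted as class j.
    n_classes = max(max(y_true), max(y_pred)) + 1
    c = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        c[t][p] += 1
    return c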

Per-class metrics derived from the confusion matrix (TP, FP, FN per class $k$):

$$\text{Precision}_k = \frac{C_{kk}}{\sum_j C_{jk}}, \qquad \text{Recall}_k = \frac{C_{kk}}{\sum_j C_{kj}}$$
$$\text{F1}_k = 2 \cdot \frac{\text{Precision}_k \cdot \text{Recall}_k}{\text{Precision}_k + \text{Recall}_k}$$
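
Reading these off a confusion matrix C: precision for class k divides the diagonal entry by its column sum (everything predicted as k), while recall divides it by its row sum (everything truly k). A minimal sketch, not seraplot's code:

def per_class_metrics(c):
    # c: K x K confusion matrix; returns (precision, recall, f1) for each class.
    n_classes = len(c)
    results = []
    for k in range(n_classes):
        tp = c[k][k]
        predicted_k = sum(c[j][k] for j in range(n_classes))  # column sum
        actual_k = sum(c[k])                                  # row sum
        precision = tp / predicted_k if predicted_k else 0.0
        recall = tp / actual_k if actual_k else 0.0
        f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
        results.append((precision, recall, f1))
    return results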

classification_report tabulates these metrics for each class and also reports two averages: macro (the unweighted mean across classes) and weighted (weighted by each class's support, i.e. its number of true samples).
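
The two averages differ only in their weights. The sketch below takes per-class F1 scores and per-class supports; the helper name is illustrative, not part of seraplot's API:

def average_f1(f1_per_class, support_per_class):
    # Macro average: plain unweighted mean over classes.
    macro = sum(f1_per_class) / len(f1_per_class)
    # Weighted average: each class contributes in proportion to its support.
    total = sum(support_per_class)
    weighted = sum(f * s for f, s in zip(f1_per_class, support_per_class)) / total
    return macro, weighted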


Regression metrics

Mean Squared Error (MSE):

$$\text{MSE} = \frac{1}{n}\sum_{i=1}^n (y_i - \hat{y}_i)^2$$

Mean Absolute Error (MAE):

$$\text{MAE} = \frac{1}{n}\sum_{i=1}^n |y_i - \hat{y}_i|$$

MAE is less sensitive to outliers than MSE since it uses $|\cdot|$ instead of $(\cdot)^2$.
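
Both definitions are one-liners in plain Python. The sketch below (illustrative, not seraplot's code) also shows the outlier effect on a toy input: a single large residual inflates MSE far more than MAE:

def mean_squared_error(y_true, y_pred):
    # Mean of squared residuals.
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def mean_absolute_error(y_true, y_pred):
    # Mean of absolute residuals.
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

# Three perfect predictions plus one prediction off by 10:
y_true = [1.0, 1.0, 1.0, 1.0]
y_pred = [1.0, 1.0, 1.0, 11.0]
print(mean_squared_error(y_true, y_pred))   # 25.0  (10**2 / 4)
print(mean_absolute_error(y_true, y_pred))  # 2.5   (10 / 4)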

$R^2$ score (coefficient of determination) — proportion of variance explained by the model:

$$R^2 = 1 - \frac{\sum_i (y_i - \hat{y}_i)^2}{\sum_i (y_i - \bar{y})^2}, \qquad \bar{y} = \frac{1}{n}\sum_i y_i$$

$R^2 = 1$ means a perfect fit; $R^2 = 0$ means the model does no better than always predicting the mean $\bar{y}$; $R^2 < 0$ means it does worse than that baseline.
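
A direct transcription of the formula, with a quick check of the three regimes just described (illustrative sketch, not seraplot's implementation):

def r2_score(y_true, y_pred):
    # 1 - (residual sum of squares / total sum of squares around the mean).
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot

y_true = [3.0, -0.5, 2.0, 7.0]
print(r2_score(y_true, y_true))        # 1.0, perfect fit
print(r2_score(y_true, [2.875] * 4))   # 0.0, always predicts the mean of y_true
print(r2_score(y_true, [10.0] * 4))    # negative, worse than the mean baseline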
