ScorecardHistogramDrift

Compares score distributions between reference and monitoring datasets for each class.

Purpose

The Scorecard Histogram Drift test is designed to evaluate changes in the model's scoring patterns over time. By comparing score distributions between reference and monitoring datasets for each class, this test helps identify whether the model's scoring behavior remains stable in production. This is crucial for understanding if the model's risk assessment maintains consistent patterns and whether specific score ranges have experienced significant shifts in their distribution.

Test Mechanism

This test proceeds by generating histograms of scores for each class in both reference and monitoring datasets. It analyzes distribution characteristics through multiple statistical moments: mean, variance, skewness, and kurtosis. The test quantifies drift as percentage changes in these moments between datasets, providing both visual and numerical assessments of distribution stability. Special attention is paid to class-specific distribution changes.

Signs of High Risk

Significant shifts in score distribution shapes
Large drifts in distribution moments exceeding threshold
Changes in the relative positioning of class distributions
Appearance of new modes or peaks in monitoring data
Unexpected changes in score spread or concentration
Systematic shifts in class-specific scoring patterns

Strengths

Provides class-specific distribution analysis
Identifies detailed changes in scoring patterns
Enables visual comparison of distributions
Includes comprehensive moment analysis
Supports multiple class evaluation
Maintains interpretable score scale

Limitations

Sensitive to binning choices in visualization
Requires sufficient samples per class
Cannot suggest score adjustments
May not capture subtle distribution changes
Complex interpretation with multiple classes
Limited to univariate score analysis