Note: the current releases of this toolbox are a beta release, to test working with Haskell's, Python's, and R's code repositories.
Metrics provides implementations of various supervised machine learning evaluation metrics in the following languages:
- Python
easy_install ml_metrics
- R
install.packages("Metrics")
from the R prompt - Haskell
cabal install Metrics
- MATLAB / Octave (clone the repo & run setup from the MATLAB command line)
For more detailed installation instructions, see the README for each implementation.
Evaluation Metric | Python | R | Haskell | MATLAB / Octave |
Absolute Error (AE) | β | β | β | β |
Average Precision at K (APK, AP@K) | β | β | β | β |
Area Under the ROC (AUC) | β | β | β | β |
Classification Error (CE) | β | β | β | β |
F1 Score (F1) | β | |||
Gini | β | |||
Levenshtein | β | β | β | |
Log Loss (LL) | β | β | β | β |
Mean Log Loss (LogLoss) | β | β | β | β |
Mean Absolute Error (MAE) | β | β | β | β |
Mean Average Precision at K (MAPK, MAP@K) | β | β | β | β |
Mean Quadratic Weighted Kappa | β | β | β | |
Mean Squared Error (MSE) | β | β | β | β |
Mean Squared Log Error (MSLE) | β | β | β | β |
Normalized Gini | β | |||
Quadratic Weighted Kappa | β | β | β | |
Relative Absolute Error (RAE) | β | |||
Root Mean Squared Error (RMSE) | β | β | β | β |
Relative Squared Error (RSE) | β | |||
Root Relative Squared Error (RRSE) | β | |||
Root Mean Squared Log Error (RMSLE) | β | β | β | β |
Squared Error (SE) | β | β | β | β |
Squared Log Error (SLE) | β | β | β | β |
- F1 score
- Multiclass log loss
- Lift
- Average Precision for binary classification
- precision / recall break-even point
- cross-entropy
- True Pos / False Pos / True Neg / False Neg rates
- precision / recall / sensitivity / specificity
- mutual information
- GroupBy / Reduce
- Weight individual samples or groups
(Nonexhaustive and to be added in the future)
- Min or Max (optimize through minimization or maximization)
- Binary Classification
- Scores predicted class labels
- Scores predicted ranking (most likely to least likely for being in one class)
- Scores predicted probabilities
- Multiclass Classification
- Scores predicted class labels
- Scores predicted probabilities
- Regression
- Discrete Rater Comparison (confusion matrix)