eval_toolkit.bootstrap
|
95% CI for a metric on a single condition. |
|
Result of a DeLong paired ROC-AUC comparison. |
|
Minimum detectable Δ at the requested (α, 1-β). |
|
95% CI for |
|
Per-condition CI via |
|
K-fold cross-validation of a metric on caller-supplied scores. |
|
CV-corrected confidence interval per Bayle et al. (2020), Theorem 3.1.
|
DeLong's variance of the paired ROC-AUC difference. |
|
Derive MDE from an existing |
|
Paired-bootstrap CI on |
|
Paired-bootstrap CI on |
|
Two-level paired bootstrap for operating-point lifts. |
|
Minimum detectable paired Δ at (α, power). |
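To make the paired-bootstrap and minimum-detectable-Δ summaries above concrete, here is a minimal standard-library sketch. It is not the eval_toolkit API: the function names `paired_bootstrap_ci` and `min_detectable_delta`, their parameters, and their defaults are hypothetical. It assumes a percentile bootstrap on the mean paired difference and a two-sided normal approximation for the detectable effect; the module itself may use different estimators.

```python
import math
import random
from statistics import NormalDist, mean


def paired_bootstrap_ci(a, b, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for mean(a - b).

    Hypothetical helper: pairs are resampled together by resampling
    the per-item differences with replacement.
    """
    if len(a) != len(b):
        raise ValueError("a and b must have the same length")
    diffs = [x - y for x, y in zip(a, b)]
    n = len(diffs)
    rng = random.Random(seed)  # seeded for reproducibility
    # Bootstrap distribution of the mean paired difference.
    boot = sorted(mean(rng.choices(diffs, k=n)) for _ in range(n_boot))
    lo = boot[int((alpha / 2) * n_boot)]
    hi = boot[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lo, hi


def min_detectable_delta(sd_diff, n_pairs, alpha=0.05, power=0.80):
    """Smallest paired difference detectable at (alpha, power).

    Hypothetical helper using the normal approximation:
    delta = (z_{1-alpha/2} + z_{power}) * sd_diff / sqrt(n_pairs)
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_power = NormalDist().inv_cdf(power)
    return (z_alpha + z_power) * sd_diff / math.sqrt(n_pairs)


if __name__ == "__main__":
    # Illustrative per-item scores for two conditions (made-up numbers).
    scores_a = [0.81, 0.77, 0.92, 0.65, 0.88, 0.73, 0.90, 0.69]
    scores_b = [0.78, 0.75, 0.85, 0.66, 0.80, 0.70, 0.86, 0.64]
    print(paired_bootstrap_ci(scores_a, scores_b))
    print(min_detectable_delta(sd_diff=1.0, n_pairs=100))
```

Resampling the differences rather than each condition separately preserves the pairing, which is what lets a paired design detect smaller effects than an unpaired one at the same sample size.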