Stats
Apply the Friedman test with Iman-Davenport correction [1].
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
data
|
EvaluationData
|
EvaluateData instance containing the calculated ranks. |
required |
Returns:
| Type | Description |
|---|---|
tuple[float, float]
|
An tuple containing (chi-squared stats, p-value) |
References: .. [1] Demšar, Janez. "Statistical comparisons of classifiers over multiple data sets." Journal of Machine learning research 7.Jan (2006): 1-30.
Source code in labicompare/stats/friedman.py
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 | |
Runs the Paired T-Test (Parametric) between two models.
Source code in labicompare/stats/pairwise.py
25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 | |
Runs the sign-rank test (non-parametric). Based in only who wins or loose each round, ignoring the scale of each differences.
Source code in labicompare/stats/pairwise.py
74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 | |
Executes the Wilcoxon signed-rank test (non-parametric) between two models. This is the ideal alternative for paired T-Test when the data do not follow a normal distribution. Consider the direction (who wins) and the scale of ranking differences.
Source code in labicompare/stats/pairwise.py
117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 | |