Every player in the Glicko-2 system has a rating,
,
a rating deviation,
, and a rating volatility
.
The volatility measure indicates the degree of expected
fluctuation in a player's rating.
The volatility measure is high when a player has erratic
performances (e.g., when the player has had exceptionally
strong results after a period of stability), and the
volatility measure is low when the player performs at a
consistent level.
As with the original Glicko system,
it is usually informative to summarize
a player's strength in the form of an interval
(rather than merely report a rating).
One way to do this is to report a 95% confidence interval.
The lowest value in the interval is the player's rating minus
twice the RD, and the highest value is the player's rating
plus twice the RD.
So, for example, if a player's rating is 1850 and the RD is
50, the interval would go from 1750 to 1950.
We would then say that we're 95% confident that the
player's actual strength is between 1750 and 1950.
When a player has a low RD, the interval would be narrow, so
that we would be 95% confident about a player's strength
being in a small interval of values.
The volatility measure does not appear in the calculation of
this interval.
The formulas:
To apply the rating algorithm, we treat a collection of games within a ``rating period'' to have occurred simultaneously. Players would have ratings, RD's, and volatilities at the beginning of the rating period, game outcomes would be observed, and then updated ratings, RD's and volatilities would be computed at the end of the rating period (which would then be used as the pre-period information for the subsequent rating period). The Glicko-2 system works best when the number of games in a rating period is moderate to large, say an average of at least 10-15 games per player in a rating period. The length of time for a rating period is at the discretion of the administrator.
The rating scale for Glicko-2 is different from that of the original Glicko system. However, it is easy to go back and forth between the two scales. The following steps assume that ratings are on the original Glicko scale, but the formulas convert to the Glicko-2 scale, and then convert back at the end to Glicko.
We now want to update the rating of a player with (Glicko-2)
rating
, rating deviation
, and volatility
.
He plays against
opponents with ratings
,
rating deviations
.
Let
be the scores against each opponent
(0 for a loss, 1 for a win).
The opponents' volatilities are not relevant in the calculations.
Note that if a player does not compete during the rating
period, then only Step 6 applies.
In this case, the player's rating and volatility parameters
remain the same, but the RD increases according to
Example calculation:
Suppose
a player rated 1500 competes against players rated
1400, 1550 and 1700, winning the first game and
losing the next two.
Assume the 1500-rated player's rating deviation
is 200, and his opponents' are 30, 100 and 300,
respectively.
Assume the 1500 player has volatility
,
and the system constant
is 0.5.
Converting to the Glicko-2 scale, the player's rating and RD become 0 and 1.1513. For the opponents:
|
|
|||||
| 1 | 0.1727 | 0.9955 | 0.639 | 1 | |
| 2 | 0.2878 | 0.5756 | 0.9531 | 0.432 | 0 |
| 3 | 1.1513 | 1.7269 | 0.7242 | 0.303 | 0 |
We then compute
And now
For the iterative procedure to determine
,
we set
and have
.
Performing the calculations iteratively, we obtain
Now update to the new value of
:
Next, update to the new values of
and
:
Finally, convert back to the Glicko scale:
Note that the resulting rating for this computation does not
differ much from the original Glicko computation because the
game outcomes do not provide any evidence of inconsistent
performance.