What is a Diplomacy Rating?
The Diplomacy Rating System is a method for calculating the relative skill levels of players in Diplomacy games. It is based on the Elo Rating System for 2-player games, expanded to accommodate the varying number of players in a game of Diplomacy. Elo was originally invented as a chess rating system, although it is used in many other games today. It serves as a rating system for competitive multi-player play in a number of computer games, and has been adapted to team sports including international soccer / football, Major League Baseball, and American college football and basketball.
One's Diplomacy Rating is a number, typically between 1000 and 2000, with new ratings starting around 1200-1300. A player's rating changes whenever a game in which he played is completed. With the completion of each game, points are reassigned from losers to winners. Points are awarded or removed on a scale that has solo wins at one end, followed by increasingly larger draws, down to survivals and the various types of eliminations. These point totals are affected to a lesser degree by the player's existing rating, the number and quality of his opponents, how long he survived, and with how many centers.
The Diplomacy Rating is a closed system: no points enter or leave the system via a game completion. New points are introduced only when new players join, and points leave the system only if a player retires.
Provisional Ratings
One's initial rating is the average of his ratings calculated from the results of his first 3 games:
Established Ratings
After the completion of 3 games, a rating becomes established. One's established rating change for any subsequent game is calculated as the sum of 3 components:
WLC = Win-Loss Change
The measure of one's actual result in a game: Win, Lose, or Draw. To calculate the Win-Loss change, one needs only to know the number of players in the game, and the result achieved by the player. A solo is the best possible result, followed by a small draw, then larger draws, and last, a loss.
Base Points (BP) are calculated as BP = 13 + P, where P is the number of players in the game.
The Points Awarded (PA) are obtained by scaling the Base Points down by the draw factor: PA = BP * DF
Next, the Points Won (WLC) are calculated by dividing the Points Awarded evenly among the R victors: WLC = PA / R
Large draws including most of the players will result in very few points being won, whereas small draws or solos have the greatest gains.
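The winner-side arithmetic above can be sketched in Python. This is a minimal illustration, assuming the draw factor DF = (P - R) / (P - 1) given in the Variables list and an equal split of the awarded points among the R players sharing the result. The function name `win_points` is hypothetical and not part of the rating system itself.

```python
def win_points(players: int, sharers: int) -> float:
    """Points gained by each player sharing a solo (sharers=1) or a draw."""
    if players < 2 or not 1 <= sharers <= players:
        raise ValueError("need players >= 2 and 1 <= sharers <= players")
    base_points = 13 + players                          # BP = 13 + P
    draw_factor = (players - sharers) / (players - 1)   # DF = (P - R) / (P - 1)
    points_awarded = base_points * draw_factor          # PA = BP * DF
    return points_awarded / sharers                     # split evenly among the victors


# Compare a solo with progressively larger draws in a 7-player game.
for sharers in (1, 2, 3, 4):
    print(f"{sharers}-way result: +{win_points(7, sharers):.2f} per winner")
```

Because DF falls and the divisor rises as more players share the result, the gain per winner drops quickly with draw size.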
Any result other than a Win or a Draw is a Loss. The points each player loses are simply the Base Points for the game divided by (the number of players minus 1). Each player loses the same amount:
In a 5-player game, each losing player would lose 18 / 4 = 4.50 points.
In a 7-player game, each losing player would lose 20 / 6 = 3.33 points.
In a 10-player game, each losing player would lose 23 / 9 = 2.56 points.
In a 17-player game, each losing player would lose 30 / 16 = 1.88 points.
Zero Sum: In all the scenarios above, the points won will always sum up to equal the points lost. The R winners together gain PA = BP * (P-R)/(P-1), and the P-R losers together lose (P-R) * BP/(P-1), which is the same amount.
SC = Survival Change:
The measure of one's ability to stay alive and control centers in a game. To calculate the Survival Base, one needs to understand two components: the Duration Base and the Center Base. Survival scores only take effect for players that score a loss in a game. They act as a consolation score, to indicate that a player has played a certain portion of the game, or even survived to the end controlling what might be a substantial number of centers (often referred to as Seconds). Players who survived longer will get a better score than those who were eliminated sooner, and players who end the game in control of more supply centers will score better than those who control fewer centers or were eliminated.
DB = Duration Base = 2 * (Player_Seasons - Player_NMRs) / (Total_Seasons)
CB = Center Base = 4 * (Player_Centers) / (Total_Centers)
The combined Survival Base amounts to a number between 0 and 4 points. A player that is eliminated may score up to 2 points, as their Center Base will be 0. A player that survives to the end will score between 2 and 4 points, as their Survival Base will be 2 (for survival to the end) plus up to another 2 based on the number of centers they control. A Second will score close to 4 points.
Any country that goes into Civil Disorder scores 0 for its Survival Score, regardless of how long it was played or how many centers it controlled - a player must play the game out to its conclusion to qualify for a Survival Score.
Zero Sum: After calculation of all the Survival Bases, one has a list of numbers between 0 and 4. These scores are shifted linearly about 0 to create a set that will sum to zero. The Survival Base scores are averaged, then that average is subtracted from each base to create the Survival Change, or SC.
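The Survival Base and its zero-sum shift can be sketched as follows. This is an illustrative sketch only: the function names, the `civil_disorder` flag, and the sample positions are assumptions made for the example, and the choice of which players' bases enter the average is left to the caller.

```python
def survival_base(player_seasons, player_nmrs, total_seasons,
                  player_centers, total_centers, civil_disorder=False):
    """Survival Base = Duration Base + Center Base, or 0 after Civil Disorder."""
    if civil_disorder:
        return 0.0
    duration_base = 2 * (player_seasons - player_nmrs) / total_seasons  # DB
    center_base = 4 * player_centers / total_centers                    # CB
    return duration_base + center_base


def survival_changes(bases):
    """Subtract the mean from every base so the changes sum to zero."""
    mean = sum(bases) / len(bases)
    return [base - mean for base in bases]


# Hypothetical 10-season game on a 34-center board (numbers are made up).
bases = [
    survival_base(10, 0, 10, 8, 34),                      # survived with 8 centers
    survival_base(10, 1, 10, 3, 34),                      # survived, one NMR, 3 centers
    survival_base(6, 0, 10, 0, 34),                       # eliminated after 6 seasons
    survival_base(4, 0, 10, 0, 34, civil_disorder=True),  # abandoned the position
]
for base, change in zip(bases, survival_changes(bases)):
    print(f"base {base:.2f} -> SC {change:+.2f}")
```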
A sample Survival calculation: 7-Player Game, 3-Way Draw, game time 5 years (10 turns):
PC = Performance Change:
The measure of one's actual result in a game relative to how well he should theoretically perform. To calculate the Performance Base, one needs only to know his rating, the average rating of his opposition, and his actual result.
A player's Performance Factor is equal to his Actual Score (SA) minus his Expected Score (EA). This will always produce a number between -1 and +1, indicating the degree to which a player exceeded (positive) or fell short of (negative) his expected result. The Performance Factor is multiplied by K to determine the Performance Base for points gained / lost.
K = A constant, 10.
SA = Actual Score, a fixed number based on result:
EA = Expectation, a number between 0 and 1, calculated based on the players' ratings
Some sample EA calculations:
Zero Sum: After calculation of all the Performance Bases, one has a list of numbers between -10 and +10. These scores are shifted linearly about 0 to create a set that will sum to zero. The Performance Base scores are averaged, then that average is subtracted from each base to create the Performance Change, or PC.
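A short sketch of the Performance Change step follows. It takes each player's SA and EA as plain inputs and does not model how either is derived. The function name and the sample (SA, EA) pairs are placeholders invented for illustration, not values taken from the rating system.

```python
K = 10  # the constant given in the text


def performance_changes(scores, k=K):
    """scores: list of (SA, EA) pairs, one per player, each value in [0, 1]."""
    # Performance Base = K * (SA - EA), a number between -k and +k.
    bases = [k * (actual - expected) for actual, expected in scores]
    # Shift by the mean so the Performance Changes sum to zero.
    mean = sum(bases) / len(bases)
    return [base - mean for base in bases]


# Placeholder inputs: one player who beat expectations, two who fell short.
sample = [(1.0, 0.4), (0.0, 0.25), (0.0, 0.35)]
for (actual, expected), change in zip(sample, performance_changes(sample)):
    print(f"SA={actual:.2f} EA={expected:.2f} -> PC {change:+.2f}")
```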
Replacement Positions
Some GMs allow the use of Replacement players to take over a country for which the original player is unable to continue playing. The Diplomacy Rating system is designed to accommodate this replacement phenomenon. No implication is made as to the intention of replaced players, whether their replacement is for good reason or otherwise - the rating system is intent-neutral, and only considers the statistics as of the time the one player left the game, and the final result that the replacement player obtains.
Replacement positions with a Provisional rating:
Replacement positions with an Established rating:
There is a strong incentive for replacement players to take on games: the worse the position taken over (by center count), the less liability the new player takes on, while still retaining the high potential for gain. There is also an incentive for a player that needs to leave a game to find a good substitute to take over his position. The original player is still liable for the bulk of the loss, but may be able to minimize that loss by finding a player who can bring the position to a good result.
Balance & Implications
The relative impact of the 3 components WLC, SC, and PC varies with differing game results:
Performance can't be measured absolutely; it can only be inferred from wins and losses. Ratings therefore have meaning only relative to other ratings. Both the average and the spread of ratings are arbitrarily chosen. The average Diplomacy rating was chosen to be around 1200-1300, with a low end of about 1100 and an upper end near 2000. The extreme values have yet to be seen, as too few games have been run to create such diversity.
Variables
P = Number of Players
R = Result (# of players sharing in the solo / draw)
DF = Draw_Factor = (P-R)/(P-1)
Ra_Avg = The average of the effective ratings for all countries in a game.
Ra_Eff = The effective rating of any country. If a country was played by one player from start to finish, then Ra_Eff = Ra_Sta. If a country was played by multiple players, Ra_Eff is the average of the ratings of all the players that played the country, weighted by the number of turns they played.
Ra_Opp = The average of the effective ratings of one's opponents.
Ra_Sta = A player's rating of record for a game. A player's rating is marked as of the date the game started (or as of the date joined if a replacement). Even though the player's rating may change during the course of the game, his rating for the purpose of all calculations related to the game is the rating he had when the game began.
DB = Duration Base = 2 * (Player_Seasons - Player_NMRs) / (Total_Seasons)
CB = Center Base = 4 * (Player_Centers) / (Total_Centers)
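The rating variables defined above, Ra_Eff and Ra_Opp, can be illustrated with a small Python sketch. The function names and the sample ratings are hypothetical. Each country is described by a list of (rating of record, turns played) pairs, one per player who held it.

```python
def effective_rating(stints):
    """Ra_Eff: turn-weighted average of the ratings of a country's players.

    stints: list of (rating_of_record, turns_played) pairs.
    """
    total_turns = sum(turns for _, turns in stints)
    if total_turns <= 0:
        raise ValueError("a country must have been played for at least one turn")
    return sum(rating * turns for rating, turns in stints) / total_turns


def opponent_average(effective_ratings, index):
    """Ra_Opp: mean effective rating of every country except the one at index."""
    others = effective_ratings[:index] + effective_ratings[index + 1:]
    return sum(others) / len(others)


# One country played start to finish, one handed to a replacement after 6 of 10 turns.
countries = [
    effective_rating([(1300, 10)]),             # single player: Ra_Eff = Ra_Sta
    effective_rating([(1200, 6), (1500, 4)]),   # original player plus replacement
    effective_rating([(1400, 10)]),
]
print(countries)
print(opponent_average(countries, 0))
```

With a single stint the weighted average reduces to the player's own rating of record, matching the Ra_Eff = Ra_Sta case.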
