WHR is a Bayesian rating system, of which there have already been some examples here on TL. What WHR adds is taking into account that a player's skill changes with time: it keeps track of those changes and adjusts them as it gets new information (more game results).
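Concretely, WHR models each player's rating as a random walk in time (a Wiener process): the further apart two games are, the more the rating is allowed to have drifted between them. A minimal sketch of that assumption; the drift scale w is an illustrative value, not from any particular implementation:

    import math

    def rating_drift_sd(days_elapsed, w=15.0):
        # Wiener-process assumption: between two points in a player's
        # history the rating drifts randomly, so the standard deviation
        # of the change grows with the square root of elapsed time.
        return w * math.sqrt(days_elapsed)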
Looking at the paper, it seems like this algorithm is an incremental improvement in predictive power (all the rating systems compared were within 1% of each other), but not one that justifies the significant increase in calculation complexity. Although I really like that it doesn't assume skill level is a time-invariant figure like most modern rating algorithms.
Another concern I have is how much complexity and effort it takes to parse game results into a format readable by this program. It seems like putting together Polt's dataset wasn't a trivial task.
IMO WHR is the best rating system I've seen so far. I've long been thinking about implementing it myself. Do you hardcode the parameters, or does your program automatically optimize some of them (for example the rank-time correlation parameter)?
An interesting way to improve this might be introducing per-matchup ratings, of course with some force pushing them towards their average.
Another might be a map imbalance parameter that represents a boost to the player skill difference. Then you optimize the map imbalance parameters like all the other parameters in your iteration step.
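For illustration, here is one way such a map parameter could enter the win probability, sketched with an Elo-scaled logistic model (the 400 scale and the sign convention are assumptions for the sketch, not from the OP's code):

    def win_prob(r1, r2, map_bias=0.0):
        # Probability that player 1 wins. map_bias shifts the effective
        # rating difference; a positive value means the map favors
        # player 1 (e.g. their side of the matchup). The bias would be
        # fit jointly with the ratings, like the other parameters.
        return 1.0 / (1.0 + 10.0 ** (-(r1 - r2 + map_bias) / 400.0))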
On August 31 2011 19:03 Primadog wrote: Looking at the paper, it seems like this algorithm is an incremental improvement in predictive power (all the rating systems compared were within 1% of each other), but not one that justifies the significant increase in calculation complexity.
The input set for this was even KGS matches (Go games). The KGS ranking system is already pretty good, especially for players who don't improve rapidly or who play many games. I assume the differences become bigger when the sample size becomes smaller.
On August 31 2011 19:03 Primadog wrote: Another concern I have is how much complexity and effort it takes to parse game results into a format readable by this program. It seems like putting together Polt's dataset wasn't a trivial task.
It's the same effort you need for all ranking systems. You need a list of (Date, Player1, Player2, Result) tuples, or if you go with my suggestions, (Date, Map, Player1, Player2, Race1, Race2, Result) tuples. All of these should be available in TLPD.
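As a sketch of the richer format (field names and the example values are made up for illustration):

    from collections import namedtuple

    # One row per game; 'result' is 1 if player1 won, 0 otherwise.
    Game = namedtuple('Game', ['date', 'map', 'player1', 'player2',
                               'race1', 'race2', 'result'])

    games = [
        Game('2011-07-15', 'SomeMap', 'PlayerA', 'PlayerB', 'T', 'Z', 1),
    ]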
The list yoyoma posted is Polt's rating history, which is part of the output of the program.
On August 31 2011 20:28 Sina92 wrote: why are people so obsessed with rating players?
Because people kind of like to know who's the best at something? And single tournament results are not a reliable method of measurement, for sample-size reasons?
On August 31 2011 20:28 Sina92 wrote: why are people so obsessed with rating players?
Because it is fundamental for people who are either betting on stuff or running betting sites. As for me, I just like the mathematical challenge inside of it, how to calculate a player's "skill" just through his results.
On August 31 2011 19:33 MasterOfChaos wrote: IMO WHR is the best rating system I've seen so far. I've long been thinking about implementing it myself. Do you hardcode the parameters, or does your program automatically optimize some of them (for example the rank-time correlation parameter)?
An interesting way to improve this might be introducing per-matchup ratings, of course with some force pushing them towards their average.
Another might be a map imbalance parameter that represents a boost to the player skill difference. Then you optimize the map imbalance parameters like all the other parameters in your iteration step.
I just manually tuned the parameters by looking at the results and adjusting them. If you're interested in looking at the code, the key parameters are:

    PRIOR_WEIGHT = 2.0     # how strongly players are pulled toward the average skill of 2000
    LINK_STRENGTH = 500.0  # how fast a player's skill can change over time
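My reading is that these act as two Gaussian penalty terms in the log-posterior, roughly like the sketch below; the exact scaling (the 1000 divisor, the per-year normalization) is my assumption, not the program's actual code:

    def log_anchor(r, weight=2.0):
        # Gaussian pull toward the average of 2000; 'weight' plays the
        # role of PRIOR_WEIGHT. The scale of 1000 is illustrative.
        return -weight * ((r - 2000.0) / 1000.0) ** 2 / 2.0

    def log_link(r1, r2, dt_days, strength=500.0):
        # Wiener-process link between two rating points of one player:
        # the allowed drift variance grows with elapsed time (dt_days > 0);
        # 'strength' plays the role of LINK_STRENGTH.
        variance = (strength ** 2) * dt_days / 365.0
        return -((r2 - r1) ** 2) / (2.0 * variance)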
On August 31 2011 20:25 Toppp wrote: Seal is 11.... -_-
He hasn't even qualified for Code A or been to any tournaments... (except GSTL if you count that)
Yes, I noticed that too. What happened with Seal is that he played a few games in 2010 and then didn't have any results for almost a year. So when he started back in July 2011, his rating was very uncertain and therefore moved very quickly. And he has done very well in his games since July 2011, going 5-1. I will look into accounting for uncertainty. See below for a more detailed view of his results and how the algorithm reacts.
On August 31 2011 20:02 kenkou wrote: BitByBit at 97. Something is wrong. I'm guessing it doesn't take into account how long the player hasn't played?
Yes, that's about what happened. BitByBit made a deep run in an early GSL and then disappeared from major LANs (GSL, GSTL). If he's still playing, I assume he's not doing so well, but I don't have those results in here. Another way to deal with this would be similar to Seal, by accounting for uncertainty.
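One simple way to account for that, as a sketch: track an uncertainty per player that grows while they are inactive, and rank by a conservative estimate instead of the mean. The growth rate and the factor k here are illustrative assumptions:

    import math

    def conservative_rating(mu, sigma, days_inactive, drift_per_day=1.0, k=2.0):
        # Rank by mean rating minus k standard deviations; the variance
        # grows linearly while a player is inactive (random-walk drift),
        # so long-absent players sink down the list until they play again.
        variance = sigma ** 2 + drift_per_day * days_inactive
        return mu - k * math.sqrt(variance)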
Why?
Well it's just a fun hobby for me. Fellow ratings math nerds understand. ;-)
On September 01 2011 00:55 Montana[TK] wrote: July at 9, DongRaeGu at 17
makes no sense whatsoever whichever way you look at it.
Do you not grasp this concept?
I understand the concept, but a few recent wins against the likes of Hongun and Ensnare shouldn't count more than the months of utter dominance DRG displayed, especially considering he just all-killed Prime and came 3rd in MLG.
All I'm saying is I wanna see how the ranking system came to those conclusions and whether I'm overlooking anything.