SSCAIT scores - compared to rating differences

This scatter chart shows rating differences versus score ratios. For each SSCAIT game in the dataset which ended with both scores above zero (about 80,000 games), the x-axis has the rating difference, winner’s rating minus loser’s rating. The ratings are calculated as of that game, so changing versions don’t mess things up (much). Check the Elo table to see what the rating differences mean. The y-axis has the score ratio, winner’s score divided by loser’s score. The y-axis is on a logarithmic scale and shows ratios from 0.2 to 200. A small number of points are off the top or bottom; no points are off the left or right. You can click through the graph to get the same image on its own, which may make it easier to zoom in and out.

Since it’s from the winner’s point of view, most games have rating difference > 0 (the higher rated bot won) and score ratio > 1 (the bot with the higher score won).

There is complex structure here, but I’m at a loss to interpret much of it. Horizontal lines show that some score ratios are popular, which seems like a quirk of the game. Beyond the sharp lines, some fuzzier stratification in the score ratios is visible. When the stronger bot wins, it is usually by a score ratio of at least 2, increasing slowly as the strength differences goes up. The slowly rising “soft floor” in the score ratio is interesting and surprising. There are other clear structures in the chart, but I don’t know what they mean. It is mildly interesting that the left side, when the stronger bot lost, looks less structured.

Games where the score ratio is less than 1 are games where stopping early and adjudicating by score would give the wrong answer. It’s rare... but not rare enough to give me confidence in the timeout adjudication procedure. Some points are off the bottom, so some bots won despite having less than 1/5 the score by SSCAIT rules. The adjudication procedure will make occasional extreme mistakes.

Next: The winning attitude.

Trackbacks

No Trackbacks

Comments

krasi0 on Sunday, October 30. 2016:

I wonder if the distinct horizontal lines could all be games of a specific bot with a strictly followed strategy like Iron.

It'd be also interesting to check what the cases of the outliers at the top left are... Perhaps relatively a strong bot losing to a weak 4-pooler?
Additionally, I'd expect to see many more dots in the bottom left quadrant than the top left one, but there seems to be something like a symmetry. Go figure...

Jay Scott on Sunday, October 30. 2016:

I have looked at the histogram of the scores, and (for whatever reason) it showed that some scores are way more popular than others. For example, 1291 games had score 50 and 5987 games had score 100, but only 22 games had other scores between 1 and 100. All scores are possible, but scores divisible by 50 and especially scores divisible by 100 are way more common. I think the horizontal lines are ratios of the popular scores that happened to occur together.

krasi0 on Monday, October 31. 2016:

Given that we're talking about killing enemy units and destroying enemy buildings, I could see where numbers cleanly divisible by 50 and 100 could come from. Scores between [1, 99] OTOH... what the heck?!

Jay Scott on Sunday, October 30. 2016:

I’ll post a data file so we can look into which bots are behind which dots. I’ve just generated a fresh one that includes the bot names. Hang on....

krasi0 on Monday, October 31. 2016:

I guess we'd need the total game score (including mined resources, etc.) logged, too. Even if it's not used for the purpose of adjudication.

Jay Scott on Monday, October 31. 2016:

That’s the thing about data—even when there is too much, there is never enough. Now you understand why Google and Facebook are fighting for world conquest. :-)

tscmoo on Monday, October 31. 2016:

The most likely scenario I can imagine for the horizontal lines is when a 4 pool bot beats a bot which does not manage to kill any of the zerglings; for instance when a terran opens with 11 rax, refinery and constant scv production, then the zerglings just come and kill everything. Multiple terran bots might open exactly the same and the score could be exactly the same every game. The different horizontal lines could be just variations of one less/one more worker etc.

tscmoo on Monday, October 31. 2016:

Since you need to kill something to get any score, I guess the terran would need to kill at least one zergling..

Jay Scott on Monday, October 31. 2016:

Also testable! The horizontal lines in the lower group are at integer ratios, 12 13 14 15 16, with the strongest at 16. Stand by, I’ll extract some opponents.

Jay Scott on Monday, October 31. 2016:

Hypothesis disproved. There are 348 games with score ratio 16, and the opponents are highly varied (reading up from the bottom of the file): WuliBot, Zia, Flash, Christoffer Artmann, Sijia Xu, NUS Bot, Andrew Smith, Florian Richoux.... The actual scores behind the ratios are mostly 1600 and 100, but there are others. The game somehow makes certain scores more common than others.

Add Comment

Name*

Homepage

Comment*

In reply to

E-Mail addresses will not be displayed and will only be used for E-Mail notifications.

To prevent automated Bots from commentspamming, please enter the string you see in the image below in the appropriate input box. Your comment will only be submitted if the strings match. Please ensure that your browser supports and accepts cookies, or your comment cannot be verified correctly.
CAPTCHA