archive by month
Skip to content

AIIDE 2022 - map tables by bot

For each bot, its win rate by map and opponent. You can abbreviate it as bot x (map x opponent) if you like. Yesterday’s tables showed that maps make little difference when averaged across opponents. Today’s show that (as usual) maps do make a difference for specific opponents.

Each cell represents 22 or 23 games, sometimes fewer when games did not complete. No cell has fewer than 20 games. The same tables last year had 15 games per cell. The numbers are a trifle more reliable this year, but there is still a lot of statistical noise.

#bananabrainoverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
2stardust52%52%65%32%32%77%45%50%50%50%64%
3dragon78%78%78%77%77%86%86%77%64%82%73%
4steamhammer89%87%96%82%86%91%86%77%100%86%95%
5purplewave69%87%65%73%64%86%50%55%68%68%73%
6mcrave94%91%83%95%95%91%95%91%95%100%100%
7microwave91%91%91%91%100%86%95%82%95%91%91%
8ualbertabot97%91%100%95%100%95%100%95%95%95%100%
9pylonpuller92%91%87%95%100%100%91%91%86%100%77%
10styx94%100%100%100%100%86%100%95%95%73%91%
11cunybot100%100%100%100%100%100%100%100%100%100%100%
overall85.53%87%87%84%85%90%85%81%85%85%86%

#1 BananaBrain was solid against most opponents, but inconsistent across maps versus its top protoss competition, #2 Stardust and #5 PurpleWave.

#stardustoverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain48%48%35%68%68%23%55%50%50%50%36%
3dragon92%100%95%77%100%100%95%64%100%91%95%
4steamhammer83%100%82%73%86%91%77%50%95%95%82%
5purplewave52%57%50%73%68%27%64%18%50%59%50%
6mcrave89%83%83%86%95%95%100%95%91%77%82%
7microwave93%96%100%95%100%100%95%95%77%82%86%
8ualbertabot83%83%86%91%100%77%100%55%77%82%82%
9pylonpuller95%96%95%100%100%100%100%86%77%91%100%
10styx84%100%95%100%95%77%91%32%86%86%73%
11cunybot97%96%100%100%100%100%100%95%95%86%95%
overall81.48%86%82%86%91%79%88%64%80%80%78%

Here is the source of #2 Stardust’s relative weakness on Empire of the Sun: #5 PurpleWave and #10 Styx found holes in its play on the map. The upset by Styx on that map only is particularly extreme. Heartbreak Ridge, Longinus, and Empire of the Sun are the maps where the main bases are on the same level as the naturals, with no ramp, and all of them had at least one opponent that could exploit Stardust. But if that’s the cause, then why is Aztec fine for Stardust? On Aztec, the naturals are uphill from the mains.

#dragonoverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain22%22%22%23%23%14%14%23%36%18%27%
2stardust8%0%5%23%0%0%5%36%0%9%5%
4steamhammer21%26%32%14%18%32%32%14%5%36%5%
5purplewave97%91%95%91%100%100%100%100%100%95%100%
6mcrave95%96%87%95%100%100%95%91%95%100%95%
7microwave56%65%55%36%45%64%73%59%55%59%50%
8ualbertabot77%82%64%95%80%73%77%77%68%77%73%
9pylonpuller98%100%100%95%100%91%95%100%100%100%100%
10styx94%96%100%100%91%91%95%100%100%82%86%
11cunybot95%96%96%95%91%95%100%91%100%95%95%
overall66.46%67%65%67%65%66%69%69%66%67%64%

Last year and the year before I thought that #3 Dragon was inconsistent across maps. This year it doesn’t look that way. It’s the same bot carried over. The difference seems to be that this year Dragon either smashed its opponents or got smashed by them. It remains inconsistent against #7 Microwave and #8 UAlbertaBot, the opponents scoring closest to 50%.

#steamhammeroverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain11%13%4%18%14%9%14%23%0%14%5%
2stardust17%0%18%27%14%9%23%50%5%5%18%
3dragon79%74%68%86%82%68%68%86%95%64%95%
5purplewave43%57%55%50%41%27%41%27%36%32%59%
6mcrave43%48%74%45%45%45%45%23%32%45%27%
7microwave73%70%57%50%82%68%91%77%77%91%68%
8ualbertabot95%91%100%100%86%100%95%95%100%91%95%
9pylonpuller80%70%64%82%86%86%82%91%82%77%82%
10styx90%91%86%100%95%86%100%95%95%77%73%
11cunybot97%100%96%100%100%91%95%95%100%100%95%
overall62.71%61%62%66%65%59%65%66%62%60%61%

Someday I will get Steamhammer to adapt properly to the map it is playing on.

#4 Steamhammer owes its ranking in large part to its strong performance against the carryover bots that it specifically prepared for. Versus #3 Dragon: Last year 63%, this year 79%. Versus #8 UAlbertaBot: Last year 92%, this year 95%. I knew that both would be up. I’m surprised that other bots seem to have been unprepared for Dragon in particular.

#purplewaveoverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain31%13%35%27%36%14%50%45%32%32%27%
2stardust48%43%50%27%32%73%36%82%50%41%50%
3dragon3%9%5%9%0%0%0%0%0%5%0%
4steamhammer57%43%45%50%59%73%59%73%64%68%41%
6mcrave84%26%57%100%91%86%91%100%100%95%100%
7microwave50%35%17%55%45%55%36%82%77%45%50%
8ualbertabot66%78%55%100%68%68%55%64%55%64%50%
9pylonpuller87%91%74%100%95%77%91%77%95%82%86%
10styx96%100%100%73%100%100%100%95%100%95%100%
11cunybot89%83%96%82%86%86%100%77%95%100%86%
overall61.17%52%53%62%61%63%62%70%67%63%59%

#5 PurpleWave struggled versus #6 McRave on the 2-player maps Destination and Heartbreak Ridge, but scored 100% on the other 2-player map Polaris Rhapsody. It smells like a bug—but see the next table.

#mcraveoverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain6%9%17%5%5%9%5%9%5%0%0%
2stardust11%17%17%14%5%5%0%5%9%23%18%
3dragon5%4%13%5%0%0%5%9%5%0%5%
4steamhammer57%52%26%55%55%55%55%77%68%55%73%
5purplewave16%74%43%0%9%14%9%0%0%5%0%
7microwave92%100%91%68%100%100%77%91%95%100%100%
8ualbertabot29%74%41%43%14%36%5%14%9%36%10%
9pylonpuller62%65%70%82%73%50%68%36%68%59%50%
10styx91%100%70%82%91%100%95%100%95%82%91%
11cunybot100%100%100%100%100%95%100%100%100%100%100%
overall46.79%60%49%45%45%46%42%44%45%46%45%

Why does #6 McRave like Destination? Mainly because of upsets against #5 PurpleWave and #8 UAlbertaBot that otherwise defeat it. If the win over PurpleWave is due to PurpleWave’s putative bug, then what explains the win over UAlbertaBot?

#microwaveoverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain9%9%9%9%0%14%5%18%5%9%9%
2stardust7%4%0%5%0%0%5%5%23%18%14%
3dragon44%35%45%64%55%36%27%41%45%41%50%
4steamhammer27%30%43%50%18%32%9%23%23%9%32%
5purplewave50%65%83%45%55%45%64%18%23%55%50%
6mcrave8%0%9%32%0%0%23%9%5%0%0%
8ualbertabot57%70%65%55%43%36%77%73%59%55%32%
9pylonpuller67%57%61%91%68%50%55%77%77%64%68%
10styx99%100%100%100%95%100%100%100%95%100%100%
11cunybot99%100%100%100%100%95%100%95%100%100%100%
overall46.62%47%52%55%43%41%46%46%45%45%45%

#ualbertabotoverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain3%9%0%5%0%5%0%5%5%5%0%
2stardust17%17%14%9%0%23%0%45%23%18%18%
3dragon23%18%36%5%20%27%23%23%32%23%27%
4steamhammer5%9%0%0%14%0%5%5%0%9%5%
5purplewave34%22%45%0%32%32%45%36%45%36%50%
6mcrave71%26%59%57%86%64%95%86%91%64%90%
7microwave43%30%35%45%57%64%23%27%41%45%68%
9pylonpuller66%74%64%91%82%59%64%43%45%64%77%
10styx95%86%100%100%95%95%86%82%100%100%100%
11cunybot98%100%100%95%100%100%100%95%100%91%100%
overall45.74%39%45%41%49%47%44%45%48%45%54%

It’s interesting that #8 UAlbertaBot does better against #6 McRave on the 4-player maps. You might think that UAlbertaBot’s rushes would work better on 2-player maps with a short rush distance, but it’s the opposite. I imagine it is because McRave takes longer to scout, so it can’t adapt as quickly.

#pylonpulleroverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain8%9%13%5%0%0%9%9%14%0%23%
2stardust5%4%5%0%0%0%0%14%23%9%0%
3dragon2%0%0%5%0%9%5%0%0%0%0%
4steamhammer20%30%36%18%14%14%18%9%18%23%18%
5purplewave13%9%26%0%5%23%9%23%5%18%14%
6mcrave38%35%30%18%27%50%32%64%32%41%50%
7microwave33%43%39%9%32%50%45%23%23%36%32%
8ualbertabot34%26%36%9%18%41%36%57%55%36%23%
10styx62%87%64%18%73%82%45%64%73%59%55%
11cunybot74%83%83%36%86%55%77%82%68%91%77%
overall28.91%33%33%12%25%32%28%34%31%31%29%

Wow, look at results versus #10 Styx. Polaris Rhapsody does seem to be an outlier among the 2-player maps.

#styxoverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain6%0%0%0%0%14%0%5%5%27%9%
2stardust16%0%5%0%5%23%9%68%14%14%27%
3dragon6%4%0%0%9%9%5%0%0%18%14%
4steamhammer10%9%14%0%5%14%0%5%5%23%27%
5purplewave4%0%0%27%0%0%0%5%0%5%0%
6mcrave9%0%30%18%9%0%5%0%5%18%9%
7microwave1%0%0%0%5%0%0%0%5%0%0%
8ualbertabot5%14%0%0%5%5%14%18%0%0%0%
9pylonpuller38%13%36%82%27%18%55%36%27%41%45%
11cunybot44%48%17%59%68%50%23%32%45%55%41%
overall13.92%9%10%19%13%13%11%17%10%20%17%

Only a few pinprick upsets, but one of them is extreme.

#cunybotoverallDestinHeartbPolariAztecLonginCircuiEmpireFightiPythonRoadki
1bananabrain0%0%0%0%0%0%0%0%0%0%0%
2stardust3%4%0%0%0%0%0%5%5%14%5%
3dragon5%4%4%5%9%5%0%9%0%5%5%
4steamhammer3%0%4%0%0%9%5%5%0%0%5%
5purplewave11%17%4%18%14%14%0%23%5%0%14%
6mcrave0%0%0%0%0%5%0%0%0%0%0%
7microwave1%0%0%0%0%5%0%5%0%0%0%
8ualbertabot2%0%0%5%0%0%0%5%0%9%0%
9pylonpuller26%17%17%64%14%45%23%18%32%9%23%
10styx56%52%83%41%32%50%77%68%55%45%59%
overall10.69%10%11%13%7%13%10%14%10%8%11%

Trackbacks

No Trackbacks

Comments

Bytekeeper on :

Stardust crashed a lot vs Styx on Empire (within the first few frames of the game).

Filtering out those, it's more like ~50% - which is still a good upset. Not as extreme, but still unexpected.

Bytekeeper on :

I think you lost a post?
The comment above was for another post that has gone missing.

Jay Scott on :

Thanks! It turns out that I was clever enough to overwrite this post with the next one instead of making a new post, and not clever enough to notice. Those measuring my degree of cleverness now have upper and lower bounds. Luckily I’m organized and don’t lose data to mistakes like that.

Fixed.

MicroDK on :

I think it is worth to mention that #5 PurpleWave also struggled vs #7 Microwave on Heartbreak Ridge. Overall that matchup was very inconsistent for each map going from 17% on Heartbreak Ridge to 82% on Empire of the Sun.

Jay Scott on :

Yes. Also Microwave did better in general on the 2-player maps. The difference is not big, but every 2-player map result was better than any result on the other maps.

Add Comment

E-Mail addresses will not be displayed and will only be used for E-Mail notifications.

To prevent automated Bots from commentspamming, please enter the string you see in the image below in the appropriate input box. Your comment will only be submitted if the strings match. Please ensure that your browser supports and accepts cookies, or your comment cannot be verified correctly.
CAPTCHA

Form options

Submitted comments will be subject to moderation before being displayed.