AIIDE 2023 - results
This crosstable exactly matches the official results. I had to make two adjustments compared to last year to get it to match:
1. In some failure conditions, the game is recorded anyway, provided it started in the first place. If the game did not start, last year its duration was recorded as 0 frames in detailed_results.txt
. This year it is recorded as -1. The HTML detailed results still gives the -1 as duration 00:00:00.
2. Game ID 979, between UAlbertaBot and BananaBrain, is recorded in detailed_results.txt
with finishing status NORMAL. But it’s obviously not normal; it looks like BananaBrain’s Starcraft instance crashed. There is no replay saved for BananaBrain, and no final score is recorded either. The official results correctly recognize that the game did not finish, and the HTML detailed results give the finishing status NO_REPORT, which means the game is not counted. But it’s still NORMAL in detailed_results.txt
, so I coded it as a one-game special case for this analysis.
# | bot | overall | bana | star | purp | stea | mcra | micr | drag | ualb | infe |
---|---|---|---|---|---|---|---|---|---|---|---|
1 | bananabrain | 84.54% | 48% | 58% | 97% | 86% | 95% | 94% | 99% | 100% | |
2 | stardust | 83.43% | 52% | 55% | 92% | 95% | 90% | 92% | 91% | 100% | |
3 | purplewave | 57.70% | 42% | 45% | 84% | 60% | 29% | 49% | 82% | 70% | |
4 | steamhammer | 50.11% | 3% | 8% | 16% | 50% | 62% | 67% | 97% | 99% | |
5 | mcrave | 49.81% | 14% | 5% | 40% | 50% | 89% | 65% | 35% | 99% | |
6 | microwave | 43.44% | 5% | 10% | 71% | 38% | 11% | 56% | 61% | 96% | |
7 | dragon | 42.28% | 6% | 8% | 51% | 33% | 35% | 44% | 83% | 78% | |
8 | ualbertabot | 30.45% | 1% | 9% | 18% | 3% | 65% | 39% | 17% | 91% | |
9 | infestedartosis | 8.28% | 0% | 0% | 30% | 1% | 1% | 4% | 22% | 9% |
The same as the official crosstable except for the presentation. Microwave upset PurpleWave even more strongly than last year. And McRave and UAlbertaBot are both carryovers, so of course UAlbertaBot upset McRave like last year.
# | bot | overall | bana | star | purp | stea | mcra | micr | drag | ualb | infe |
---|---|---|---|---|---|---|---|---|---|---|---|
1 | bananabrain | 10:47 14:37 | 17:22 14:46 | 11:33 12:16 | 11:02 8:01 | 11:46 18:46 | 11:09 5:48 | 9:02 22:59 | 10:25 8:52 | 6:04 - | |
2 | stardust | 10:43 16:51 | 14:46 17:22 | 14:10 18:06 | 11:01 14:57 | 11:57 16:54 | 10:07 6:10 | 10:40 24:26 | 9:36 6:13 | 9:58 - | |
3 | purplewave | 10:26 12:02 | 12:16 11:33 | 18:06 14:10 | 10:14 9:19 | 11:10 13:18 | 9:28 7:20 | 10:06 16:15 | 10:08 10:47 | 9:34 8:55 | |
4 | steamhammer | 7:02 10:45 | 8:01 11:02 | 14:57 11:01 | 9:19 10:14 | 8:14 8:50 | 13:42 11:23 | 12:26 19:05 | 8:56 11:36 | 5:09 12:25 | |
5 | mcrave | 9:15 11:27 | 18:46 11:46 | 16:54 11:57 | 13:18 11:10 | 8:50 8:14 | 8:22 6:04 | 20:43 21:33 | 12:49 9:09 | 5:47 9:03 | |
6 | microwave | 7:03 10:05 | 5:48 11:09 | 6:10 10:07 | 7:20 9:28 | 11:23 13:42 | 6:04 8:22 | 16:35 25:12 | 8:50 7:06 | 5:03 10:40 | |
7 | dragon | 15:45 12:05 | 22:59 9:02 | 24:26 10:40 | 16:15 10:06 | 19:05 12:26 | 21:33 20:43 | 25:12 16:35 | 12:09 8:39 | 11:28 12:40 | |
8 | ualbertabot | 6:54 10:08 | 8:52 10:25 | 6:13 9:36 | 10:47 10:08 | 11:36 8:56 | 9:09 12:49 | 7:06 8:50 | 8:39 12:09 | 6:28 8:37 | |
9 | infestedartosis | 9:52 6:31 | - 6:04 | - 9:58 | 8:55 9:34 | 12:25 5:09 | 9:03 5:47 | 10:40 5:03 | 12:40 11:28 | 8:37 6:28 |
The top number in each cell is the median duration of winning games for the bot on the left, and the bottom number is the median duration of losses. Winning fast and losing slowly suggests rush strategies or timing attacks. The opposite suggests cautious play and winning by defense into strong macro, or by gradually outplaying the opponent.
For example, Stardust likes to win with one big attack, and often wins faster than it loses—though it depends on the opponent. UAlbertaBot rushes with every race, and it shows in the numbers. Dragon likes to win by slowly pushing its opponents over. PurpleWave is comparatively balanced.
overall | vT | vP | vZ | vR | |
---|---|---|---|---|---|
terran | 42% | 21% | 48% | 83% | |
protoss | 75% | 79% | 83% | 91% | |
zerg | 38% | 52% | 17% | 50% | |
random | 30% | 17% | 9% | 50% |
Um, yeah, protoss came out on top. I think we knew that.
# | bot | overall | vT | vP | vZ | vR |
---|---|---|---|---|---|---|
1 | bananabrain | 84.54% | 94% | 53% | 94% | 99% |
2 | stardust | 83.43% | 92% | 53% | 94% | 91% |
3 | purplewave | 57.70% | 49% | 44% | 61% | 82% |
4 | steamhammer | 50.11% | 67% | 9% | 70% | 97% |
5 | mcrave | 49.81% | 65% | 20% | 79% | 35% |
6 | microwave | 43.44% | 56% | 29% | 48% | 61% |
7 | dragon | 42.28% | - | 21% | 48% | 83% |
8 | ualbertabot | 30.45% | 17% | 9% | 50% | - |
9 | infestedartosis | 8.28% | 22% | 10% | 2% | 9% |
For me, the lesson of the per-race table is that Steamhammer is very bad against protoss. Also, if it weren’t for the relative performance against UAlbertaBot, McRave would have come out well ahead of Steamhammer.
Comments