Starcraft AI blog | Entries from October 2021

AIIDE 2021 - Microwave versus BananaBrain

Blue is good for Microwave, red is good for BananaBrain.

microwave strategies versus bananabrain strategies

	overall	10/12gate	1basespeedzeal	2basespeedzeal	4gate2archon	5gategoon	9/9gate	9/9proxygate	bisu	neobisu	sairdt	sairgoon	sairreaver	stove
overall	21/157 13%	3/86 3%	1/1 100%	1/2 50%	1/2 50%	5/30 17%	3/15 20%	1/2 50%	1/2 50%	1/2 50%	1/5 20%	1/5 20%	1/2 50%	1/3 33%
11Gas10PoolLurker	0/3 0%	0/2 0%	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
11HatchTurtleHydra	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
11HatchTurtleMuta	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
12Hatch	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
12HatchTurtle	0/3 0%	0/1 0%	-	-	-	-	0/2 0%	-	-	-	-	-	-	-
12PoolMain	3/15 20%	1/10 10%	-	-	-	2/4 50%	-	0/1 0%	-	-	-	-	-	-
12PoolMuta	0/1 0%	-	-	-	-	-	0/1 0%	-	-	-	-	-	-	-
2HatchMuta	0/2 0%	0/1 0%	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
2HatchMuta_Sparkle	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
3HatchHydra	1/7 14%	0/3 0%	-	-	-	0/3 0%	-	-	-	-	-	1/1 100%	-	-
3HatchLingBust	0/4 0%	0/2 0%	-	-	-	-	0/1 0%	-	-	-	-	0/1 0%	-	-
3HatchLurker	0/1 0%	-	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
3HatchMuta	0/2 0%	0/2 0%	-	-	-	-	-	-	-	-	-	-	-	-
3HatchPoolHydra	0/3 0%	0/1 0%	-	-	-	0/1 0%	-	-	-	-	-	0/1 0%	-	-
4HatchBeforeGas	0/1 0%	-	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
4HatchPool	0/1 0%	-	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
4HatchPoolHydra	0/3 0%	0/2 0%	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
4PoolHard	0/7 0%	0/3 0%	-	-	-	0/1 0%	0/1 0%	-	-	-	0/1 0%	-	-	0/1 0%
4PoolSoft	0/1 0%	-	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
6Pool	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
7Pool	0/2 0%	0/2 0%	-	-	-	-	-	-	-	-	-	-	-	-
7PoolHydraRush7D	0/2 0%	0/1 0%	-	-	-	-	-	-	-	-	0/1 0%	-	-	-
8Pool	0/2 0%	0/1 0%	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
9Hatch9Pool9Gas	0/2 0%	0/2 0%	-	-	-	-	-	-	-	-	-	-	-	-
9HatchTurtleHydra	0/1 0%	-	-	-	-	-	-	-	-	0/1 0%	-	-	-	-
9Pool	0/1 0%	-	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
9PoolGasHatchSpeed8D	16/57 28%	2/24 8%	1/1 100%	1/2 50%	1/2 50%	3/9 33%	3/8 38%	-	1/2 50%	1/1 100%	1/3 33%	0/2 0%	1/1 100%	1/2 50%
9PoolHatchGasSpeed8D	1/11 9%	0/7 0%	-	-	-	0/1 0%	0/2 0%	1/1 100%	-	-	-	-	-	-
9PoolSpeed	0/1 0%	-	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
9PoolSpeedLing	0/8 0%	0/7 0%	-	-	-	0/1 0%	-	-	-	-	-	-	-	-
Overpool	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
OverpoolTurtle	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
ZvP_10Hatch9Pool	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	0/1 0%	-
ZvP_2HatchHydra	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
ZvP_9Hatch9Pool	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
ZvZ_Overgas11Pool	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
ZvZ_Overgas9Pool	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
ZvZ_Overpool9Gas	0/3 0%	0/3 0%	-	-	-	-	-	-	-	-	-	-	-	-
ZvZ_OverpoolTurtle	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-

That 9PoolGasHatchSpeed8D (a variant of the Styx build) must be behind many of the NakedExpand wins. Well, we already knew that, because it is behind most of the wins altogether. The interesting discovery is that it had some success against a wide range of BananaBrain openings.

The overall impression from this table is that both bots were feeling around in the dark. And we know how they work, so we know it’s true!

microwave as seen by bananabrain

microwave played	#	bananabrain recognized
11Gas10PoolLurker	3	3 12pool
11HatchTurtleHydra	1	1 12hatch
11HatchTurtleMuta	1	1 12hatch
12Hatch	1	1 12hatch
12HatchTurtle	3	3 12hatch
12PoolMain	15	9 12pool | 4 unknown | 2 10hatch
12PoolMuta	1	1 12pool
2HatchMuta	2	2 12hatch
2HatchMuta_Sparkle	1	1 12hatch
3HatchHydra	7	7 12hatch
3HatchLingBust	4	4 12hatch
3HatchLurker	1	1 12hatch
3HatchMuta	2	2 12hatch
3HatchPoolHydra	3	3 12hatch
4HatchBeforeGas	1	1 12hatch
4HatchPool	1	1 12hatch
4HatchPoolHydra	3	2 12hatch | 1 unknown
4PoolHard	7	7 4/5pool
4PoolSoft	1	1 4/5pool
6Pool	1	1 4/5pool
7Pool	2	2 4/5pool
7PoolHydraRush7D	2	2 4/5pool
8Pool	2	2 9pool
9Hatch9Pool9Gas	2	2 10hatch
9HatchTurtleHydra	1	1 10hatch
9Pool	1	1 9pool
9PoolGasHatchSpeed8D	57	49 9pool | 8 overpool
9PoolHatchGasSpeed8D	11	6 9pool | 3 overpool | 2 unknown
9PoolSpeed	1	1 9poolspeed
9PoolSpeedLing	8	5 9poolspeed | 3 overpool
Overpool	1	1 overpool
OverpoolTurtle	1	1 overpool
ZvP_10Hatch9Pool	1	1 10hatch
ZvP_2HatchHydra	1	1 12hatch
ZvP_9Hatch9Pool	1	1 10hatch
ZvZ_Overgas11Pool	1	1 10hatch
ZvZ_Overgas9Pool	1	1 unknown
ZvZ_Overpool9Gas	3	3 overpool
ZvZ_OverpoolTurtle	1	1 overpool

BananaBrain’s recognition is generally close enough. 11 hatch can be treated like 12 hatch, though it is sometimes possible to exploit the slight slowness of 12 hatch. Treating 7 pool like 5 pool is probably close enough for bot play. The only harmful mistake was in recognizing overpool when facing the aggressive 9PoolGasHatchSpeed8D.

bananabrain as seen by microwave

bananabrain played	#	microwave recognized
10/12gate	86	61 HeavyRush | 16 Unknown | 8 NakedExpand | 1 SafeExpand
1basespeedzeal	1	1 Unknown
2basespeedzeal	2	1 NakedExpand | 1 Turtle
4gate2archon	2	1 Turtle | 1 NakedExpand
5gategoon	30	15 SafeExpand | 7 NakedExpand | 5 Turtle | 2 HeavyRush | 1 Unknown
9/9gate	15	9 HeavyRush | 4 Unknown | 2 NakedExpand
9/9proxygate	2	1 Unknown | 1 HeavyRush
bisu	2	1 SafeExpand | 1 Turtle
neobisu	2	1 NakedExpand | 1 SafeExpand
sairdt	5	3 HeavyRush | 2 Unknown
sairgoon	5	4 SafeExpand | 1 NakedExpand
sairreaver	2	1 Turtle | 1 NakedExpand
stove	3	1 Unknown | 1 HeavyRush | 1 NakedExpand

Here we see where BananaBrain’s NakedExpand losses came from: It didn’t simply play 12 nexus or whatever, but expanded behind the cover of pressure from some other build. Could it be that Microwave often didn’t scout the nexus unless it had defeated the pressure? Looking at this, it seems more like a BananaBrain weakness—eagerness to expand at a given time whether it is safe or not—than a Microwave strength. But then again, Microwave scored well against NakedExpand for almost every opponent that played it.

AIIDE 2021 - what Microwave learned

Microwave’s history files include large doses of training data prepared before the tournament. I snipped that data out, so the tables here include only tournament games and the 7 post-tournament games that are excluded from the official results.

#1 stardust

opening	games	wins	first	last
10Hatch9Pool9gas	2	0%	95	115
10HatchMain9Pool9Gas	1	0%	85	85
10HatchTurtleHydra	1	0%	50	50
11Gas10PoolLurker	1	0%	60	60
11HatchTurtleHydra	5	0%	4	153
11HatchTurtleLurker	2	0%	70	122
11HatchTurtleMuta	2	0%	30	76
12Hatch	2	0%	72	105
12HatchMain	2	0%	88	89
12HatchTurtle	1	0%	0	0
12Pool	1	0%	145	145
12PoolMain	1	0%	127	127
12PoolMuta	1	0%	108	108
1HatchMuta_Sparkle	2	0%	54	155
2HatchHydra	1	0%	98	98
2HatchLurker	2	0%	10	23
2HatchMuta	2	0%	43	81
2HatchMuta_Sparkle	3	0%	19	96
3Hatch	2	0%	13	132
3HatchExpo	1	0%	136	136
3HatchHydra	2	0%	59	114
3HatchHydraBust	2	0%	40	41
3HatchHydraExpo	1	0%	24	24
3HatchLingBust	4	0%	46	151
3HatchMuta	1	0%	8	8
3HatchMutaExpo	7	0%	25	137
3HatchMuta_Sparkle	1	0%	12	12
3HatchPool	2	0%	15	62
3HatchPoolHydra	4	0%	65	113
3HatchPoolHydraExpo	2	0%	47	75
4HatchBeforeGas	5	0%	16	130
4HatchPool	1	0%	129	129
4HatchPoolHydra	1	0%	149	149
4PoolHard	3	0%	38	124
4PoolSoft	6	0%	2	121
5HatchPoolHydra	5	0%	6	117
5Pool	5	0%	18	138
5PoolSpeed	1	0%	103	103
6Pool	1	0%	128	128
6PoolSpeed	2	0%	79	99
7PoolHydraLingRush7D	1	0%	29	29
7PoolHydraRush7D	2	0%	56	152
8Pool	3	0%	48	110
9Hatch9Pool9Gas	3	0%	17	51
9HatchMain8Pool8Gas	2	0%	69	140
9Pool	2	0%	74	92
9PoolExpo	1	0%	123	123
9PoolGasHatchSpeed7D	1	0%	71	71
9PoolGasHatchSpeed8D	6	0%	5	90
9PoolHatch	3	0%	7	119
9PoolHatchGasSpeed7D	1	0%	3	3
9PoolHatchGasSpeed8D	2	0%	106	142
9PoolLurker	3	0%	34	120
9PoolSpeed	3	0%	11	147
9PoolSpeedLing	4	0%	36	154
9PoolSunken	1	0%	118	118
OverpoolSpeed	6	0%	9	146
OverpoolSunken	1	0%	148	148
OverpoolTurtle	4	0%	45	135
ZvP_10Hatch9Pool	2	0%	44	78
ZvP_11Hatch10Pool	4	0%	14	150
ZvP_2HatchHydra	1	0%	156	156
ZvP_9Hatch9Pool	4	0%	100	144
ZvZ_Overgas9Pool	2	0%	1	139
ZvZ_Overpool11Gas	1	0%	53	53
ZvZ_Overpool9Gas	2	0%	26	57
ZvZ_OverpoolTurtle	1	0%	109	109
67 openings	157	0%

enemy	games	wins
HeavyRush	23	0%
NakedExpand	13	0%
Unknown	121	0%
3 openings	157	0%

Microwave has many strategies. I counted 79, compared to 73 last year (my first impression was that there were many more this year, but it was not true). Like Steamhammer, when losing badly it flails, trying anything.

#2 bananabrain

opening	games	wins	first	last
11Gas10PoolLurker	3	0%	45	133
11HatchTurtleHydra	1	0%	85	85
11HatchTurtleMuta	1	0%	89	89
12Hatch	1	0%	120	120
12HatchTurtle	3	0%	67	72
12PoolMain	15	20%	39	145
12PoolMuta	1	0%	81	81
2HatchMuta	2	0%	52	122
2HatchMuta_Sparkle	1	0%	83	83
3HatchHydra	7	14%	30	100
3HatchLingBust	4	0%	12	137
3HatchLurker	1	0%	44	44
3HatchMuta	2	0%	91	111
3HatchPoolHydra	3	0%	27	148
4HatchBeforeGas	1	0%	48	48
4HatchPool	1	0%	58	58
4HatchPoolHydra	3	0%	46	134
4PoolHard	7	0%	0	141
4PoolSoft	1	0%	77	77
6Pool	1	0%	37	37
7Pool	2	0%	32	128
7PoolHydraRush7D	2	0%	2	152
8Pool	2	0%	56	121
9Hatch9Pool9Gas	2	0%	101	125
9HatchTurtleHydra	1	0%	21	21
9Pool	1	0%	41	41
9PoolGasHatchSpeed8D	57	28%	1	154
9PoolHatchGasSpeed8D	11	9%	13	147
9PoolSpeed	1	0%	61	61
9PoolSpeedLing	8	0%	36	156
Overpool	1	0%	86	86
OverpoolTurtle	1	0%	155	155
ZvP_10Hatch9Pool	1	0%	5	5
ZvP_2HatchHydra	1	0%	112	112
ZvP_9Hatch9Pool	1	0%	98	98
ZvZ_Overgas11Pool	1	0%	119	119
ZvZ_Overgas9Pool	1	0%	106	106
ZvZ_Overpool9Gas	3	0%	66	151
ZvZ_OverpoolTurtle	1	0%	146	146
39 openings	157	13%

enemy	games	wins
HeavyRush	77	4%
NakedExpand	23	48%
SafeExpand	22	5%
Turtle	9	11%
Unknown	26	19%
5 openings	157	13%

See NakedExpand in the enemy table: Microwave was able to punish BananaBrain when BananaBrain made nexus before cannons. It’s a theme. I think it indicates general aggressiveness or rushiness in the early game. I get 6:52 for Microwave’s median game length when defeating BananaBrain.

#3 dragon

opening	games	wins	first	last
10Hatch9Pool9gas	1	0%	87	87
11Gas10PoolMuta	10	60%	62	156
11HatchTurtleHydra	1	0%	59	59
12HatchTurtle	2	50%	92	93
12PoolMuta	1	0%	142	142
1HatchMuta_Sparkle	3	33%	30	103
2HatchLurker	1	0%	98	98
2HatchMuta	6	33%	7	107
3HatchHydra	2	0%	20	24
3HatchLingBust	1	0%	41	41
3HatchLurker	17	47%	76	110
3HatchMuta	2	0%	134	135
3HatchMutaExpo	1	0%	84	84
4PoolHard	4	0%	0	63
4PoolSoft	25	12%	1	136
5HatchPool	10	60%	114	132
5HatchPoolHydra	22	68%	5	154
7Pool	1	0%	34	34
7PoolHydraRush7D	1	0%	64	64
9Pool	3	0%	9	73
9PoolHatch	1	0%	105	105
9PoolHatchGasSpeed8D	1	0%	32	32
9PoolLurker	2	0%	72	113
9PoolSpeed	26	42%	3	101
9PoolSpeedLing	8	12%	53	131
ZvZ_Overgas11Pool	1	0%	21	21
ZvZ_Overgas9Pool	1	0%	54	54
ZvZ_Overpool9Gas	3	33%	12	127
28 openings	157	35%

enemy	games	wins
Factory	57	33%
HeavyRush	10	10%
SafeExpand	42	36%
Turtle	9	33%
Unknown	37	41%
WorkerRush	2	100%
6 openings	157	35%

Microwave had 3 builds that scored above 50%, and others that were close, but experimented too much. It would have scored much higher if it had exploited more and explored less. I think the lesson is that the learning algorithm’s exploration parameter should be set depending on the opponent’s expected strength. If you’re expecting to score 35% and you find a choice that scores 60%, reduce your exploration and exploit the winner unless and until the opponent adapts. If you have more than one high-scoring build (and they’re not too much alike), switch between them and your opponent will have more trouble adapting. If you’re expecting to score 70%, keep exploring if your best choice is only 60%.
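As a rough illustration of that idea, here is a minimal Python sketch of an exploration rate that depends on the expected score. It is not Microwave's actual learning code: the function names, the `base`, `floor`, and `min_games` values, and the 0.1 tolerance for "also a high scorer" are all assumptions made for the example. The win counts in the usage note are reconstructed from the rounded percentages in the Dragon table above.

```python
import random

def exploration_rate(expected_win_rate, best_observed_rate, base=0.3, floor=0.02):
    """Hypothetical rule: explore little once the best build beats expectations."""
    if best_observed_rate >= expected_win_rate:
        return floor
    # Otherwise explore more the further the best build falls short.
    shortfall = expected_win_rate - best_observed_rate
    return min(base, floor + shortfall)

def choose_opening(stats, expected_win_rate, min_games=5, rng=random):
    """stats maps opening name -> (games, wins); returns the opening to play next."""
    # Only openings with enough games count as having a trustworthy win rate.
    tried = {name: wins / games
             for name, (games, wins) in stats.items() if games >= min_games}
    best_rate = max(tried.values(), default=0.0)
    epsilon = exploration_rate(expected_win_rate, best_rate)
    if not tried or rng.random() < epsilon:
        return rng.choice(sorted(stats))  # explore: any known opening
    # Exploit: switch among the high scorers so the opponent has more trouble adapting.
    good = [name for name, rate in tried.items() if rate >= best_rate - 0.1]
    return rng.choice(sorted(good))
```

Called with `{"5HatchPoolHydra": (22, 15), "11Gas10PoolMuta": (10, 6), "4PoolSoft": (25, 3)}` and an expected win rate of 0.35, the sketch drops to the floor exploration rate and mostly alternates between the two strong builds. With an expected win rate of 0.70 it keeps exploring, because the best build is still below expectations.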

#4 steamhammer

opening	games	wins	first	last
11HatchTurtleLurker	1	0%	91	91
12Hatch	1	0%	105	105
12HatchTurtle	7	43%	148	156
12Pool	1	0%	19	19
2HatchLurker	1	0%	138	138
3HatchExpo	9	33%	96	136
3HatchHydraExpo	2	0%	88	126
3HatchLingBust	6	33%	83	128
3HatchMuta_Sparkle	3	33%	142	144
3HatchPool	1	0%	38	38
3HatchPoolExpo	1	0%	82	82
4PoolHard	9	11%	18	137
4PoolSoft	6	17%	0	85
5HatchPool	1	0%	43	43
5HatchPoolHydra	1	0%	1	1
5Pool	11	9%	6	119
5PoolSpeed	1	0%	52	52
6Pool	1	0%	101	101
7PoolHydraRush7D	1	0%	111	111
8Pool	1	0%	79	79
9PoolExpo	2	0%	68	106
9PoolGasHatchSpeed8D	1	0%	41	41
9PoolHatch	4	25%	34	125
9PoolHydra	1	0%	78	78
9PoolLurker	1	0%	14	14
9PoolSpeed	21	33%	4	134
9PoolSpeedLing	27	48%	48	157
9PoolSunkHatch	1	0%	92	92
9PoolSunken	1	0%	147	147
OverpoolSpeed	1	0%	20	20
ZvP_11Hatch10Pool	1	0%	2	2
ZvP_2HatchHydra	1	0%	121	121
ZvZ_Overgas11Pool	25	40%	25	135
ZvZ_Overpool11Gas	5	0%	9	141
ZvZ_Overpool9Gas	1	0%	21	21
35 openings	158	27%

enemy	games	wins
HeavyRush	134	22%
NakedExpand	6	83%
Turtle	8	25%
Unknown	10	60%
4 openings	158	27%

Even against a higher-finishing zerg, Microwave punishes the fast expansion when it recognizes it. It’s striking. Zerg is not able to add static defense to a base before the base finishes, so this ability to restrict the opponent’s strategy could be even more powerful in ZvZ.

#5 mcrave

opening	games	wins	first	last
10HatchMain9Pool9Gas	2	0%	24	124
10HatchTurtleHydra	4	25%	27	67
12Hatch	1	0%	109	109
12HatchMain	1	0%	4	4
12PoolMain	2	0%	2	77
12PoolMuta	1	0%	8	8
2HatchHydra	1	0%	128	128
2HatchLurkerAllIn	1	0%	90	90
2HatchMuta	1	0%	17	17
3Hatch	8	38%	81	101
3HatchMuta	1	0%	76	76
3HatchPoolHydraExpo	2	0%	71	88
4HatchBeforeGas	1	0%	84	84
4HatchPoolHydra	1	0%	126	126
4PoolHard	5	20%	9	135
4PoolSoft	5	40%	6	156
5Pool	1	0%	19	19
6Pool	2	0%	16	125
8Pool	3	0%	0	18
8PoolHydraRush8D	1	0%	49	49
9HatchTurtleHydra	1	0%	57	57
9Pool	8	38%	12	122
9PoolExpo	6	33%	96	117
9PoolGasHatchSpeed7D	1	0%	14	14
9PoolHatch	1	0%	142	142
9PoolSpeed	11	36%	1	123
Overpool	1	0%	73	73
OverpoolLurker	1	0%	63	63
ZvP_10Hatch9Pool	49	51%	38	155
ZvP_11Hatch10Pool	1	0%	91	91
ZvP_9Hatch9Pool	31	65%	10	153
ZvZ_Overpool9Gas	2	0%	3	7
32 openings	157	39%

enemy	games	wins
FastRush	4	0%
HeavyRush	140	40%
Turtle	7	14%
Unknown	6	67%
4 openings	157	39%

#6 willyt

opening	games	wins	first	last
10Hatch9Pool9gas	3	33%	36	149
10HatchMain9Pool9Gas	3	33%	126	146
12HatchMain	2	0%	31	108
12HatchTurtle	1	0%	105	105
12Pool	12	50%	78	139
12PoolMuta	5	20%	35	141
2HatchHydra	2	0%	77	101
2HatchLurker	8	25%	3	150
3HatchExpo	3	33%	115	145
3HatchHydraBust	1	0%	69	69
3HatchLingBust	6	50%	95	156
3HatchMuta	11	36%	0	57
3HatchMutaExpo	2	0%	54	125
3HatchPool	1	0%	12	12
3HatchPoolHydraExpo	3	33%	33	47
4PoolSoft	22	36%	59	154
5HatchPool	1	0%	56	56
7PoolHydraRush7D	1	0%	117	117
8PoolHydraRush8D	1	0%	73	73
9Hatch9Pool9Gas	7	29%	2	63
9HatchMain8Pool8Gas	1	0%	106	106
9Pool	6	33%	1	39
9PoolExpo	12	50%	7	153
9PoolGasHatchSpeed7D	4	50%	71	81
9PoolGasHatchSpeed8D	2	50%	62	82
9PoolHatchGasSpeed7D	1	0%	29	29
9PoolHydra	1	0%	91	91
9PoolLurker	7	29%	43	142
9PoolSpeed	4	50%	14	58
9PoolSpeedLing	1	0%	103	103
9PoolSunkHatch	1	0%	51	51
9PoolSunken	12	42%	22	94
OverpoolLurker	1	0%	32	32
OverpoolSpeed	1	0%	55	55
OverpoolSunken	1	0%	72	72
ZvP_10Hatch9Pool	1	0%	129	129
ZvP_9Hatch9Pool	3	33%	41	61
ZvZ_Overgas9Pool	1	0%	66	66
ZvZ_Overpool11Gas	1	0%	52	52
ZvZ_Overpool9Gas	1	0%	88	88
40 openings	157	32%

enemy	games	wins
Factory	53	26%
HeavyRush	22	14%
NakedExpand	16	100%
Proxy	1	0%
SafeExpand	11	18%
Unknown	54	30%
6 openings	157	32%

And against terran, too. Against Microwave, if expanding early, apparently terran and protoss should add defenses at the natural first.

#8 daqin

opening	games	wins	first	last
1HatchMuta_Sparkle	33	82%	45	153
3HatchHydra	1	0%	136	136
3HatchLurker	1	0%	38	38
3HatchMuta	106	90%	0	156
3HatchMutaExpo	1	0%	108	108
4HatchPoolHydra	1	100%	25	25
5HatchPoolHydra	2	50%	10	132
6Pool	1	0%	92	92
6PoolSpeed	1	0%	110	110
9PoolHatchGasSpeed7D	3	33%	6	46
9PoolHatchGasSpeed8D	6	50%	2	65
9PoolSpeedLing	1	0%	44	44
12 openings	157	82%

enemy	games	wins
HeavyRush	3	100%
NakedExpand	4	75%
SafeExpand	43	72%
Turtle	89	88%
Unknown	18	72%
5 openings	157	82%

The first opponent that Microwave outscored, and it was a runaway. Steamhammer struggled versus DaQin, but the other zergs were fine. Later I’ll examine why to see if there are lessons for Steamhammer.

#9 freshmeat

opening	games	wins	first	last
2HatchLurker	1	0%	26	26
4PoolSoft	13	46%	0	144
9PoolHatch	44	86%	5	156
9PoolSpeedLing	65	89%	1	154
OverpoolSpeed	34	82%	3	149
5 openings	157	83%

enemy	games	wins
FastRush	15	87%
HeavyRush	36	61%
NakedExpand	11	100%
Turtle	41	85%
Unknown	54	91%
5 openings	157	83%

The strong results against NakedExpand show here too.

#10 ualbertabot

opening	games	wins	first	last
10Hatch9Pool9gas	2	50%	87	107
11Gas10PoolLurker	68	74%	27	147
12Hatch	2	0%	24	140
12Pool	1	0%	13	13
1HatchMuta_Sparkle	1	0%	148	148
2HatchLurker	1	0%	121	121
2HatchLurkerAllIn	1	0%	14	14
2HatchMuta_Sparkle	1	0%	122	122
3HatchHydraExpo	1	0%	33	33
3HatchLurker	1	0%	20	20
4PoolSoft	6	17%	0	18
5Pool	3	67%	135	145
5PoolSpeed	9	33%	1	116
9Pool	1	0%	7	7
9PoolExpo	1	0%	52	52
9PoolGasHatchSpeed8D	33	58%	9	95
9PoolHatch	5	40%	43	99
9PoolSpeed	1	0%	137	137
9PoolSpeedLing	6	100%	149	154
9PoolSunken	3	33%	98	141
ZvP_10Hatch9Pool	1	0%	110	110
ZvP_2HatchHydra	1	0%	143	143
ZvZ_Overgas11Pool	5	60%	111	142
ZvZ_Overpool9Gas	1	0%	5	5
24 openings	155	57%

enemy	games	wins
Factory	14	93%
FastRush	32	53%
HeavyRush	78	45%
NakedExpand	8	100%
Unknown	23	65%
5 openings	155	57%

Like McRave but apparently for a different reason, Microwave had unnecessary trouble with UAlbertaBot. It takes more than a simpleminded learning algorithm to adapt to a random opponent with such different rushes for each race. The ideal answer is to adapt during the game after scouting. Steamhammer’s answer is a super-turtle build that defeats all of UAlbertaBot’s rushes. But still, see that 100% next to NakedExpand?

AIIDE 2021 - McRave versus WillyT

These tables tell more about McRave than about WillyT. Blue is good for McRave, red is good for WillyT.

mcrave strategies versus willyt strategies

	overall	1 rush	2 fe bio-mech	3 fe mech	4 tonk
overall	48/157 31%	16/41 39%	9/54 17%	12/47 26%	11/15 73%
HatchPool,12Hatch,2HatchMuta	29/89 33%	10/20 50%	6/32 19%	10/33 30%	3/4 75%
PoolHatch,12Pool,3HatchMuta	19/55 35%	6/18 33%	3/16 19%	2/12 17%	8/9 89%
PoolHatch,Overpool,2HatchMuta	0/13 0%	0/3 0%	0/6 0%	0/2 0%	0/2 0%

I find it strange that McRave’s overpool into 2 hatch muta failed in every case. Is it a reaction build that turned out to be a misreaction to what WillyT does? Probably not, it was tried against every WillyT opener. McRave’s other 2 builds were about equal, though the table shows that they were best in different cases. Switching between them was likely correct. The ratio that they were tried in also looks good to me: You want a ratio that leads to the final results being about equal.

WillyT would have done better without 15 tonk builds.

willyt as seen by mcrave

willyt played	#	mcrave recognized
1 rush	41	40 Unknown,Unknown,Unknown | 1 RaxCC,1RaxFE,Unknown
2 fe bio-mech	54	23 RaxCC,1RaxFE,Unknown | 20 RaxCC,1RaxFE,1FactTanks | 6 Unknown,Unknown,Unknown | 5 RaxCC,1RaxFE,5FactGoliath
3 fe mech	47	33 RaxCC,1RaxFE,5FactGoliath | 8 Unknown,Unknown,Unknown | 6 RaxCC,1RaxFE,Unknown
4 tonk	15	13 Unknown,Unknown,Unknown | 2 RaxFact,Unknown,5FactGoliath

Both the rush and the tonk build usually denied scouting, which seems like it should have been important because the builds call for opposite reactions. Yet McRave defeated the tanks and had less trouble with the rush than with WillyT’s expansion builds. RaxCC and 1RaxFE seem simple enough to recognize, and were. The followup seems harder to recognize, and was. I doubt that so many were actually 5FactGoliath.

AIIDE 2021 - BananaBrain versus WillyT

Not much to see here, because the pairing was one-sided. But there are still a few points to note. In the tables, blue is good for BananaBrain, red is good for WillyT.

bananabrain strategies versus willyt strategies

	overall	1 rush	2 fe bio-mech	3 fe mech	4 tonk
overall	146/157 93%	43/46 93%	71/79 90%	17/17 100%	15/15 100%
10/12gate	41/44 93%	7/10 70%	19/19 100%	10/10 100%	5/5 100%
12nexus	5/6 83%	1/1 100%	1/2 50%	1/1 100%	2/2 100%
2gatedt	0/1 0%	-	0/1 0%	-	-
32nexus	21/24 88%	17/17 100%	2/5 40%	-	2/2 100%
9/9proxygate	76/77 99%	18/18 100%	46/47 98%	6/6 100%	6/6 100%
dtdrop	1/2 50%	-	1/2 50%	-	-
stove	2/3 67%	-	2/3 67%	-	-

It’s striking how quickly BananaBrain gave up on a build; it only had to fail once in six games. The reason is of course that the other builds were doing better than that. Now we see the relative success of WillyT’s build 2 bio-mech: It provided all of the wins in the builds that BananaBrain gave up on, and a few other wins as well. Otherwise, only WillyT’s rush was able to score a few wins, and then only against BananaBrain’s zealot play.

willyt as seen by bananabrain

willyt played	#	bananabrain recognized
1 rush	46	39 2rax | 7 unknown
2 fe bio-mech	79	40 fastexpand | 25 unknown | 14 2rax
3 fe mech	17	12 fastexpand | 2 unknown | 2 1fac | 1 2rax
4 tonk	15	10 1fac | 4 unknown | 1 2rax

BananaBrain seems to have diagnosed builds mostly correctly, when it was able to at all. WillyT’s build 2 seemed to be better at denying scouting. There is no sign that reading the opponent’s build helped BananaBrain play better; it’s the opposite if anything. But with the results so lopsided, we shouldn’t expect much of a sign anyway.

AIIDE 2021 - what WillyT learned

A middle group of bots finished close to each other, from #5 McRave at 41.70% to #8 DaQin at 39.63%. #6 WillyT is the second of the group.

WillyT’s learning files record the bot’s strategy as 01, 02, 03, or 04. Last year it only went up to 03. There may be an expectation of going up to 10 someday! Here is how I translated the strategy numbers into names, based on the numbering in the bot’s top-level README.

#	name	description
01	1 rush	2 rax bio + SCVs
02	2 fe bio-mech	1 rax expand into bio-mech
03	3 fe mech	1 rax expand into mech
04	4 tonk	slowly make many tanks

#1 stardust

opening	games	wins	first	last
4 tonk	156	3%	0	155
1 opening	156	3%

Stardust got special treatment, and it was still only good enough for 3%. The tonk build seems to have been specially devised to give a chance against Stardust. I checked on BASIL and found that the chance there was never high. But it’s over zero, that’s better than Steamhammer did!

#2 bananabrain

opening	games	wins	first	last
1 rush	46	7%	4	156
2 fe bio-mech	79	10%	0	154
3 fe mech	17	0%	3	152
4 tonk	15	0%	1	141
4 openings	157	7%

Switching between the rush and bio-mech was able to squeeze a little blood from BananaBrain. It’s interesting that mech scored lower, though the expected win rate is so low that it’s hard to be sure the difference is real. Does the build suffer from a weak timing?
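To put a number on "hard to be sure", here is a small Python sketch. It assumes, purely for illustration, that the mech build had the same true win rate as the rush and bio-mech builds pooled together, and asks how often 17 games would then produce zero wins. The win counts come from the BananaBrain versus WillyT cross table (3 of 46 for the rush, 8 of 79 for bio-mech, 0 of 17 for mech); the pooling itself is my assumption, not something in WillyT's files.

```python
from math import comb

def prob_at_most(k, n, p):
    """Binomial probability of k or fewer wins in n games at true win rate p."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k + 1))

# WillyT's wins versus BananaBrain, read off the strategy cross table.
rush_wins, rush_games = 3, 46
biomech_wins, biomech_games = 8, 79
mech_wins, mech_games = 0, 17

# Assumption: mech is "really" as good as the other two builds combined.
pooled_rate = (rush_wins + biomech_wins) / (rush_games + biomech_games)
p_zero = prob_at_most(mech_wins, mech_games, pooled_rate)
print(f"pooled rate {pooled_rate:.3f}, chance of 0 wins in {mech_games} games: {p_zero:.2f}")
```

Under that assumption a shutout in 17 games happens roughly one time in five, so the 0% by itself is weak evidence that mech is worse.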

#3 dragon

opening	games	wins	first	last
1 rush	30	0%	3	155
2 fe bio-mech	35	3%	0	156
3 fe mech	66	5%	1	153
4 tonk	26	0%	2	154
4 openings	157	3%

The author explained in a comment that WillyT is weak at TvT because it does not understand siege lines. With only one terran opponent, it wasn’t critical. This version of Dragon always makes a slow start to the game, so any slowness in WillyT’s bio-mech build did not matter, and having tanks likely helped.

#4 steamhammer

opening	games	wins	first	last
1 rush	86	49%	10	155
2 fe bio-mech	53	45%	2	156
3 fe mech	11	18%	0	150
4 tonk	7	0%	3	147
4 openings	157	43%

WillyT could not outscore Steamhammer, but it made a good attempt. I find it distressing that the rush won so many games; it’s a strong rush but not that hard to hold. Again, bio-mech was better than mech. Well, that’s more expected versus zerg, but I wonder whether the reason is the same as versus BananaBrain?

#5 mcrave

opening	games	wins	first	last
1 rush	41	61%	4	155
2 fe bio-mech	54	83%	0	154
3 fe mech	47	74%	13	156
4 tonk	15	27%	5	138
4 openings	157	69%

McRave is meticulous in defense, which shows in these numbers. But it suffered against 2-base play. I think we can infer that WillyT has good mutalisk defense.

#7 microwave

opening	games	wins	first	last
1 rush	58	83%	0	155
2 fe bio-mech	19	42%	7	156
3 fe mech	47	64%	8	153
4 tonk	33	61%	6	151
4 openings	157	68%

The tonk build had success against Microwave, but was neither successful nor much played other than here and versus Stardust. I’d say the build is overspecialized, useful only in a narrow range of situations. The rush was overwhelming, though.

#8 daqin

opening	games	wins	first	last
1 rush	14	14%	2	35
2 fe bio-mech	130	44%	4	154
3 fe mech	7	0%	8	34
4 tonk	4	0%	0	18
4 openings	155	38%

And again, bio-mech over mech. It’s not conclusive, but I feel that something may be weak in the mech build. Maybe WillyT is just better with marines.

#9 freshmeat

opening	games	wins	first	last
1 rush	133	72%	4	156
2 fe bio-mech	12	50%	2	39
3 fe mech	6	33%	1	38
4 tonk	6	50%	0	136
4 openings	157	68%

FreshMeat is newer and perhaps not ready yet to face the rush.

#10 ualbertabot

opening	games	wins	first	last
1 rush	73	81%	5	151
2 fe bio-mech	49	61%	0	154
3 fe mech	26	54%	3	148
4 tonk	7	29%	12	144
4 openings	155	68%

UAlbertaBot, with aggressive openers and no strong defensive skill, also fell to the rush. It got outrushed. It strikes me that WillyT scored nearly the same against #5 McRave, #7 Microwave, #9 FreshMeat, and #10 UAlbertaBot, even though the four are different in style and strength. WillyT did not crush any opponent. To me that suggests some kind of inconsistency in its play: It may have flaws that even weaker bots can exploit sometimes.

AIIDE 2021 - McRave versus Dragon

McRave recorded all 157 games versus Dragon, but Dragon recorded only 155. I was able to align the files anyway, because Dragon’s missing games were clearly due to Dragon’s 2 crashes. I manually removed the corresponding games from McRave’s records. Then there was one more fix: McRave and Dragon both recorded the round 97 game as a loss. Officially, McRave had timed out and Dragon won, so I manually altered Dragon’s game record to give it the win.

Dragon doesn’t record anything but win/loss and its own strategy, so it has nothing to say about how McRave played.

mcrave strategies versus dragon strategies

	overall	1rax fe	2rax bio	2rax mech	bio	dirty worker rush	mass vulture	siege expand
overall	51/155 33%	4/20 20%	14/40 35%	6/17 35%	9/24 38%	2/2 100%	3/6 50%	13/46 28%
HatchPool,12Hatch,2HatchMuta	21/66 32%	0/7 0%	5/14 36%	3/5 60%	2/11 18%	-	2/2 100%	9/27 33%
HatchPool,12Hatch,2HatchSpeedling	0/1 0%	-	-	0/1 0%	-	-	-	-
PoolHatch,12Pool,2HatchMuta	1/9 11%	0/2 0%	0/2 0%	1/2 50%	-	-	-	0/3 0%
PoolHatch,12Pool,3HatchMuta	14/36 39%	1/3 33%	6/14 43%	1/2 50%	6/11 55%	-	0/3 0%	0/3 0%
PoolHatch,Overpool,2HatchMuta	9/26 35%	3/4 75%	1/7 14%	0/4 0%	1/2 50%	-	-	4/9 44%
PoolHatch,Overpool,2HatchSpeedling	2/2 100%	-	-	-	-	2/2 100%	-	-
PoolHatch,Overpool,3HatchMuta	4/15 27%	0/4 0%	2/3 67%	1/3 33%	-	-	1/1 100%	0/4 0%

Another reaction build: PoolHatch,Overpool,2HatchSpeedling was a reaction to Dragon’s worker rush, and taught Dragon not to do that.

dragon as seen by mcrave

dragon played	#	mcrave recognized
1rax fe	20	12 Unknown,Unknown,Unknown | 5 RaxCC,1RaxFE,5FactGoliath | 3 RaxCC,1RaxFE,Unknown
2rax bio	40	31 Unknown,Unknown,Unknown | 6 2Rax,Main,Unknown | 1 2Rax,Main,1FactTanks | 1 RaxCC,1RaxFE,Unknown | 1 2Rax,Expand,Unknown
2rax mech	17	14 Unknown,Unknown,Unknown | 2 2Rax,Main,Unknown | 1 2Rax,Proxy,Unknown
bio	24	11 RaxCC,1RaxFE,5FactGoliath | 7 Unknown,Unknown,Unknown | 3 RaxCC,1RaxFE,1FactTanks | 3 RaxCC,1RaxFE,Unknown
dirty worker rush	2	2 Unknown,Unknown,WorkerRush
mass vulture	6	5 Unknown,Unknown,Unknown | 1 RaxFact,Unknown,2PortWraith
siege expand	46	18 Unknown,Unknown,Unknown | 13 RaxFact,Unknown,5FactGoliath | 9 RaxCC,1RaxFE,5FactGoliath | 4 RaxCC,1RaxFE,Unknown | 2 RaxFact,Unknown,Unknown

Not much to see here; we already knew that McRave had trouble scouting Dragon. Much of what it did recognize was correct, at least. Though what McRave saw as a proxy, Dragon called “2rax mech”. The game is game 2767 from round 61 (replay file) on Heartbreak Ridge. Nothing in it resembles a proxy; it must have been a McRave bug.

AIIDE 2021 - McRave versus BananaBrain

McRave and BananaBrain both recorded all 157 of their mutual games. I chose to put McRave down the left side of the strategy cross, because its longer strategy names make the table hard to read otherwise. I also trimmed off the “PvZ_” and “Z_” prefixes from BananaBrain’s strategy names for compactness.
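For readers who want to build a strategy cross from their own learning files, here is a minimal Python sketch of the tallying step. It assumes the two bots' records have already been aligned into one tuple per game; the tuple layout and the function names are made up for this example and are not the script I actually use.

```python
from collections import defaultdict

def strategy_cross(games):
    """games: iterable of (row_strategy, column_strategy, row_bot_won) tuples.

    Returns {row: {column: (wins, games)}}, with an extra "overall" row and column.
    """
    table = defaultdict(lambda: defaultdict(lambda: [0, 0]))
    for row, col, won in games:
        # Each game counts in its own cell and in the matching "overall" cells.
        for r in (row, "overall"):
            for c in (col, "overall"):
                cell = table[r][c]
                cell[0] += int(won)
                cell[1] += 1
    return {r: {c: tuple(v) for c, v in cols.items()} for r, cols in table.items()}

def format_cell(cell):
    """Render a (wins, games) pair the way the tables show it, e.g. '2/2 100%'."""
    wins, games = cell
    return f"{wins}/{games} {round(100 * wins / games)}%"
```

Feeding it three proxy gate games, two wins with `HatchPool,9Pool,2HatchSpeedling` and one loss with `HatchPool,12Hatch,2HatchSpeedling`, gives `(2, 3)` in the overall row of the `9/9proxygate` column. Cells with no games are simply absent and would print as `-`.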

Blue is good for McRave, red is good for BananaBrain.

mcrave strategies versus bananabrain strategies

	overall	10/12gate	1basespeedzeal	2basespeedzeal	4gate2archon	5gategoon	9/9gate	9/9proxygate	bisu	neobisu	sairdt	sairgoon	sairreaver	stove
overall	28/157 18%	8/54 15%	1/3 33%	1/5 20%	1/1 100%	1/1 100%	1/1 100%	2/8 25%	1/5 20%	1/4 25%	1/3 33%	2/12 17%	1/1 100%	7/59 12%
HatchPool,12Hatch,2HatchMuta	16/86 19%	3/28 11%	1/1 100%	1/3 33%	1/1 100%	1/1 100%	1/1 100%	0/1 0%	-	1/4 25%	0/2 0%	2/10 20%	1/1 100%	4/33 12%
HatchPool,12Hatch,2HatchSpeedling	0/1 0%	-	-	-	-	-	-	0/1 0%	-	-	-	-	-	-
HatchPool,9Pool,2HatchSpeedling	2/2 100%	-	-	-	-	-	-	2/2 100%	-	-	-	-	-	-
PoolHatch,9Pool,2HatchMuta	3/15 20%	0/2 0%	-	-	-	-	-	-	0/3 0%	-	-	-	-	3/10 30%
PoolHatch,9Pool,2HatchSpeedling	0/3 0%	-	-	-	-	-	-	0/3 0%	-	-	-	-	-	-
PoolHatch,9Pool,3HatchMuta	4/19 21%	4/13 31%	-	0/1 0%	-	-	-	-	0/1 0%	-	-	-	-	0/4 0%
PoolHatch,9Pool,6HatchHydra	0/2 0%	-	-	-	-	-	-	-	-	-	-	0/1 0%	-	0/1 0%
PoolHatch,Overpool,2HatchMuta	0/4 0%	0/2 0%	0/1 0%	-	-	-	-	-	-	-	-	0/1 0%	-	-
PoolHatch,Overpool,2HatchSpeedling	0/1 0%	-	-	-	-	-	-	0/1 0%	-	-	-	-	-	-
PoolHatch,Overpool,3HatchMuta	3/14 21%	1/9 11%	0/1 0%	-	-	-	-	-	1/1 100%	-	1/1 100%	-	-	0/2 0%
PoolHatch,Overpool,6HatchHydra	0/10 0%	-	-	0/1 0%	-	-	-	-	-	-	-	-	-	0/9 0%

There we have the explanation for the 2 lonely HatchPool,9Pool,2HatchSpeedling games: The strategy was a successful reaction to proxy gates. I read it as meaning that the build is 9 hatch, 9 pool, and gas soon for the zergling speed upgrade.

BananaBrain split its effort between 10-12 gate and the Stove (a scout into dark templar build), very different builds. McRave answered both mostly with 12 hatch into 2 hatch muta. A hydralisk opening would have been a more natural way to counter both, but play what you’re good at.

mcrave as seen by bananabrain

mcrave played	#	bananabrain recognized
HatchPool,12Hatch,2HatchMuta	86	84 12hatch | 2 unknown
HatchPool,12Hatch,2HatchSpeedling	1	1 unknown
HatchPool,9Pool,2HatchSpeedling	2	2 12pool
PoolHatch,9Pool,2HatchMuta	15	8 9pool | 7 overpool
PoolHatch,9Pool,2HatchSpeedling	3	2 overpool | 1 9pool
PoolHatch,9Pool,3HatchMuta	19	13 9pool | 6 overpool
PoolHatch,9Pool,6HatchHydra	2	1 overpool | 1 9pool
PoolHatch,Overpool,2HatchMuta	4	4 overpool
PoolHatch,Overpool,2HatchSpeedling	1	1 overpool
PoolHatch,Overpool,3HatchMuta	14	14 overpool
PoolHatch,Overpool,6HatchHydra	10	10 overpool

BananaBrain was accurate in recognizing 12 hatch and overpool, but had trouble with 9 pool. It did not try to narrow the build down any further than that.

bananabrain as seen by mcrave

bananabrain played	#	mcrave recognized
10/12gate	54	31 2Gate,10/12,Corsair | 8 2Gate,10/12,DT | 4 2Gate,10/12,ZealotRush | 3 2Gate,10/17,Corsair | 2 2Gate,Unknown,Corsair | 2 2Gate,9/9,DT | 2 2Gate,9/9,Corsair | 1 2Gate,10/17,4Gate | 1 2Gate,10/17,DT
1basespeedzeal	3	2 1GateCore,2Zealot,DT | 1 1GateCore,2Zealot,Corsair
2basespeedzeal	5	3 FFE,Forge,Speedlot | 1 FFE,Nexus,Speedlot | 1 FFE,Nexus,5GateGoon
4gate2archon	1	1 FFE,Forge,5GateGoon
5gategoon	1	1 FFE,Nexus,5GateGoon
9/9gate	1	1 2Gate,9/9,Corsair
9/9proxygate	8	7 2Gate,Proxy,ZealotRush | 1 2Gate,9/9,Unknown
bisu	5	2 FFE,Forge,Unknown | 2 FFE,Forge,5GateGoon | 1 FFE,Nexus,Unknown
neobisu	4	4 FFE,Forge,Speedlot
sairdt	3	3 1GateCore,2Zealot,Corsair
sairgoon	12	6 FFE,Forge,5GateGoon | 2 FFE,Nexus,5GateGoon | 2 FFE,Forge,Unknown | 1 FFE,Gateway,5GateGoon | 1 FFE,Gateway,Unknown
sairreaver	1	1 FFE,Forge,Unknown
stove	59	27 1GateCore,2Zealot,Corsair | 10 1GateCore,Unknown,Corsair | 6 1GateCore,Unknown,DT | 6 1GateCore,2Zealot,DT | 3 2Gate,10/12,DT | 3 2Gate,10/12,ZealotRush | 2 2Gate,10/17,4Gate | 1 2Gate,10/17,DT | 1 2Gate,10/12,4Gate

In 1-base protoss plays, McRave tried to distinguish when the gates were made, and often got it right but had some trouble. It seems like something you can’t do perfectly, even if you combine direct scouting of the gates with inferences based on the enemy army. Recognizing the enemy build precisely doesn’t seem possible in general, though you can usually get close.

AIIDE 2021 - a questionable game result and fixing it

Working through the learning files of various bots, I ran into a strange discrepancy: One game between Dragon and McRave which both bots recorded as a loss. In the detailed results, the game ID is 4387 from round 97. The official result has Dragon winning after McRave timed out.

What really happened? I watched the replays recorded by both bots. On the surface, both replays of the game looked the same. McRave destroyed all of Dragon’s buildings, then as usual the replays continued a few more moments. According to OpenBW, Dragon’s replay ended at 22:10 and McRave’s replay at 22:09 (according to the official results, the game ended at 21:57). The difference is reflected in the file sizes: Dragon’s replay is 1,553,196 bytes while McRave’s is 1,552,818 bytes, a gap of 378 bytes (in other cases I checked, differences between replay sizes for the same game are less than 10 bytes). Here is a view at the end of Dragon’s replay, followed by the end of McRave’s. Notice that the valkyrie has moved a little farther in Dragon’s version, and Dragon shows supply 0 for both bots at the end (consistent with both having lost).

On the face of it, Dragon lost because all its buildings were destroyed, then in the brief runout of the game before Broodwar stopped, McRave lost by timing out (the tournament manager said it timed out, plus it recorded its own game result so it didn’t crash). That’s how they both believed they lost. The tournament manager, I have to assume, didn’t expect that situation and took the timeout as definitive, recording that Dragon won.

I wrote to Dave Churchill about it and got this answer:

I don’t have any time to work on this right now and it seems like a pretty small edge case, so if you could post about it and crowd source the fix that would be ideal. I’ll accept any pull request that makes the tournament better!

It’s not critical, it’s a rare case that is not even close to affecting the tournament finishing order. But it would still be nice to fix it.

The first job may be to read code and/or run experiments to figure out exactly what happened. Regardless of the course of events, there are two issues to solve. 1. What should the game result be? 2. How can we tell both bots the correct result?

1. the game result

It was a misplay. Both bots messed up fatally. Don’t count it as a win for either, but skip the game.

McRave won. All terran buildings were destroyed, and that’s the winning condition, right? Never mind that McRave had trouble with time later. Why is there a later at all?

Dragon won. The tournament needs to control how much time bots take, no matter when they take it, as a matter of efficiency and fairness. Therefore the game is not over until Broodwar stops it.

Reasonable people can disagree. I don’t think the answer matters much. What’s more important is to make sure that the actual game, the tournament manager, and the bots all agree on the result as much as possible. I think that issue 1 and issue 2 are interrelated, and should be answered together, not separately.

2. notifying the bots

I don’t understand the technical details of how bots are notified that they won or lost much beyond “somebody calls onEnd()”. But I have poked at it a bit. Looking at it from the tournament end, the software includes a java tournament manager, a C++ tournament module that is part of BWAPI, and then the bots and game itself.

I think the outline is this: When a game completes normally, there is a short runout phase, and then the tournament module notifies both bots of the result, all good. If a bot times out, the tournament module kicks it out of the game immediately and notifies it of its loss. Then Broodwar realizes there is only one player left, calls the game over, and the remaining player wins. Usually good. In this case, I think Dragon lost and the game entered the runout phase. Then McRave timed out, was kicked out and notified of its loss. Then, when Broodwar ended the game slightly later (giving Dragon the longer replay), for whatever reason the tournament module told Dragon it had lost too. Meanwhile, the tournament manager took the timeout as definitive and recorded that Dragon won. Is my thinking correct? I think it’s close, but I may have details wrong.

The goal is to find a fix so that one of the game results of issue 1 is decided on and carried through consistently. It would be nice if the fix only affected the java tournament manager, but I don’t know if that’s possible.
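To make the three candidate rulings concrete, here is a minimal sketch of a single decision function that every component could share. It is not the tournament manager's code: the names `Policy`, `GameLog` and `decide_winner` are hypothetical, and the frame numbers in the example are invented purely to put the timeout after the elimination, as in the Dragon-McRave game.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Policy(Enum):
    VOID = "void"               # both bots failed: do not count the game
    ELIMINATION_FIRST = "elim"  # the game is decided once a side loses all buildings
    TIMEOUT_ALWAYS = "timeout"  # a timeout loses the game whenever it happens


@dataclass
class GameLog:
    eliminated: Optional[str] = None       # bot that lost all its buildings, if any
    eliminated_frame: Optional[int] = None
    timed_out: Optional[str] = None        # bot kicked for exceeding the time limit, if any
    timeout_frame: Optional[int] = None


def other(players, name):
    """Return the opponent of `name` in a two-player game."""
    a, b = players
    return b if name == a else a


def decide_winner(players, log, policy):
    """Return the winner's name, or None if the game should not count."""
    elim, tout = log.eliminated, log.timed_out
    if elim is None and tout is None:
        return None  # nothing decisive was recorded; left to the caller
    if elim is None:
        return other(players, tout)
    if tout is None or tout == elim:
        return other(players, elim)
    # Conflict: one bot was eliminated and the other bot timed out.
    if policy is Policy.VOID:
        return None
    if policy is Policy.TIMEOUT_ALWAYS:
        return other(players, tout)
    # ELIMINATION_FIRST: a timeout during the runout after elimination is ignored,
    # but a timeout that came before the elimination still loses.
    if (log.timeout_frame is not None and log.eliminated_frame is not None
            and log.timeout_frame < log.eliminated_frame):
        return other(players, tout)
    return other(players, elim)


# Hypothetical frame numbers; only their order matters here.
players = ("Dragon", "McRave")
log = GameLog(eliminated="Dragon", eliminated_frame=1000,
              timed_out="McRave", timeout_frame=1010)
for policy in Policy:
    print(policy.name, decide_winner(players, log, policy))
```

Whichever policy is picked, the point is that the same answer would have to be the one reported to both bots and written to the results file.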

AIIDE 2021 - what McRave learned

I’m taking the bots in finishing order; McRave is next. Last year I analyzed McRave’s three-part strategy representation and learning algorithm. These apparently have not changed in outline, though details may have changed. The set of available strategies has been updated. For example, 6HatchHydra is new this year. It follows that the set of enabled strategy triples has also changed.

McRave is much stronger this year. It has become noted for dangerous mutalisk control.

#1 stardust

opening	games	wins	first	last
HatchPool,12Hatch,2HatchMuta	81	6%	0	156
PoolHatch,9Pool,2HatchMuta	22	14%	13	155
PoolHatch,9Pool,3HatchMuta	11	0%	4	147
PoolHatch,9Pool,6HatchHydra	9	0%	9	143
PoolHatch,Overpool,2HatchMuta	9	0%	6	145
PoolHatch,Overpool,3HatchMuta	13	8%	11	125
PoolHatch,Overpool,6HatchHydra	12	0%	1	149
7 openings	157	6%

enemy	games	wins
1GateCore,2Zealot,4Gate	105	6%
2Gate,10/12,4Gate	10	10%
2Gate,10/17,4Gate	40	5%
2Gate,9/9,4Gate	1	0%
2Gate,Unknown,4Gate	1	0%
5 openings	157	6%

9 pool into 2 hatch muta worked best, with 3 wins out of 22. That is not intuitive. The more natural 12 hatch into 2 hatch muta was tried more but was less successful. Did Stardust react inefficiently to the 9 pool? McRave appears to correctly understand that Stardust ends up with 4 gates despite taking different routes to get there. That’s kind of impressive.

Last year McRave scored 3 out of 150 against Stardust. This year it scored 8 out of 150 against a stronger Stardust—which seems to have updates specifically to defeat McRave, since McRave was the only bot to upset it in CoG 2021. Good progress!

#2 bananabrain

opening	games	wins	first	last
HatchPool,12Hatch,2HatchMuta	86	19%	0	156
HatchPool,12Hatch,2HatchSpeedling	1	0%	26	26
HatchPool,9Pool,2HatchSpeedling	2	100%	30	100
PoolHatch,9Pool,2HatchMuta	15	20%	18	144
PoolHatch,9Pool,2HatchSpeedling	3	0%	24	29
PoolHatch,9Pool,3HatchMuta	19	21%	4	124
PoolHatch,9Pool,6HatchHydra	2	0%	12	129
PoolHatch,Overpool,2HatchMuta	4	0%	9	96
PoolHatch,Overpool,2HatchSpeedling	1	0%	25	25
PoolHatch,Overpool,3HatchMuta	14	21%	16	114
PoolHatch,Overpool,6HatchHydra	10	0%	1	146
11 openings	157	18%

enemy	games	wins
1GateCore,2Zealot,Corsair	31	13%
1GateCore,2Zealot,DT	8	12%
1GateCore,Unknown,Corsair	10	10%
1GateCore,Unknown,DT	6	0%
2Gate,10/12,4Gate	1	0%
2Gate,10/12,Corsair	31	16%
2Gate,10/12,DT	11	27%
2Gate,10/12,ZealotRush	7	0%
2Gate,10/17,4Gate	3	67%
2Gate,10/17,Corsair	3	0%
2Gate,10/17,DT	2	50%
2Gate,9/9,Corsair	3	33%
2Gate,9/9,DT	2	0%
2Gate,9/9,Unknown	1	0%
2Gate,Proxy,ZealotRush	7	29%
2Gate,Unknown,Corsair	2	0%
FFE,Forge,5GateGoon	9	22%
FFE,Forge,Speedlot	7	29%
FFE,Forge,Unknown	5	20%
FFE,Gateway,5GateGoon	1	100%
FFE,Gateway,Unknown	1	0%
FFE,Nexus,5GateGoon	4	25%
FFE,Nexus,Speedlot	1	0%
FFE,Nexus,Unknown	1	100%
24 openings	157	18%

McRave’s wins over BananaBrain are dominated by games where BananaBrain timed out. In game 2754, BananaBrain timed out with 320 frames over 55ms, when McRave had 318 frames over 55ms. Close call! McRave had more timeouts than any other bot, but only 3 losses to BananaBrain by timeout. BananaBrain’s timeouts seem to be concentrated on Dragon and McRave, and to a lesser extent on Steamhammer.
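The 320-versus-318 comparison can be sketched as a simple counter. This is an illustration only, not the tournament module's code: the function names are hypothetical, the frame times are synthetic, and it assumes that "over 55ms" means strictly more than 55 ms and that a bot is out as soon as its count reaches 320, which is consistent with the figures quoted for game 2754.

```python
def count_slow_frames(frame_times_ms, limit_ms=55):
    """Count frames whose processing time exceeded the limit."""
    return sum(1 for t in frame_times_ms if t > limit_ms)


def timed_out(frame_times_ms, limit_ms=55, max_slow_frames=320):
    """Assumed rule: a bot times out once it reaches max_slow_frames slow frames."""
    return count_slow_frames(frame_times_ms, limit_ms) >= max_slow_frames


# Synthetic frame times that reproduce only the slow-frame counts from the text.
bananabrain = [60] * 320 + [20] * 1000
mcrave = [60] * 318 + [20] * 1000

print(count_slow_frames(bananabrain), timed_out(bananabrain))
print(count_slow_frames(mcrave), timed_out(mcrave))
```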

If HatchPool,9Pool,2HatchSpeedling was tried twice and won twice, why wasn’t it tried more often? The first try was on game 30 of 0-156. I imagine that it was a reactive build, not enabled as an initial choice but switched to under given circumstances. I didn’t read the source to verify that.

The enemy table shows a complex set of strategies by BananaBrain.

#3 dragon

opening	games	wins	first	last
HatchPool,12Hatch,2HatchMuta	66	32%	0	156
HatchPool,12Hatch,2HatchSpeedling	1	0%	61	61
PoolHatch,12Pool,2HatchMuta	9	11%	27	150
PoolHatch,12Pool,3HatchMuta	37	41%	1	129
PoolHatch,Overpool,2HatchMuta	27	33%	3	127
PoolHatch,Overpool,2HatchSpeedling	2	100%	21	99
PoolHatch,Overpool,3HatchMuta	15	27%	25	148
7 openings	157	33%

enemy	games	wins
2Rax,Expand,Unknown	1	100%
2Rax,Main,1FactTanks	1	0%
2Rax,Main,Unknown	8	25%
2Rax,Proxy,Unknown	1	0%
RaxCC,1RaxFE,1FactTanks	3	33%
RaxCC,1RaxFE,5FactGoliath	25	12%
RaxCC,1RaxFE,Unknown	11	27%
RaxFact,Unknown,2PortWraith	1	100%
RaxFact,Unknown,5FactGoliath	13	23%
RaxFact,Unknown,Unknown	2	50%
Unknown,Unknown,Unknown	89	39%
Unknown,Unknown,WorkerRush	2	100%
12 openings	157	33%

Over half of McRave’s losses to Dragon were by timeout. I think Dragon is an especially easy bot to time out against, because its strong macro and big battles with light units put heavy demands on the opponent.

The enemy table shows 89 games with Unknown,Unknown,Unknown. Apparently Dragon often denied scouting. Presumably the scouting overlord was afraid to approach due to marines, and any scouting drone was turned away. Also, I wonder about 2Rax,Proxy,Unknown. Did Dragon really proxy once, or was it a misrecognition? On the map Python, bases can be close by air. If McRave measures proxy distance by air distance, it might mistake a barracks in the enemy main for a proxy.
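The air-distance guess can be illustrated on a toy grid. This is a sketch of the general idea, not McRave's proxy check, which I have not read. The map, the `PROXY_RADIUS` threshold and all function names are made up. `M` is my main, `E` is a barracks sitting in the enemy main, and `#` is unwalkable terrain between them.

```python
from collections import deque
from math import hypot

# Toy map: the two mains are 4 tiles apart by air but far apart on the ground.
GRID = [
    "M.#.E",
    "..#..",
    "..#..",
    "..#..",
    ".....",
]


def find(ch):
    """Return the (row, col) of the first tile marked with ch."""
    for r, row in enumerate(GRID):
        c = row.find(ch)
        if c != -1:
            return (r, c)
    raise ValueError(ch)


def air_distance(a, b):
    """Straight-line distance in tiles, ignoring terrain."""
    return hypot(a[0] - b[0], a[1] - b[1])


def ground_distance(a, b):
    """Shortest walkable path length in tiles (4-connected BFS), or None."""
    seen = {a}
    queue = deque([(a, 0)])
    while queue:
        (r, c), dist = queue.popleft()
        if (r, c) == b:
            return dist
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if (0 <= nr < len(GRID) and 0 <= nc < len(GRID[0])
                    and GRID[nr][nc] != "#" and (nr, nc) not in seen):
                seen.add((nr, nc))
                queue.append(((nr, nc), dist + 1))
    return None


PROXY_RADIUS = 6  # hypothetical threshold, in tiles
my_main = find("M")
barracks = find("E")

proxy_by_air = air_distance(my_main, barracks) < PROXY_RADIUS
proxy_by_ground = ground_distance(my_main, barracks) < PROXY_RADIUS
print(proxy_by_air, proxy_by_ground)
```

With the same threshold, the air measure flags the barracks as a proxy while the ground measure does not, which is the kind of misrecognition suspected above.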

#4 steamhammer

opening	games	wins	first	last
PoolHatch,12Pool,2HatchMuta	21	33%	3	142
PoolHatch,12Pool,2HatchSpeedling	12	17%	1	136
PoolLair,9Pool,1HatchMuta	125	51%	0	157
3 openings	158	46%

enemy	games	wins
HatchPool,10Hatch,1HatchMuta	2	100%
HatchPool,10Hatch,2HatchSpeedling	16	81%
HatchPool,10Hatch,3HatchMuta	1	0%
HatchPool,10Hatch,Unknown	8	50%
HatchPool,9Pool,3HatchMuta	1	0%
HatchPool,9Pool,Unknown	6	67%
HatchPool,Unknown,Unknown	1	100%
PoolHatch,12Pool,3HatchMuta	1	100%
PoolHatch,12Pool,Unknown	2	50%
PoolHatch,4Pool,LingRush	8	88%
PoolHatch,9Pool,2HatchSpeedling	1	100%
PoolHatch,9Pool,Unknown	8	38%
PoolHatch,Unknown,2HatchHydra	1	100%
PoolHatch,Unknown,3HatchMuta	2	100%
PoolHatch,Unknown,Unknown	4	75%
PoolLair,9Pool,1HatchMuta	2	100%
PoolLair,Unknown,1HatchMuta	4	25%
Unknown,Unknown,1HatchHydra	1	100%
Unknown,Unknown,1HatchLurker	2	100%
Unknown,Unknown,1HatchMuta	51	27%
Unknown,Unknown,3HatchMuta	2	0%
Unknown,Unknown,Unknown	34	29%
22 openings	158	46%

McRave chose from the same fixed set of 3 strategies against all the zergs. Only the 1 hatch mutalisks were able to hold their own with Steamhammer.

#6 willyt

opening	games	wins	first	last
HatchPool,12Hatch,2HatchMuta	89	33%	0	156
PoolHatch,12Pool,3HatchMuta	55	35%	1	155
PoolHatch,Overpool,2HatchMuta	13	0%	2	151
3 openings	157	31%

enemy	games	wins
RaxCC,1RaxFE,1FactTanks	20	0%
RaxCC,1RaxFE,5FactGoliath	38	26%
RaxCC,1RaxFE,Unknown	30	33%
RaxFact,Unknown,5FactGoliath	2	0%
Unknown,Unknown,Unknown	67	42%
5 openings	157	31%

Hurrying the mutas too much did not help against WillyT. The enemy table shows that WillyT sometimes countered with goliaths. Does McRave later make a hydra switch to fight the goliaths? WillyT sometimes goes for goliaths with 2 tanks, and it’s sensible to fight back with hydralisks. I didn’t see a hydra switch in the games I watched.

I noticed that McRave doesn’t clear the terran scout from its main until mutas come out. WillyT gets to know the exact timing for its turrets with no need to spend a scan.

#7 microwave

opening	games	wins	first	last
PoolHatch,12Pool,2HatchMuta	49	65%	3	156
PoolHatch,12Pool,2HatchSpeedling	15	33%	1	147
PoolLair,9Pool,1HatchMuta	93	62%	0	149
3 openings	157	61%

enemy	games	wins
HatchPool,10Hatch,+1Ling	1	0%
HatchPool,10Hatch,1HatchMuta	1	0%
HatchPool,10Hatch,2HatchHydra	1	100%
HatchPool,10Hatch,2HatchSpeedling	41	46%
HatchPool,10Hatch,3HatchMuta	11	82%
HatchPool,10Hatch,Unknown	18	67%
HatchPool,9Pool,+1Ling	1	0%
HatchPool,9Pool,2HatchSpeedling	2	100%
HatchPool,9Pool,3HatchMuta	2	50%
HatchPool,9Pool,Unknown	2	100%
HatchPool,Unknown,2HatchSpeedling	2	50%
PoolHatch,12Pool,+1Ling	1	100%
PoolHatch,12Pool,1HatchHydra	1	100%
PoolHatch,12Pool,2HatchSpeedling	1	100%
PoolHatch,12Pool,3HatchHydra	1	100%
PoolHatch,12Pool,3HatchMuta	9	22%
PoolHatch,12Pool,Unknown	5	80%
PoolHatch,4Pool,LingRush	14	79%
PoolHatch,9Pool,+1Ling	1	0%
PoolHatch,9Pool,1HatchMuta	1	0%
PoolHatch,9Pool,2HatchSpeedling	1	100%
PoolHatch,9Pool,3HatchMuta	3	67%
PoolHatch,9Pool,Unknown	2	50%
PoolHatch,Unknown,2HatchHydra	1	100%
PoolHatch,Unknown,Unknown	3	100%
Unknown,9Pool,+1Ling	2	50%
Unknown,9Pool,1HatchHydra	1	100%
Unknown,Unknown,+1Ling	3	0%
Unknown,Unknown,1HatchLurker	1	100%
Unknown,Unknown,1HatchMuta	1	100%
Unknown,Unknown,3HatchHydra	2	50%
Unknown,Unknown,3HatchMuta	10	60%
Unknown,Unknown,Unknown	11	73%
33 openings	157	61%

#8 daqin

opening	games	wins	first	last
HatchPool,12Hatch,2HatchMuta	123	83%	0	156
PoolHatch,9Pool,2HatchMuta	3	33%	12	138
PoolHatch,9Pool,3HatchMuta	2	50%	11	20
PoolHatch,9Pool,6HatchHydra	2	0%	58	98
PoolHatch,Overpool,2HatchMuta	23	78%	83	155
PoolHatch,Overpool,3HatchMuta	3	0%	37	105
PoolHatch,Overpool,6HatchHydra	1	0%	1	1
7 openings	157	78%

enemy	games	wins
FFE,Forge,5GateGoon	27	96%
FFE,Forge,Speedlot	88	74%
FFE,Forge,Unknown	2	100%
FFE,Forge,ZealotArchon	7	100%
FFE,Gateway,5GateGoon	2	50%
FFE,Gateway,Speedlot	23	74%
FFE,Nexus,5GateGoon	2	100%
FFE,Nexus,Speedlot	6	33%
8 openings	157	78%

The mutalisks did in DaQin. DaQin’s slow start puts 12 hatch ahead of other choices; DaQin makes cannons before nexus regardless of what the opponent does. DaQin defends its natural entrance with cannons, but not its nexus, so the mutalisks have a free hand and DaQin finds itself short of probes.

#9 freshmeat

opening	games	wins	first	last
PoolHatch,12Pool,2HatchMuta	40	65%	3	152
PoolHatch,12Pool,2HatchSpeedling	19	47%	1	135
PoolLair,9Pool,1HatchMuta	98	69%	0	156
3 openings	157	66%

enemy	games	wins
HatchPool,10Hatch,2HatchSpeedling	26	62%
HatchPool,10Hatch,3HatchMuta	13	38%
HatchPool,10Hatch,Unknown	10	60%
HatchPool,9Pool,2HatchSpeedling	7	29%
HatchPool,9Pool,Unknown	2	50%
HatchPool,Unknown,3HatchMuta	1	100%
HatchPool,Unknown,Unknown	1	100%
PoolHatch,4Pool,LingRush	24	79%
PoolHatch,9Pool,2HatchSpeedling	4	50%
PoolHatch,9Pool,Unknown	2	100%
PoolHatch,Unknown,2HatchSpeedling	4	75%
PoolHatch,Unknown,3HatchMuta	1	100%
PoolLair,Unknown,Unknown	1	100%
Unknown,9Pool,Unknown	4	100%
Unknown,Unknown,+1Ling	2	100%
Unknown,Unknown,1HatchHydra	2	100%
Unknown,Unknown,1HatchMuta	2	100%
Unknown,Unknown,3HatchMuta	11	64%
Unknown,Unknown,3HatchSpeedling	1	0%
Unknown,Unknown,Unknown	39	67%
20 openings	157	66%

#10 ualbertabot

opening	games	wins	first	last
PoolHatch,12Pool,2HatchMuta	1	0%	98	98
PoolHatch,Overpool,2HatchMuta	85	42%	0	156
PoolHatch,Overpool,2HatchSpeedling	71	31%	4	154
3 openings	157	37%

enemy	games	wins
1GateCore,0Zealot,4Gate	2	100%
1GateCore,0Zealot,DT	1	100%
2Gate,10/12,ZealotRush	10	90%
2Gate,9/9,Unknown	2	0%
2Gate,9/9,ZealotRush	36	42%
2Rax,Main,MarineRush	25	0%
2Rax,Main,Unknown	6	33%
PoolHatch,4Pool,LingRush	52	40%
RaxCC,8Rax,Unknown	10	0%
RaxFact,Unknown,2Fact	1	100%
RaxFact,Unknown,Unknown	1	0%
Unknown,Unknown,Unknown	11	64%
12 openings	157	37%

McRave met UAlbertaBot with the same strategies as last year (except for one stray PoolHatch,12Pool,2HatchMuta this year). The same strategies by name, that is. The actual play was different and performed far worse against UAlbertaBot’s pressure builds. I looked at some games. When McRave respected its enemy and defended itself, it generally won. Sometimes it seemed to arrogantly conclude “Pff, you’re not worth spending a sunken on” and got overrun. As far as I could tell from watching games, it wasn’t a scouting miss—though it’s easy to overlook things in watching games. It had the feel of a bug.

AIIDE 2021 - what Steamhammer learned

The submitted Steamhammer was mistakenly configured to retain 100 game records per opponent. I had thought it was set for 200, and didn’t double-check. So of the 157 games against each opponent, of which 150 counted in the tournament, I have records for only the final 100. That’s 93 tournament games plus the 7 extra at the end.
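The 93 plus 7 split follows from the counts above. A small check, assuming games are numbered from 0 as in the first/last columns and that the 7 uncounted games are the last ones played:

```python
TOTAL_GAMES = 157       # games played against each opponent
TOURNAMENT_GAMES = 150  # games that counted in the tournament
RETAINED = 100          # game records kept per opponent

# The retained records are the final 100 games: numbers 57 through 156.
retained = range(TOTAL_GAMES - RETAINED, TOTAL_GAMES)
counted = [g for g in retained if g < TOURNAMENT_GAMES]
extra = [g for g in retained if g >= TOURNAMENT_GAMES]

print(retained[0], retained[-1], len(counted), len(extra))
```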

My prepared data was successful. For all opponents which I prepared for, the prepared openings were among the highest scoring (including the zero score versus Stardust). It’s notable that Steamhammer’s gas steal was not successful against any opponent, perhaps another sign of an elite tournament. It was either infeasible or abandoned as a failure against every opponent except DaQin, and did no good then.

Steamhammer’s game records are rich with data. To show a little bit more of it, I added a new feature in the opening table. There are new “wins” and “losses” columns showing the median time that winning and losing games with that opening lasted. The median is a better measure than the mean, because we can expect the distribution of game times to be right-skewed: games are limited to between zero minutes and one hour, but we expect a hump nearer to zero and a long tail of slower games. The tail inflates the mean and makes it misleading. For the tournament, I turned off surrendering, so Steamhammer played its losses out to the end.
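A small illustration of why the median holds up better. The durations below are invented for the example (they are not from the game records): most games cluster around eight to ten minutes, with one long game and one that ran to the one-hour limit.

```python
from statistics import mean, median

# Invented game durations in seconds: a hump of short games plus a long tail.
durations = [420, 450, 480, 500, 510, 530, 560, 600, 1500, 3600]


def fmt(seconds):
    """Format a duration in seconds as m:ss, like the tables in this post."""
    minutes, secs = divmod(int(round(seconds)), 60)
    return f"{minutes}:{secs:02d}"


print("median", fmt(median(durations)))
print("mean  ", fmt(mean(durations)))
```

Two slow games out of ten are enough to pull the mean well above any typical game, while the median stays inside the hump.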

#1 stardust

opening	games	wins	wins	losses	first	last
10Hatch	1	0%	-	8:48	48	48
10HatchHydra	1	0%	-	8:41	45	45
11HatchTurtleHydra	3	0%	-	10:33	14	56
11HatchTurtleMuta	3	0%	-	10:40	10	67
11Pool	1	0%	-	8:48	74	74
12Gas11PoolMuta	1	0%	-	6:53	60	60
12Hatch_4HatchLing	1	0%	-	14:08	25	25
2HatchLurkerAllIn	1	0%	-	9:12	65	65
2x10Hatch	1	0%	-	8:30	95	95
2x10HatchBurrow	1	0%	-	9:21	55	55
3HatchHydraExpo	4	0%	-	8:03	23	86
3HatchLateHydras	1	0%	-	7:43	9	9
3HatchLing	1	0%	-	7:54	51	51
3HatchLingBurrow	1	0%	-	8:31	71	71
3HatchLingExpo	2	0%	-	8:43	6	37
4HatchBeforeLair	1	0%	-	7:39	47	47
4PoolSoft	1	0%	-	8:26	68	68
6Pool	2	0%	-	9:12	75	92
6PoolHide	1	0%	-	8:35	17	17
6PoolSpeed	6	0%	-	8:27	3	85
7DroneGas	1	0%	-	7:51	80	80
7Pool10Hatch	1	0%	-	8:16	83	83
7Pool12Hatch	1	0%	-	8:36	50	50
7Pool6GasLurker B	1	0%	-	9:38	44	44
7PoolHard	1	0%	-	14:07	41	41
7PoolHarder	1	0%	-	8:23	76	76
7PoolMid	1	0%	-	8:03	89	89
7PoolSoft	1	0%	-	13:09	42	42
8Hatch7PoolBurrow	1	0%	-	9:47	64	64
8Hatch7PoolBurrowB	1	0%	-	8:23	5	5
8Scout	1	0%	-	8:14	87	87
9HatchExpo9Pool9Gas	2	0%	-	8:29	16	94
9Pool8Hatch	1	0%	-	8:17	98	98
9Pool9Hatch	1	0%	-	10:29	70	70
9PoolBurrow	1	0%	-	9:34	84	84
9PoolBurrowB	1	0%	-	8:07	4	4
9PoolHatchSpeed7Drone	2	0%	-	7:58	31	73
9PoolHatchSpeed7DroneB	2	0%	-	8:01	0	24
9PoolHatchSpeedAllInB	1	0%	-	8:41	22	22
9PoolLair	1	0%	-	7:36	99	99
9PoolLurker	1	0%	-	9:41	27	27
9PoolSpeed	2	0%	-	9:03	8	26
9PoolSpire	1	0%	-	9:03	32	32
9PoolSunkSpeed	1	0%	-	7:41	79	79
AntiFact_13Pool	1	0%	-	8:17	54	54
AntiFact_2Hatch	3	0%	-	7:39	69	93
AntiFactoryHydra	1	0%	-	7:08	63	63
AntiZeal_12Hatch	3	0%	-	10:20	33	77
HiveRush	1	0%	-	6:50	30	30
Over10Hatch	2	0%	-	10:07	15	34
Over10Hatch1Sunk	1	0%	-	8:35	96	96
Over10Hatch2Sunk	3	0%	-	10:33	1	88
Over10Hatch2SunkHard	1	0%	-	9:15	46	46
Over10HatchBust	2	0%	-	8:21	18	49
Over10HatchSlowLings	2	0%	-	8:23	61	78
OverhatchExpoLing	1	0%	-	8:30	13	13
OverhatchLing	1	0%	-	10:25	58	58
Overpool14Hatch	1	0%	-	7:39	7	7
Overpool2HatchLurker	2	0%	-	9:06	43	82
OverpoolLurker	1	0%	-	8:48	72	72
OverpoolTurtle 0	1	0%	-	8:32	2	2
Overpool_3HatchLing	1	0%	-	10:29	20	20
PurpleSwarmBuild	1	0%	-	8:08	66	66
ZvP_2HatchMuta	2	0%	-	7:55	38	97
ZvP_Overpool3Hatch	1	0%	-	8:16	29	29
ZvT_13Pool	2	0%	-	9:18	57	91
ZvT_7Pool	1	0%	-	8:30	81	81
ZvZ_12Pool	1	0%	-	7:01	53	53
ZvZ_Overpool11Gas	1	0%	-	7:50	35	35
ZvZ_Overpool9Gas	1	0%	-	7:47	19	19
70 openings	100	0%	-	8:30

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush	100	100%	0%	40	40%	0%	40%	58%
Naked expand		-	-	2	2%	0%	0%	0%
Unknown		-	-	58	58%	0%	0%	0%

timing	#	median	early	late
my combat unit	100	2:54	1:47	4:11
my gas	99	3:17	1:34	7:33
enemy scout	100	1:57	1:18	7:53
enemy combat unit	100	2:41	2:21	4:37
enemy gas	100	4:20	3:37	6:37
enemy air unit	9	9:42	8:30	11:09
enemy cloaked unit	8	9:43	9:14	11:09
game duration	100	8:30	6:50	18:12

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	10	2:12	1:58	2:21	0%	4:22
gas steal success	6	2:15	2:03	2:31	0%	4:32
none or failed	94	-	-	-	0%	4:17
gas steal killed	6	2:47	2:42	2:58

Steamhammer lost every game, but there is still valuable info here. If you’re losing all games, the game duration is a plausible proxy for how much trouble you caused the opponent. Especially so if you tried a rush opening and ended up in a long game—either the rush did some damage, or the opponent overreacted and was slowed down. Here, a couple of 7 pool builds were among the longest games. Steamhammer probably should have repeated them.
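As a sketch of how that proxy could be used, the snippet below ranks a handful of openings by their loss times from the table above. It is not part of Steamhammer's learning code, and the helper name is hypothetical. For openings played more than once, the figure is the median.

```python
# Loss times copied from a few rows of the Stardust opening table.
LOSS_TIMES = {
    "12Hatch_4HatchLing": "14:08",
    "7PoolHard": "14:07",
    "7PoolSoft": "13:09",
    "11HatchTurtleMuta": "10:40",
    "6PoolSpeed": "8:27",
    "4PoolSoft": "8:26",
    "12Gas11PoolMuta": "6:53",
    "HiveRush": "6:50",
}


def to_seconds(mmss):
    """Convert an m:ss string to seconds."""
    minutes, seconds = mmss.split(":")
    return int(minutes) * 60 + int(seconds)


# Longest losses first: the openings that kept Steamhammer alive the longest.
ranked = sorted(LOSS_TIMES, key=lambda name: to_seconds(LOSS_TIMES[name]), reverse=True)
for name in ranked[:3]:
    print(name, LOSS_TIMES[name])
```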

Notice the 4 pool and the hive rush. Steamhammer tried the whole range. Steamhammer recognized Stardust’s build in 2 games as nexus without cannons, a reaction that Stardust did not have last year. Otherwise, results are similar to last year’s.

#2 bananabrain

opening	games	wins	wins	losses	first	last
11Gas10PoolMuta	1	0%	-	6:04	33	33
11Gas10PoolMutaB	1	0%	-	6:21	56	56
11HatchTurtleLurker	15	53%	15:32	9:43	71	98
11Pool	1	0%	-	14:48	10	10
12-11HatchStem	1	0%	-	16:34	78	78
2x10HatchSlow	7	0%	-	8:30	4	95
3HatchHydra	1	0%	-	12:20	42	42
3HatchLingBurrow	1	0%	-	14:00	19	19
4PoolSoft	1	0%	-	7:55	74	74
6Scout	1	0%	-	8:48	66	66
9Hatch8Pool	1	0%	-	6:12	69	69
9PoolBurrow	8	12%	16:29	12:53	43	82
9PoolHatchSpeed7DroneB	1	0%	-	10:26	1	1
9PoolHatchSpeedAllIn	5	20%	9:49	6:51	58	68
9PoolHatchSpeedSpire	8	0%	-	7:21	3	99
9PoolHatchSpeedSpire2	1	0%	-	7:02	15	15
9PoolSpeed	1	0%	-	11:01	14	14
9PoolSpeedAllIn	1	0%	-	13:02	16	16
9PoolSunkHatch	1	0%	-	11:50	28	28
AntiFact_Overpool11Hatch	1	0%	-	13:18	93	93
AntiZeal_12Hatch	1	0%	-	7:48	26	26
Over10Hatch1Sunk	1	0%	-	15:10	47	47
Over10Hatch2Sunk	1	0%	-	14:38	27	27
Over10Hatch2SunkHard	1	0%	-	16:03	36	36
Over10HatchHydra	1	0%	-	10:35	38	38
Overgas+1	1	0%	-	13:18	85	85
OverhatchExpoLing	11	18%	7:34	14:51	24	83
OverpoolLurker	1	0%	-	6:15	61	61
OverpoolTurtle	6	17%	15:01	15:59	17	96
ZvP_3HatchPoolHydra	15	7%	18:18	8:27	2	70
ZvT_3HatchMuta	1	0%	-	15:27	0	0
ZvZ_12HatchMain	1	0%	-	15:23	6	6
ZvZ_12Pool	1	0%	-	6:39	31	31
33 openings	100	14%	15:24	10:42

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	2	2%	0%	11	11%	0%	0%	50%
Heavy rush	97	97%	14%	63	63%	13%	63%	22%
Safe expand	1	1%	0%	3	3%	67%	0%	0%
Turtle		-	-	1	1%	0%	0%	0%
Unknown		-	-	22	22%	18%	0%	0%

timing	#	median	early	late
my combat unit	100	3:03	1:47	4:38
my gas	93	2:57	1:33	7:14
enemy scout	100	2:10	1:15	5:03
enemy combat unit	100	2:40	2:18	5:47
enemy gas	94	6:05	3:16	9:12
enemy air unit	91	6:07	3:17	11:33
enemy cloaked unit	57	9:26	6:06	14:59
game duration	100	11:45	6:04	21:43

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	10	2:30	1:51	3:33	10%	5:42
gas steal success	3	2:10	1:55	2:11	0%	4:22
none or failed	97	-	-	-	14%	6:07
gas steal killed	3	2:50	2:48	2:51

In the 150 tournament games, Steamhammer scored 25 wins versus BananaBrain. Of those, 15 were due to BananaBrain suffering a frame timeout. Ouch. The game scores say that BananaBrain was ahead in 11 of the 15 games when it timed out. So the win percentages and times need to be interpreted carefully. The wins overall were longer games than the losses, possibly because BananaBrain was more likely to time out in a longer game with a larger game state to model and more units to control.
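A rough sense of how much the timeouts matter, using only the counts in the paragraph above. The two adjusted figures are crude what-ifs, not results: one sets the timeout games aside entirely, the other assumes, hypothetically, that BananaBrain would have won every timeout game in which it was ahead on game score.

```python
games = 150            # tournament games versus BananaBrain
wins = 25              # Steamhammer wins
timeout_wins = 15      # wins where BananaBrain timed out
ahead_at_timeout = 11  # timeout games where BananaBrain led on game score

raw_rate = wins / games

# What-if 1: ignore the timeout games altogether.
rate_without_timeouts = (wins - timeout_wins) / (games - timeout_wins)

# What-if 2: count a timeout game as a win only if BananaBrain was not ahead.
rate_if_leader_wins = (wins - ahead_at_timeout) / games

print(f"{raw_rate:.1%} {rate_without_timeouts:.1%} {rate_if_leader_wins:.1%}")
```

Being ahead on game score does not guarantee a win, so the last figure is a pessimistic bound rather than an estimate.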

11HatchTurtleLurker scored over 50% in 15 games! Is it particularly good at prompting BananaBrain to time out? If I’d known about it ahead of time, I could have added it to my preparation and perhaps scored higher.

The build 2x10HatchSlow is shown as tried 7 times with no wins. I know from watching games that the opening scored wins earlier in the tournament, before the final 100 games; that is why it was tried so often later on. The build is very similar to Broken Horn’s 10 hatch-9 hatch-pool, but (I think) slightly more efficient. Apparently BananaBrain learned to avoid lines that lose to the mass of slow zerglings.

Successfully stealing its gas caused BananaBrain to take its gas sooner. I haven’t seen that before. In any case, it was only 3 games; Steamhammer found the gas steal unprofitable.

#3 dragon

opening	games	wins	wins	losses	first	last
2HatchLurkerAllIn	1	0%	-	30:51	41	41
3HatchHydra	1	0%	-	10:16	33	33
5HatchPool	24	71%	13:23	28:23	6	94
7-7HydraLingRush	1	0%	-	16:57	45	45
9PoolFastLurker	9	33%	9:16	27:47	1	92
9PoolHatchSpeed	4	25%	3:31	16:32	17	58
9PoolSunkSpeed	2	0%	-	26:54	14	38
AntiFact_13Pool	17	65%	18:05	16:34	50	96
AntiZeal_12Hatch	1	0%	-	38:54	12	12
ZvP_4HatchPoolHydra	8	62%	5:57	15:58	65	99
ZvT_3HatchMutaExpo	32	78%	15:50	24:55	0	98
11 openings	100	62%	15:36	25:13

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	41	41%	56%	17	17%	47%	24%	41%
Naked expand		-	-	4	4%	50%	0%	0%
Safe expand	23	23%	74%	15	15%	80%	9%	48%
Turtle		-	-	1	1%	0%	0%	0%
Unknown		-	-	31	31%	68%	0%	0%
Worker rush	36	36%	61%	32	32%	59%	75%	8%

timing	#	median	early	late
my combat unit	98	3:12	2:13	7:53
my gas	80	3:49	1:34	12:10
enemy scout	98	2:11	0:53	12:07
enemy combat unit	82	2:48	2:21	8:38
enemy gas	81	6:04	2:44	10:40
enemy air unit	74	9:39	4:31	17:18
enemy cloaked unit	62	10:51	5:50	19:39
game duration	100	16:28	3:31	60:00

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	17	2:09	1:56	2:58	53%	6:31
gas steal success	9	2:16	2:07	2:30	44%	7:04
none or failed	91	-	-	-	64%	6:03
gas steal killed	9	4:05	3:08	5:05

The most successful openings were 5HatchPool (5 hatcheries before pool, a supremely greedy build to exploit bots that never attack early) and ZvT_3HatchMutaExpo, the two openings I selected as preparation. For bots carried over from the previous year, good preparation is easier.

Dragon has a chaotic play style. Steamhammer’s wildest game of the tournament may be Steamhammer-Dragon on Longinus (replay file). Dragon played a V strategy: Vultures, valkyries, and vessels.

#5 mcrave

opening	games	wins	wins	losses	first	last
11Gas10PoolMuta	1	0%	-	10:07	89	89
12Gas11PoolLurker	1	0%	-	9:05	40	40
2HatchHydra	1	0%	-	6:32	45	45
2HatchMutaPure	1	0%	-	4:02	61	61
4PoolHard	3	0%	-	8:52	14	43
9Pool8GasLurker	1	0%	-	11:25	88	88
9PoolHatchSpeedAllIn	16	62%	6:03	10:25	0	96
9PoolLair	1	0%	-	4:58	68	68
Over10Hatch11Pool	18	44%	10:45	7:54	2	81
OverhatchLateGas	1	0%	-	16:08	53	53
ZvZ_12HatchExpo	1	0%	-	8:18	23	23
ZvZ_12HatchMain	10	30%	11:05	8:26	7	90
ZvZ_OverpoolTurtle	45	78%	9:27	11:10	4	99
13 openings	100	56%	9:19	9:46

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush	23	23%	48%	5	5%	60%	4%	74%
Turtle	77	77%	58%	11	11%	0%	8%	87%
Unknown		-	-	84	84%	63%	0%	0%

timing	#	median	early	late
my combat unit	99	2:26	1:49	3:19
my gas	94	2:09	1:43	5:02
enemy scout	99	2:57	0:41	5:25
enemy combat unit	100	2:32	1:49	4:26
enemy gas	98	3:47	2:52	6:12
enemy air unit	94	5:05	4:01	7:07
enemy cloaked unit	0	-	-	-
game duration	100	9:27	4:02	24:09

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	7	2:13	1:58	2:55	57%	2:58
gas steal success	0	-	-	-	-	-
none or failed	100	-	-	-	56%	3:47
gas steal killed	0	-	-	-

Again two of my prepared builds, 9PoolHatchSpeedAllIn and ZvZ_OverpoolTurtle, were the top choices. Both are tough for most zerg bots to handle. My other prepared build, ZvZ_Overgas9Pool, does not appear in these 100 games. Apparently it flopped and was abandoned early. Rushy builds ended up winning faster than they lost, and more macro builds were the reverse, as you might expect. The timing table shows that McRave went spire nearly every game (overlords do not count as “air units” there), and not slowly. That’s normal for ZvZ, of course, but it shows that McRave did not favor builds to overrun the opponent with zerglings.

#6 willyt

opening	games	wins	wins	losses	first	last
12Hatch_4HatchLing	1	0%	-	14:08	73	73
2.5HatchMutaExpo	4	50%	19:42	14:04	76	94
9HatchExpo9Pool9Gas	1	0%	-	17:39	56	56
9PoolHatchSpeedAllIn	13	38%	4:53	8:51	0	97
9PoolHatchSpeedSpire2	1	0%	-	9:33	70	70
9PoolLair	1	0%	-	16:43	30	30
9PoolLurker	15	80%	12:15	20:15	3	98
9PoolSpeed	13	46%	6:26	12:49	1	90
9PoolSpeedAllIn	12	67%	5:50	9:32	12	99
ZvT_13Pool	25	64%	19:35	20:09	4	81
ZvT_2HatchMuta	1	0%	-	22:10	29	29
ZvT_3HatchMuta	13	54%	17:35	18:49	22	78
12 openings	100	56%	13:56	14:57

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	1	1%	100%	1	1%	100%	0%	0%
Fast rush	8	8%	38%	3	3%	67%	0%	75%
Heavy rush	5	5%	80%	2	2%	0%	0%	60%
Naked expand	47	47%	57%	16	16%	100%	17%	64%
Safe expand	39	39%	54%	13	13%	46%	10%	67%
Unknown		-	-	65	65%	48%	0%	0%

timing	#	median	early	late
my combat unit	100	2:18	2:13	5:58
my gas	100	2:14	1:45	6:22
enemy scout	100	2:14	1:42	7:17
enemy combat unit	100	2:58	2:06	6:14
enemy gas	85	5:16	3:16	7:59
enemy air unit	44	14:59	8:39	23:14
enemy cloaked unit	31	15:22	7:19	20:23
game duration	100	14:30	4:41	60:00

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	10	2:06	1:56	3:09	10%	5:34
gas steal success	8	2:14	2:06	3:15	0%	5:42
none or failed	92	-	-	-	61%	5:15
gas steal killed	8	3:59	3:03	4:20

WillyT has become much stronger over the past year. It is better at handling Steamhammer’s lurker builds—except for the especially early 9 pool lurker build, which apparently catches it unready. Steamhammer’s improvements in lurker play were important to keep up with progress. I think Steamhammer’s diverse mix of openings was essential to counter WillyT, which has its own diverse mix and will figure out how to counter anything that is too predictable.

Steamhammer’s closest game of the tournament was Steamhammer-WillyT on Empire of the Sun (replay file). Steamhammer decisively stopped WillyT from taking the nearby north island base, but allowed it to hold the distant south island despite scouting it the instant it started. Notice WillyT’s interesting but somewhat uncoordinated dropship play throughout the game.

#7 microwave

opening	games	wins	wins	losses	first	last
5HatchPool	1	0%	-	5:23	18	18
8Hatch7Pool	5	80%	10:15	9:32	20	59
973HydraBust	5	40%	13:28	5:16	54	91
9HatchMain9Pool9Gas	1	0%	-	4:32	56	56
9PoolHatchBurrow	1	0%	-	5:26	46	46
9PoolHatchSpeedAllIn	20	80%	6:48	12:00	0	99
9PoolHatchSpeedSpire	24	83%	11:05	6:06	4	93
9PoolSpeedSpire	1	0%	-	11:09	81	81
ZvZ_12HatchMain	20	85%	11:20	17:50	65	96
ZvZ_12PoolMain	11	73%	9:35	5:06	8	97
ZvZ_Overpool9Gas	11	64%	13:27	17:08	2	42
11 openings	100	74%	11:14	8:31

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	17	17%	71%	14	14%	93%	6%	59%
Heavy rush	49	49%	71%	25	25%	76%	27%	31%
Naked expand	26	26%	81%	16	16%	62%	12%	35%
Turtle	8	8%	75%	9	9%	89%	12%	25%
Unknown		-	-	36	36%	67%	0%	0%

timing	#	median	early	late
my combat unit	98	2:25	2:13	3:15
my gas	97	2:31	1:47	7:09
enemy scout	98	2:30	1:22	4:43
enemy combat unit	100	2:32	1:05	3:31
enemy gas	66	4:35	2:25	17:32
enemy air unit	52	7:51	3:43	17:33
enemy cloaked unit	0	-	-	-
game duration	100	10:57	4:27	25:22

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	8	2:11	1:53	2:41	75%	4:33
gas steal success	2	2:29	2:14	2:43	100%	4:34
none or failed	98	-	-	-	73%	4:35
gas steal killed	2	2:40	2:21	2:58

Microwave had too many weaknesses. Of the 4 openings with win rates of 80% or higher, 3 were from preparation and one was Steamhammer’s discovery during the tournament. It’s interesting that the 12 hatch build ZvZ_12HatchMain was faster to win than to lose. I think that means it won with zerglings from its extra larvas.

The plan table shows that Microwave followed its own broad range of plans. In the timing table, see the wide and matching variation in Microwave’s gas timing and air unit timing. Did Microwave never get zergling speed in long games?

#8 daqin

opening	games	wins	wins	losses	first	last
11Gas10PoolMuta	1	0%	-	11:29	24	24
2HatchLingAllInSpire	8	12%	9:37	12:16	52	92
2HatchLurkerPure	1	0%	-	15:31	45	45
2x10HatchSlow	1	0%	-	9:52	55	55
3HatchHydra	2	0%	-	17:14	72	93
3HatchHydraBust	5	20%	18:07	12:09	22	91
3HatchHydraExpo	1	0%	-	11:04	6	6
3HatchLing	12	33%	7:06	11:29	3	73
3HatchLingExpo	12	17%	34:02	11:35	36	97
4HatchBeforeGas	2	0%	-	12:31	2	21
4HatchBeforeLair	2	0%	-	11:47	67	99
5HatchBeforeGas	1	0%	-	11:11	68	68
5PoolHard2Player	1	0%	-	9:41	4	4
9HatchExpo9Pool9Gas	10	30%	8:12	12:25	75	95
9Pool9Hatch	1	0%	-	12:24	32	32
AntiZeal_12Hatch	1	0%	-	11:40	41	41
Over10Hatch11Pool	1	0%	-	14:04	31	31
Over10Hatch2Sunk	1	0%	-	15:21	70	70
Over10PoolHydra	1	0%	-	9:43	74	74
OverhatchExpoLing	30	40%	6:34	10:26	0	98
OverhatchLateGas	1	0%	-	12:30	96	96
OverhatchMuta	1	0%	-	14:16	29	29
ZvP_3BaseSpire+Den	2	0%	-	14:06	5	25
ZvP_3HatchPoolHydra	2	0%	-	13:17	50	78
24 openings	100	23%	7:02	11:40

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush	4	4%	50%	10	10%	0%	25%	0%
Naked expand	3	3%	0%	11	11%	100%	0%	0%
Safe expand	58	58%	24%	41	41%	10%	40%	3%
Turtle	35	35%	20%	34	34%	24%	40%	6%
Unknown		-	-	4	4%	0%	0%	0%

timing	#	median	early	late
my combat unit	100	3:07	1:53	3:54
my gas	100	2:47	1:47	6:26
enemy scout	100	1:31	1:10	9:29
enemy combat unit	100	4:33	4:06	6:41
enemy gas	94	5:28	5:06	6:52
enemy air unit	12	16:50	9:10	20:14
enemy cloaked unit	24	12:39	6:35	17:59
game duration	100	11:18	5:42	60:00

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	30	2:08	1:55	2:36	23%	5:35
gas steal success	24	2:17	2:06	2:25	25%	5:36
none or failed	76	-	-	-	22%	5:23
gas steal killed	24	2:47	2:35	3:06

After this upset, I think I’ll take DaQin as a test opponent and finally figure out the skills to defeat it. DaQin is a Locutus fork, so beating it probably means doing better against other protoss bots.

The timing table shows that DaQin was remarkably late with air units. That covers corsairs and observers alike; DaQin was late with both. In fact, I don’t remember whether it makes corsairs at all. Mutalisks might be a good choice to win.

#9 freshmeat

opening	games	wins	win time	loss time	first	last
11Gas10PoolMuta	12	67%	7:06	5:50	2	94
8PoolHard	6	33%	8:20	9:09	14	45
9Hatch8Pool	1	0%	-	6:48	92	92
9PoolHatchSpeedAllInB	37	84%	5:56	5:55	5	99
9PoolSunkHatch	8	62%	5:06	9:46	4	74
9PoolSunkSpeed	8	25%	8:01	7:04	0	52
Overpool14Hatch	1	0%	-	6:19	86	86
OverpoolSunk	17	71%	9:18	9:45	3	97
ZvT_13Pool	3	33%	7:33	5:43	90	93
ZvZ_12PoolMain	7	43%	7:07	5:35	11	87
10 openings	100	64%	6:21	6:53

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	13	13%	38%	14	14%	71%	31%	31%
Heavy rush	57	57%	65%	30	30%	67%	25%	32%
Naked expand		-	-	2	2%	100%	0%	0%
Turtle	30	30%	73%	25	25%	52%	10%	20%
Unknown		-	-	28	28%	64%	0%	0%
Worker rush		-	-	1	1%	100%	0%	0%

timing	#	median	early	late
my combat unit	100	2:17	2:09	3:27
my gas	98	2:53	1:46	7:53
enemy scout	79	2:31	1:26	7:29
enemy combat unit	100	2:34	1:05	5:17
enemy gas	39	4:01	2:55	9:29
enemy air unit	29	4:43	4:01	5:51
enemy cloaked unit	0	-	-	-
game duration	100	6:30	4:17	16:24

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	11	2:15	1:55	2:57	45%	3:31
gas steal success	6	2:18	2:01	3:01	33%	-
none or failed	94	-	-	-	66%	4:01
gas steal killed	6	3:04	2:31	4:06

When I was preparing opponent-specific data, Steamhammer had an overwhelming score against FreshMeat on BASIL. This result is good but not overwhelming; FreshMeat improved a lot in a short time. I had recognized that FreshMeat had made great strides, but there was not enough recent data to show what was working in the most recent games. So I made no preparation at all. These tables show an example of how Steamhammer figures out an opponent from scratch. I think it did OK.

#10 ualbertabot

opening	games	wins	win time	loss time	first	last
Over10HatchSlowLings	1	0%	-	8:16	99	99
OverhatchExpoMuta	17	59%	5:21	6:26	21	95
OverpoolTurtle	82	94%	6:17	11:51	0	98
3 openings	100	87%	6:00	8:16

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	10	10%	100%	13	13%	100%	10%	0%
Fast rush	33	33%	82%	27	27%	85%	36%	18%
Heavy rush	49	49%	90%	31	31%	81%	31%	20%
Naked expand	8	8%	75%	12	12%	100%	0%	12%
Unknown		-	-	17	17%	82%	0%	0%

timing	#	median	early	late
my combat unit	100	2:26	2:15	3:13
my gas	99	2:58	2:39	6:33
enemy scout	88	2:08	1:21	9:58
enemy combat unit	89	2:33	1:47	4:30
enemy gas	82	3:44	2:37	14:24
enemy air unit	14	14:20	11:50	15:59
enemy cloaked unit	10	14:21	2:37	16:46
game duration	100	6:31	4:35	21:33

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	8	2:32	1:48	2:54	88%	7:00
gas steal success	4	2:29	2:10	2:44	75%	13:15
none or failed	96	-	-	-	88%	3:37
gas steal killed	4	3:02	2:51	3:06

Comparing this year to last year, Steamhammer actually did a little worse against UAlbertaBot. The skills I improved over the last year didn’t include skills to defeat UAlbertaBot’s pressure builds, or to adapt better to its random race.

overall

	total		ZvT		ZvP		ZvZ		ZvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
10Hatch	1	0%			1	0%
10HatchHydra	1	0%			1	0%
11Gas10PoolMuta	15	53%			2	0%	13	62%
11Gas10PoolMutaB	1	0%			1	0%
11HatchTurtleHydra	3	0%			3	0%
11HatchTurtleLurker	15	53%			15	53%
11HatchTurtleMuta	3	0%			3	0%
11Pool	2	0%			2	0%
12-11HatchStem	1	0%			1	0%
12Gas11PoolLurker	1	0%					1	0%
12Gas11PoolMuta	1	0%			1	0%
12Hatch_4HatchLing	2	0%	1	0%	1	0%
2.5HatchMutaExpo	4	50%	4	50%
2HatchHydra	1	0%					1	0%
2HatchLingAllInSpire	8	12%			8	12%
2HatchLurkerAllIn	2	0%	1	0%	1	0%
2HatchLurkerPure	1	0%			1	0%
2HatchMutaPure	1	0%					1	0%
2x10Hatch	1	0%			1	0%
2x10HatchBurrow	1	0%			1	0%
2x10HatchSlow	8	0%			8	0%
3HatchHydra	4	0%	1	0%	3	0%
3HatchHydraBust	5	20%			5	20%
3HatchHydraExpo	5	0%			5	0%
3HatchLateHydras	1	0%			1	0%
3HatchLing	13	31%			13	31%
3HatchLingBurrow	2	0%			2	0%
3HatchLingExpo	14	14%			14	14%
4HatchBeforeGas	2	0%			2	0%
4HatchBeforeLair	3	0%			3	0%
4PoolHard	3	0%					3	0%
4PoolSoft	2	0%			2	0%
5HatchBeforeGas	1	0%			1	0%
5HatchPool	25	68%	24	71%			1	0%
5PoolHard2Player	1	0%			1	0%
6Pool	2	0%			2	0%
6PoolHide	1	0%			1	0%
6PoolSpeed	6	0%			6	0%
6Scout	1	0%			1	0%
7-7HydraLingRush	1	0%	1	0%
7DroneGas	1	0%			1	0%
7Pool10Hatch	1	0%			1	0%
7Pool12Hatch	1	0%			1	0%
7Pool6GasLurker B	1	0%			1	0%
7PoolHard	1	0%			1	0%
7PoolHarder	1	0%			1	0%
7PoolMid	1	0%			1	0%
7PoolSoft	1	0%			1	0%
8Hatch7Pool	5	80%					5	80%
8Hatch7PoolBurrow	1	0%			1	0%
8Hatch7PoolBurrowB	1	0%			1	0%
8PoolHard	6	33%					6	33%
8Scout	1	0%			1	0%
973HydraBust	5	40%					5	40%
9Hatch8Pool	2	0%			1	0%	1	0%
9HatchExpo9Pool9Gas	13	23%	1	0%	12	25%
9HatchMain9Pool9Gas	1	0%					1	0%
9Pool8GasLurker	1	0%					1	0%
9Pool8Hatch	1	0%			1	0%
9Pool9Hatch	2	0%			2	0%
9PoolBurrow	9	11%			9	11%
9PoolBurrowB	1	0%			1	0%
9PoolFastLurker	9	33%	9	33%
9PoolHatchBurrow	1	0%					1	0%
9PoolHatchSpeed	4	25%	4	25%
9PoolHatchSpeed7Drone	2	0%			2	0%
9PoolHatchSpeed7DroneB	3	0%			3	0%
9PoolHatchSpeedAllIn	54	59%	13	38%	5	20%	36	72%
9PoolHatchSpeedAllInB	38	82%			1	0%	37	84%
9PoolHatchSpeedSpire	32	62%			8	0%	24	83%
9PoolHatchSpeedSpire2	2	0%	1	0%	1	0%
9PoolLair	3	0%	1	0%	1	0%	1	0%
9PoolLurker	16	75%	15	80%	1	0%
9PoolSpeed	16	38%	13	46%	3	0%
9PoolSpeedAllIn	13	62%	12	67%	1	0%
9PoolSpeedSpire	1	0%					1	0%
9PoolSpire	1	0%			1	0%
9PoolSunkHatch	9	56%			1	0%	8	62%
9PoolSunkSpeed	11	18%	2	0%	1	0%	8	25%
AntiFact_13Pool	18	61%	17	65%	1	0%
AntiFact_2Hatch	3	0%			3	0%
AntiFact_Overpool11Hatch	1	0%			1	0%
AntiFactoryHydra	1	0%			1	0%
AntiZeal_12Hatch	6	0%	1	0%	5	0%
HiveRush	1	0%			1	0%
Over10Hatch	2	0%			2	0%
Over10Hatch11Pool	19	42%			1	0%	18	44%
Over10Hatch1Sunk	2	0%			2	0%
Over10Hatch2Sunk	5	0%			5	0%
Over10Hatch2SunkHard	2	0%			2	0%
Over10HatchBust	2	0%			2	0%
Over10HatchHydra	1	0%			1	0%
Over10HatchSlowLings	3	0%			2	0%			1	0%
Over10PoolHydra	1	0%			1	0%
Overgas+1	1	0%			1	0%
OverhatchExpoLing	42	33%			42	33%
OverhatchExpoMuta	17	59%							17	59%
OverhatchLateGas	2	0%			1	0%	1	0%
OverhatchLing	1	0%			1	0%
OverhatchMuta	1	0%			1	0%
Overpool14Hatch	2	0%			1	0%	1	0%
Overpool2HatchLurker	2	0%			2	0%
OverpoolLurker	2	0%			2	0%
OverpoolSunk	17	71%					17	71%
OverpoolTurtle	88	89%			6	17%			82	94%
OverpoolTurtle 0	1	0%			1	0%
Overpool_3HatchLing	1	0%			1	0%
PurpleSwarmBuild	1	0%			1	0%
ZvP_2HatchMuta	2	0%			2	0%
ZvP_3BaseSpire+Den	2	0%			2	0%
ZvP_3HatchPoolHydra	17	6%			17	6%
ZvP_4HatchPoolHydra	8	62%	8	62%
ZvP_Overpool3Hatch	1	0%			1	0%
ZvT_13Pool	30	57%	25	64%	2	0%	3	33%
ZvT_2HatchMuta	1	0%	1	0%
ZvT_3HatchMuta	14	50%	13	54%	1	0%
ZvT_3HatchMutaExpo	32	78%	32	78%
ZvT_7Pool	1	0%			1	0%
ZvZ_12HatchExpo	1	0%					1	0%
ZvZ_12HatchMain	31	65%			1	0%	30	67%
ZvZ_12Pool	2	0%			2	0%
ZvZ_12PoolMain	18	61%					18	61%
ZvZ_Overpool11Gas	1	0%			1	0%
ZvZ_Overpool9Gas	12	58%			1	0%	11	64%
ZvZ_OverpoolTurtle	45	78%					45	78%
total	900	48%	200	59%	300	12%	300	65%	100	87%
openings played	125		23		101		30		3

AIIDE 2021 - BananaBrain versus Dragon

BananaBrain and Dragon both recorded their own opening builds for all 157 games played, so I can align their learning files and see how their strategies matched up against each other. BananaBrain also recorded its representation of what the opponent played, so I can compare its idea of Dragon’s build with Dragon’s own idea. I first did this last year. Dragon is carried over from last year unchanged, while BananaBrain is much stronger now.

The win rates and coloring are from the point of view of BananaBrain. Blue is good for BananaBrain and red is good for Dragon.

bananabrain strategies versus dragon strategies

	overall	1rax fe	2rax bio	2rax mech	bio	dirty worker rush	mass vulture	siege expand
overall	117/157 75%	8/8 100%	19/30 63%	8/8 100%	12/13 92%	8/8 100%	27/36 75%	35/54 65%
PvT_10/12gate	34/48 71%	1/1 100%	9/17 53%	1/1 100%	2/2 100%	2/2 100%	3/5 60%	16/20 80%
PvT_1gatedtexpo	0/1 0%	-	0/1 0%	-	-	-	-	-
PvT_28nexus	3/6 50%	-	1/2 50%	-	-	-	1/1 100%	1/3 33%
PvT_2gaterngexpo	2/4 50%	-	0/1 0%	-	-	-	1/1 100%	1/2 50%
PvT_32nexus	0/1 0%	-	-	-	-	-	-	0/1 0%
PvT_9/9gate	78/96 81%	7/7 100%	9/9 100%	7/7 100%	10/11 91%	6/6 100%	22/29 76%	17/27 63%
PvT_9/9proxygate	0/1 0%	-	-	-	-	-	-	0/1 0%

dragon as seen by bananabrain

dragon played	#	bananabrain recognized
1rax fe	8	7 T_unknown | 1 T_fastexpand
2rax bio	30	30 T_unknown
2rax mech	8	8 T_unknown
bio	13	13 T_unknown
dirty worker rush	8	8 T_unknown
mass vulture	36	21 T_1fac | 14 T_unknown | 1 T_2fac
siege expand	54	38 T_1fac | 16 T_unknown

Last year this table showed that BananaBrain was weak at recognizing Dragon’s builds, with a lot of unknowns. There are more recognized builds this year, but BananaBrain plays differently so I’m not sure whether BananaBrain has improved at recognition. What is clear is that everything is blue. Recognizing some builds does not seem to have helped BananaBrain; it did well no matter what.

AIIDE 2021 - what Dragon learned

Dragon records for each game only its own build and win/loss, so the information is sparse. It has a total of 7 builds. Dragon is a carryover bot, and I analyzed its game records from AIIDE 2020 last year. Dragon considers that “the opening” is a very brief phase of the game: It quickly adapts to what it sees of the opponent’s play, and the opening build fades out of view. Last year I found that, against many opponents, the choice of opening build made little difference; the game was decided later.

#1 stardust

opening	games	wins	first	last
1rax fe	24	0%	5	156
2rax bio	35	6%	1	153
2rax mech	18	0%	2	146
bio	20	5%	3	144
dirty worker rush	19	0%	0	133
mass vulture	20	0%	4	152
siege expand	21	0%	6	150
7 openings	157	2%

It’s interesting that the only openings to make a dent were “2rax bio” and “bio”. Was Stardust surprised by marines? If Stardust made zealots to get units faster, that may have backfired.

#2 bananabrain

opening	games	wins	first	last
1rax fe	8	0%	6	82
2rax bio	30	37%	1	156
2rax mech	8	0%	7	109
bio	13	8%	8	140
dirty worker rush	8	0%	5	148
mass vulture	36	25%	2	151
siege expand	54	35%	0	145
7 openings	157	25%

Again, the marines were a relatively successful choice against protoss. It’s a surprise. #8 DaQin below is different. Yesterday we saw that BananaBrain liked zealot openings against Dragon, and it’s true that marines with good micro can hold their own against zealots. Maybe BananaBrain liked zealots because they upset Dragon’s tech builds, and Dragon found that marines answered best, so that the two settled into this equilibrium with neither bot able to 100% counter the other. It’s a nice story, at least.

#4 steamhammer

opening	games	wins	first	last
1rax fe	53	45%	8	124
2rax bio	7	14%	0	153
2rax mech	12	25%	4	121
bio	17	29%	6	135
dirty worker rush	47	40%	2	156
mass vulture	15	27%	10	117
siege expand	6	0%	5	122
7 openings	157	36%

Switching between opposite builds like fast expand (“1rax fe”) and worker rush (“dirty worker rush”) is not a bad plan for defeating Steamhammer. I don’t think Dragon did it on purpose, though. Most openings scored about equal.

#5 mcrave

opening	games	wins	first	last
1rax fe	20	80%	34	153
2rax bio	40	65%	0	109
2rax mech	17	65%	23	90
bio	24	62%	64	100
dirty worker rush	2	0%	20	98
mass vulture	6	33%	63	137
siege expand	46	72%	11	154
7 openings	155	66%

Again, most were about equal, with only a couple of exceptions. “1rax fe” was also best against last year’s McRave, even though it played rather differently.

#6 willyt

opening	games	wins	first	last
2rax bio	1	0%	53	53
mass vulture	154	98%	0	154
2 openings	155	97%

Last year Dragon chose “2rax mech” as its build to trample on WillyT, and won less often with it (94%) even though this year’s WillyT is substantially stronger. I think it found something that worked and felt no need to experiment any further.

#7 microwave

opening	games	wins	first	last
1rax fe	24	67%	1	120
2rax bio	28	61%	31	152
2rax mech	66	74%	0	149
bio	26	62%	46	147
dirty worker rush	2	0%	8	124
mass vulture	5	40%	12	156
siege expand	6	33%	43	153
7 openings	157	65%

#8 daqin

opening	games	wins	first	last
1rax fe	98	56%	8	156
2rax bio	6	17%	14	104
2rax mech	11	36%	1	105
bio	13	31%	7	136
dirty worker rush	5	0%	5	153
mass vulture	20	50%	9	103
siege expand	4	0%	0	132
7 openings	157	47%

Best was the fast expansion. (“1rax fe” is faster than “siege expand”.) That makes sense against DaQin’s style of play.

#9 freshmeat

opening	games	wins	first	last
1rax fe	12	25%	4	128
2rax bio	12	33%	18	143
2rax mech	25	36%	0	141
bio	10	30%	9	139
dirty worker rush	34	35%	7	155
mass vulture	49	53%	10	156
siege expand	15	27%	15	142
7 openings	157	39%

Mostly about equal again. At some point I’ll look at the games and see how FreshMeat upset Dragon.

#10 ualbertabot

opening	games	wins	first	last
1rax fe	40	92%	116	155
2rax bio	6	67%	54	73
2rax mech	2	50%	114	115
bio	13	77%	36	49
dirty worker rush	17	76%	50	74
mass vulture	76	84%	0	113
siege expand	2	50%	85	110
7 openings	156	83%

CoG 2021 data released

Source code, learning files, and replays from CoG 2021 were released today, though there is still a note above them saying “(You cannnot access replay files yet)”.

AIIDE 2021 - what BananaBrain learned

Here’s my summary of BananaBrain’s learning files. BananaBrain records both its own strategy and the recognized enemy strategy for every game.

#1 stardust

opening	games	wins	first	last
PvP_10/12gate	5	0%	9	121
PvP_12nexus	5	0%	10	119
PvP_2gatedt	5	0%	3	120
PvP_2gatedtexpo	16	19%	0	125
PvP_2gatereaver	5	0%	1	118
PvP_3gaterobo	5	0%	13	123
PvP_3gatespeedzeal	5	0%	5	116
PvP_4gategoon	24	25%	12	126
PvP_9/9gate	5	0%	6	122
PvP_9/9proxygate	38	29%	8	156
PvP_nzcore	13	15%	4	149
PvP_zcore	5	0%	7	117
PvP_zcorez	9	11%	11	144
PvP_zzcore	17	24%	2	127
14 openings	157	17%

enemy	games	wins
P_1gatecore	6	50%
P_2gate	26	19%
P_2gatefast	13	31%
P_4gategoon	107	14%
P_cannonturtle	1	0%
P_unknown	4	0%
6 openings	157	17%

The most successful: Double proxy gates. Stardust plays the same every game, except for reactions to its opponent, so it’s interesting that BananaBrain diagnosed so many different openings. I suspect that they were all, or nearly all, 4 gate goon, and BananaBrain was not always able to scout long enough to see it. I think the variety is what you get when BananaBrain sees only part of the build.

#3 dragon

opening	games	wins	first	last
PvT_10/12gate	48	71%	0	156
PvT_1gatedtexpo	1	0%	16	16
PvT_28nexus	6	50%	13	119
PvT_2gaterngexpo	4	50%	10	91
PvT_32nexus	1	0%	89	89
PvT_9/9gate	96	81%	2	118
PvT_9/9proxygate	1	0%	92	92
7 openings	157	75%

enemy	games	wins
T_1fac	59	66%
T_2fac	1	100%
T_fastexpand	1	100%
T_unknown	96	79%
4 openings	157	75%

The best builds were zealot builds. BananaBrain seems to be especially successful with early zealot pressure.

#4 steamhammer

opening	games	wins	first	last
PvZ_10/12gate	3	67%	5	7
PvZ_1basespeedzeal	21	86%	37	157
PvZ_2basespeedzeal	9	78%	21	149
PvZ_4gate2archon	1	0%	31	31
PvZ_5gategoon	2	50%	29	30
PvZ_9/9gate	92	88%	61	156
PvZ_9/9proxygate	1	0%	57	57
PvZ_bisu	5	80%	32	36
PvZ_neobisu	12	83%	8	19
PvZ_sairdt	3	67%	146	148
PvZ_sairgoon	1	0%	20	20
PvZ_sairreaver	3	67%	58	60
PvZ_stove	5	80%	0	4
13 openings	158	83%

enemy	games	wins
Z_10hatch	32	84%
Z_12hatch	57	75%
Z_12hatchmain	1	100%
Z_12pool	2	100%
Z_4/5pool	1	100%
Z_9pool	23	96%
Z_9poolspeed	8	88%
Z_overpool	19	84%
Z_unknown	15	80%
9 openings	158	83%

Again, zealot builds. Steamhammer tried a wide variety of counters, of which 12 hatch worked best. BananaBrain records only the earliest steps of zerg openings, so what BananaBrain calls Z_12hatch could have a range of followups.

#5 mcrave

opening	games	wins	first	last
PvZ_10/12gate	54	85%	17	119
PvZ_1basespeedzeal	3	67%	58	60
PvZ_2basespeedzeal	5	80%	1	5
PvZ_4gate2archon	1	0%	61	61
PvZ_5gategoon	1	0%	66	66
PvZ_9/9gate	1	0%	6	6
PvZ_9/9proxygate	8	75%	24	100
PvZ_bisu	5	80%	53	57
PvZ_neobisu	4	75%	62	65
PvZ_sairdt	3	67%	14	16
PvZ_sairgoon	12	83%	7	105
PvZ_sairreaver	1	0%	0	0
PvZ_stove	59	88%	31	156
13 openings	157	82%

enemy	games	wins
Z_12hatch	84	81%
Z_12pool	2	0%
Z_9pool	23	78%
Z_overpool	45	89%
Z_unknown	3	100%
5 openings	157	82%

Most things worked against McRave, but especially tech openings. The earliest steps of McRave’s openings are stereotyped, so BananaBrain recognized few choices.

#6 willyt

opening	games	wins	first	last
PvT_10/12gate	44	93%	7	50
PvT_12nexus	6	83%	0	5
PvT_2gatedt	1	0%	6	6
PvT_32nexus	24	88%	51	74
PvT_9/9proxygate	77	99%	80	156
PvT_dtdrop	2	50%	78	79
PvT_stove	3	67%	75	77
7 openings	157	93%

enemy	games	wins
T_1fac	12	100%
T_2rax	55	95%
T_fastexpand	52	88%
T_unknown	38	95%
4 openings	157	93%

The proxy gates won 76 times out of 77. Ouch.

#7 microwave

opening	games	wins	first	last
PvZ_10/12gate	86	97%	31	156
PvZ_1basespeedzeal	1	0%	23	23
PvZ_2basespeedzeal	2	50%	19	20
PvZ_4gate2archon	2	50%	24	25
PvZ_5gategoon	30	83%	39	79
PvZ_9/9gate	15	80%	9	82
PvZ_9/9proxygate	2	50%	63	64
PvZ_bisu	2	50%	7	8
PvZ_neobisu	2	50%	21	22
PvZ_sairdt	5	80%	0	4
PvZ_sairgoon	5	80%	26	30
PvZ_sairreaver	2	50%	5	6
PvZ_stove	3	67%	16	18
13 openings	157	87%

enemy	games	wins
Z_10hatch	8	100%
Z_12hatch	31	97%
Z_12pool	13	85%
Z_4/5pool	13	100%
Z_9pool	58	79%
Z_9poolspeed	6	100%
Z_overpool	20	75%
Z_unknown	8	88%
8 openings	157	87%

Zealots were best again, though dragoons were good too. I wonder why the economic 10/12 gates were more successful than the fast 9/9 gates? It suggests that Microwave may overdefend, fearing fast zealots, and then lack the economy to hold off the more efficient zealots. Or the difference may lie in the followup after the zealots; BananaBrain likes to expand quickly.

#8 daqin

opening	games	wins	first	last
PvP_2gatedt	10	80%	0	37
PvP_2gatedtexpo	1	0%	6	6
PvP_2gatereaver	142	92%	7	156
PvP_9/9gate	3	67%	31	33
PvP_zcore	1	0%	26	26
5 openings	157	90%

enemy	games	wins
P_1gatecore	69	88%
P_4gategoon	68	91%
P_ffe	1	100%
P_unknown	19	89%
4 openings	157	90%

DaQin was apparently not ready for reavers. Otherwise it did not do badly against a powerful opponent.

#9 freshmeat

opening	games	wins	first	last
PvZ_4gate2archon	8	88%	26	33
PvZ_9/9gate	122	100%	35	156
PvZ_neobisu	14	86%	0	13
PvZ_sairgoon	1	0%	34	34
PvZ_stove	12	83%	14	25
5 openings	157	96%

enemy	games	wins
Z_12hatch	27	85%
Z_12hatchmain	22	91%
Z_12pool	1	100%
Z_4/5pool	27	100%
Z_9pool	11	100%
Z_overpool	3	100%
Z_unknown	66	100%
7 openings	157	96%

#10 ualbertabot

opening	games	wins	first	last
PvU_10/12gate	4	75%	0	3
PvU_9/9gate	1	0%	4	4
PvU_9/9proxygate	5	80%	10	14
PvU_nzcore	5	80%	5	9
PvU_zcore	142	97%	15	156
5 openings	157	95%

enemy	games	wins
P_1gatecore	19	100%
P_2gate	1	100%
P_2gatefast	25	84%
P_4gategoon	3	100%
P_unknown	6	100%
T_1fac	1	100%
T_2fac	22	100%
T_2rax	16	94%
T_unknown	11	100%
Z_12hatch	26	100%
Z_4/5pool	23	87%
Z_overpool	3	100%
Z_unknown	1	100%
13 openings	157	95%

AIIDE 2021 - Stardust table in minutes and seconds

It occurred to me a little late that many people would find the Stardust data table easier to understand if the frame times were converted to minutes and seconds. So here’s that version. See the previous post from today.

		firstDarkTemplarCompleted				pylonInOurMain				firstMutaliskCompleted
opponent	games	n	min	median	max	n	min	median	max	n	min	median	max
bananabrain	155	20	5:15	5:29	16:11	0	-	-	-	0	-	-	-
dragon	156	0	-	-	-	0	-	-	-	0	-	-	-
steamhammer	158	0	-	-	-	0	-	-	-	17	4:59	5:43	7:11
mcrave	157	0	-	-	-	0	-	-	-	124	6:17	7:35	11:12
willyt	157	0	-	-	-	0	-	-	-	0	-	-	-
microwave	157	0	-	-	-	0	-	-	-	17	5:07	5:55	7:54
daqin	156	126	5:13	5:29	12:36	2	1:53	1:54	1:55	0	-	-	-
freshmeat	157	0	-	-	-	0	-	-	-	1	11:40	11:40	11:40
ualbertabot	157	17	4:19	4:29	4:36	0	-	-	-	0	-	-	-
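
The conversion behind this table can be sketched as below. The times shown are consistent with dividing the frame count by 24 frames per second and truncating to whole seconds; that rate is my inference from comparing the frame numbers with the converted times, and `formatFrameTime` is a hypothetical helper name, not anything from Stardust.

```cpp
#include <cassert>
#include <string>

// Assumption: 24 frames per second, inferred by comparing the frame table
// with the minutes-and-seconds table. Fractional seconds are truncated.
constexpr int kFramesPerSecond = 24;

// Hypothetical helper: convert a frame count (possibly a median ending
// in .5) to a "m:ss" string.
std::string formatFrameTime(double frames)
{
    assert(frames >= 0.0);
    const int totalSeconds = static_cast<int>(frames / kFramesPerSecond);
    const int minutes = totalSeconds / 60;
    const int seconds = totalSeconds % 60;
    return std::to_string(minutes) + ":" + (seconds < 10 ? "0" : "") + std::to_string(seconds);
}
```

For example, `formatFrameTime(7579)` gives 5:15, which matches BananaBrain’s earliest dark templar in the table.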

AIIDE 2021 - Stardust’s learning

I investigated how Stardust’s learning works, and what it learned. It’s unusual, so it was worth a close look.

In its learning file of game records for each opponent, Stardust records values for 3 keys for each game: firstDarkTemplarCompleted, pylonInOurMain, and firstMutaliskCompleted. If the event occurs in the game, the value is the frame time of the event; otherwise the value is 2147483647 (INT_MAX, the largest int value, in this C++ implementation). It also records whether the game was a win or a loss. It records the hash of the map, too, but that doesn’t seem to be used again.

summarizing the data

The class Opponent is responsible for providing the learned information to the rest of the bot. It summarizes the game records via two routines.

    int minValueInPreviousGames(const std::string &key, int defaultNoData, int maxCount = INT_MAX, int minCount = 0);

If there are at least minCount games, then look through the game records, most recent first, for up to maxCount games. Look up the key for each game and return its minimum value, or the default value if there are none. This amounts to finding the earliest frame at which the event happened, or the default if it did not happen in the specified number of games.
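
Here is a minimal sketch of that lookup as I have described it, not Stardust’s actual code. The `GameRecord` type, the free-function form, and the extra `games` parameter are assumptions made so the example stands alone. Following the description, it returns the default when the event was not recorded in any examined game; the real code may treat that case differently.

```cpp
#include <cassert>
#include <climits>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical record type: key -> frame for one game. The vector of games
// is assumed to be stored oldest first. INT_MAX means "event did not occur".
using GameRecord = std::map<std::string, int>;

int minValueInPreviousGames(const std::vector<GameRecord> &games,
                            const std::string &key,
                            int defaultNoData,
                            int maxCount = INT_MAX,
                            int minCount = 0)
{
    assert(maxCount >= 0 && minCount >= 0);

    // Not enough history yet: fall back to the default.
    if (games.size() < static_cast<std::size_t>(minCount)) return defaultNoData;

    // Walk backward from the most recent game, examining at most maxCount games.
    int best = INT_MAX;
    int examined = 0;
    for (auto it = games.rbegin(); it != games.rend() && examined < maxCount; ++it, ++examined)
    {
        auto found = it->find(key);
        if (found != it->end() && found->second < best) best = found->second;
    }

    // The event never happened in the examined games: use the default.
    return best == INT_MAX ? defaultNoData : best;
}
```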

    double winLossRatio(double defaultValue, int maxCount = INT_MAX);

Look through the game records, most recent first, for up to maxCount games and return the winning ratio, or the default value if there are no games yet.
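
A sketch of this routine under the same caveats: it is my reconstruction from the description, not the real code. I am assuming that the “winning ratio” is wins divided by games examined, which fits the 0.99 threshold it is compared against, and that results are passed in as a hypothetical vector of booleans stored oldest first.

```cpp
#include <cassert>
#include <climits>
#include <vector>

// Hypothetical input: one bool per game, oldest first; true means a win.
double winLossRatio(const std::vector<bool> &results, double defaultValue, int maxCount = INT_MAX)
{
    assert(maxCount >= 0);

    // Walk backward from the most recent game, examining at most maxCount games.
    int wins = 0;
    int examined = 0;
    for (auto it = results.rbegin(); it != results.rend() && examined < maxCount; ++it, ++examined)
    {
        if (*it) ++wins;
    }

    // No games to look at: return the caller's default.
    if (examined == 0) return defaultValue;
    return static_cast<double>(wins) / examined;
}
```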

using the summarized data

Each of the 3 keys is used in exactly one place in the code. Here is where firstDarkTemplarCompleted is looked up in the PvP strategy code:

    if (Opponent::winLossRatio(0.0, 200) < 0.99)
    {
        expectedCompletionFrame = Opponent::minValueInPreviousGames("firstDarkTemplarCompleted", 7300, 15, 10);
    }

This means “If we’re rolling you absolutely flat (at least 99% wins in the last 200 games), then it doesn’t matter. Otherwise there’s some risk. In the most recent 15 games, find the earliest frame that the first enemy dark templar was (estimated to be) completed, or return frame 7300 if none.” The default frame 7300 is not the earliest a DT can emerge; they can be on the map over a thousand frames earlier. So it is not a worst-case assumption. Further code overrides the frame number if there is scouting information related to dark templar production. It attempts to build a defensive photon cannon just in time for the enemy DT’s arrival, and sometimes to get an observer.

The key pylonInOurMain is part of cannon rush defense. Stardust again checks the win ratio and again looks back 15 games with a minimum game count of 10, this time with a default of 0 if there are not enough games. It starts scouting its base 500 frames (about 21 seconds) ahead of the earliest seen enemy pylon appearing in its base, which may be never. The idea is that Stardust doesn’t waste time scouting its own base if it hasn’t seen you proxy a pylon in the last 15 games, and delays the scout if the pylon is proxied late.
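
The timing arithmetic can be sketched like this. The function and constant names are hypothetical, and using INT_MAX to stand for “never” is my assumption, borrowed from the sentinel in the learning file; I have not shown how Stardust itself represents that case.

```cpp
#include <algorithm>
#include <cassert>
#include <climits>

// Lead time from the description above: start 500 frames early,
// which is about 21 seconds at 24 frames per second.
constexpr int kScoutLeadFrames = 500;

// Hypothetical helper. earliestPylonFrame is the earliest frame at which an
// enemy pylon was seen in our main during the lookback window, or INT_MAX
// (assumed sentinel) if none was seen. Returns the frame at which to start
// scouting our own base, or INT_MAX for "never".
int scoutStartFrame(int earliestPylonFrame)
{
    assert(earliestPylonFrame >= 0);
    if (earliestPylonFrame == INT_MAX) return INT_MAX;
    return std::max(0, earliestPylonFrame - kScoutLeadFrames);
}
```

With an earliest pylon at frame 2721, this gives a scouting start at frame 2221.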

The key firstMutaliskCompleted is used very similarly, to decide whether and when to defend each nexus with cannons. The goal is to get cannons in time in case mutalisks arrive without being scouted. There are simple rules to decide how many cannons at each nexus:

    // Main and natural are special cases, we only get cannons there to defend against air threats
    if (base == Map::getMyMain() || base == Map::getMyNatural())
    {
        if (enemyAirUnits > 6) return 4;
        if (enemyAirThreat) return 3;
        if (enemyDropThreat && BWAPI::Broodwar->getFrameCount() > 8000) return 1;
        return 0;
    }

    // At expansions we get cannons if the enemy is not contained or has an air threat
    if (!Strategist::isEnemyContained() || enemyAirUnits > 0) return 2;
    if (enemyAirThreat || enemyDropThreat) return 1;
    return 0;

If the firstMutaliskCompleted check says that it’s time, it sets enemyAirThreat to true and makes 3 cannons each at main and natural, and at least 1 at each other base.

the data itself

Here’s my summary of the data in Stardust’s files. The files include prepared data. I left the prepared data out; this covers only what was recorded during the tournament. The tournament was run for 157 rounds, although the official results are given after round 150. The table here is data for all 157 rounds. I don’t have a way to tell which unrecorded games were from rounds 1-150 and which were from 151-157... though I think I could guess.

n is the number of games for which a value (other than 2147483647) was recorded for the key. The values are frame numbers.

		firstDarkTemplarCompleted				pylonInOurMain				firstMutaliskCompleted
opponent	games	n	min	median	max	n	min	median	max	n	min	median	max
bananabrain	155	20	7579	7897.5	23319	0	-	-	-	0	-	-	-
dragon	156	0	-	-	-	0	-	-	-	0	-	-	-
steamhammer	158	0	-	-	-	0	-	-	-	17	7188	8241	10355
mcrave	157	0	-	-	-	0	-	-	-	124	9070	10939	16146
willyt	157	0	-	-	-	0	-	-	-	0	-	-	-
microwave	157	0	-	-	-	0	-	-	-	17	7371	8534	11397
daqin	156	126	7533	7912.5	18154	2	2721	2743.5	2766	0	-	-	-
freshmeat	157	0	-	-	-	0	-	-	-	1	16801	16801	16801
ualbertabot	157	17	6230	6477	6627	0	-	-	-	0	-	-	-
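
The per-key columns in this table can be computed with a small summary routine like the sketch below. It is an illustration of the calculation, not the script I used; `KeySummary` and `summarize` are hypothetical names. Values of INT_MAX are dropped first, and the median of an even count is the mean of the two middle values, which is where the .5 entries come from.

```cpp
#include <algorithm>
#include <cassert>
#include <climits>
#include <cstddef>
#include <vector>

// Hypothetical summary of one key for one opponent.
struct KeySummary
{
    int n;          // games in which the event was recorded
    int min;
    double median;
    int max;
};

// values holds one entry per game; INT_MAX marks "event did not occur".
KeySummary summarize(std::vector<int> values)
{
    // Drop the sentinel entries, then sort what remains.
    values.erase(std::remove(values.begin(), values.end(), INT_MAX), values.end());
    std::sort(values.begin(), values.end());

    KeySummary s{static_cast<int>(values.size()), 0, 0.0, 0};
    if (values.empty()) return s;   // shown as "-" in the table

    s.min = values.front();
    s.max = values.back();
    const std::size_t mid = values.size() / 2;
    s.median = values.size() % 2 == 1
        ? values[mid]
        : (values[mid - 1] + values[mid]) / 2.0;
    return s;
}
```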

As you might expect after deep contemplation of the nature of reality, only protoss makes dark templar or proxy pylons, and only zerg makes mutalisks. Nothing interesting was recorded for the terran opponents.

Notice that UAlbertaBot sometimes makes dark templar much earlier than the no-data 7300 frame default time; the others do not. DaQin is recorded as twice placing a proxy pylon in Stardust’s main. I didn’t think it ever did that. I guess it’s a holdover from the Locutus proxy pylon play, to trick opponents into overreacting? DaQin made DTs in most games, and McRave went mutalisks in most games. FreshMeat is recorded as having made a mutalisk (or more than one) in exactly one game, which seems unusual.