machine learning - 3 | Starcraft AI blog

AIIDE 2020 - Dragon versus Ecgberht

Two posts today, to cover the newly available Ecgberht pairings. Neither post has much meat to it.

dragon strategies versus ecgberht strategies

	overall	14CC	BioMechGreedyFE	FullMech	ProxyBBS	ProxyEightRax
overall	141/150 94%	28/28 100%	27/28 96%	25/25 100%	36/44 82%	25/25 100%
1rax fe	5/6 83%	1/1 100%	1/1 100%	1/1 100%	2/3 67%	-
bio	136/144 94%	27/27 100%	26/27 96%	24/24 100%	34/41 83%	25/25 100%

I was curious about Dragon’s pattern of seemingly giving up on “1rax fe” (barracks expand) after a single loss, so I looked at the file. In fact Dragon played “bio” as the regular build the whole time, throwing in “1rax fe” occasionally for spice. The “1rax fe” loss was not the last “1rax fe” game, but the second to last.

For Ecgberht, when one build is producing nearly all the wins, probably you should play it more often than 30% of the time. You may not want to play it every game, because that makes it easy for the opponent to adapt—mixing it up is good. Maybe 50% of the time would be better, given this number of alternatives? To know for sure, I guess we’d have to test against a range of bots to see the overall effectiveness of learning.

dragon as seen by ecgberht

dragon played	#	ecgberht recognized
1rax fe	6	6 Unknown
bio	144	144 Unknown

Nothing to see here. Move along.

ecgberht as seen by dragon

Dragon does not record its idea of the opponent’s build. If it has one.

AIIDE 2020 - BananaBrain versus Ecgberht

bananabrain strategies versus ecgberht strategies

	overall	14CC	FullMech	JoyORush	MechGreedyFE	ProxyEightRax
overall	148/150 99%	31/31 100%	28/28 100%	28/28 100%	28/28 100%	33/35 94%
PvT_10/12gate	10/10 100%	3/3 100%	-	1/1 100%	3/3 100%	3/3 100%
PvT_10/15gate	10/10 100%	2/2 100%	3/3 100%	1/1 100%	1/1 100%	3/3 100%
PvT_12nexus	10/10 100%	3/3 100%	3/3 100%	2/2 100%	1/1 100%	1/1 100%
PvT_1gatedtexpo	10/10 100%	1/1 100%	3/3 100%	2/2 100%	4/4 100%	-
PvT_1gatereaver	10/10 100%	2/2 100%	3/3 100%	2/2 100%	1/1 100%	2/2 100%
PvT_28nexus	10/10 100%	3/3 100%	2/2 100%	1/1 100%	2/2 100%	2/2 100%
PvT_2gatedt	11/11 100%	2/2 100%	4/4 100%	1/1 100%	2/2 100%	2/2 100%
PvT_2gaterngexpo	10/10 100%	3/3 100%	3/3 100%	-	1/1 100%	3/3 100%
PvT_32nexus	10/10 100%	1/1 100%	1/1 100%	2/2 100%	3/3 100%	3/3 100%
PvT_9/9gate	10/10 100%	2/2 100%	-	3/3 100%	-	5/5 100%
PvT_9/9proxygate	10/10 100%	4/4 100%	3/3 100%	1/1 100%	1/1 100%	1/1 100%
PvT_bulldog	10/10 100%	1/1 100%	1/1 100%	5/5 100%	1/1 100%	2/2 100%
PvT_dtdrop	10/10 100%	1/1 100%	-	3/3 100%	3/3 100%	3/3 100%
PvT_proxydt	7/9 78%	1/1 100%	1/1 100%	3/3 100%	2/2 100%	0/2 0%
PvT_stove	10/10 100%	2/2 100%	1/1 100%	1/1 100%	3/3 100%	3/3 100%

We can see exactly how Ecgberht scored its total of 2 wins: It happened to play a fast proxy when BananaBrain played a slow proxy. For BananaBrain, maybe the lesson is to avoid risky openings versus much weaker opponents. As a general principle, I suggest saving risky builds for games where you have a high risk of losing with safe play—in that case, why not?

bananabrain as seen by ecgberht

bananabrain played	#	ecgberht recognized
PvT_10/12gate	10	7 ZealotRush \| 3 Unknown
PvT_10/15gate	10	10 Unknown
PvT_12nexus	10	9 ProtossFE \| 1 Unknown
PvT_1gatedtexpo	10	10 Unknown
PvT_1gatereaver	10	10 Unknown
PvT_28nexus	10	10 Unknown
PvT_2gatedt	11	11 Unknown
PvT_2gaterngexpo	10	10 Unknown
PvT_32nexus	10	10 Unknown
PvT_9/9gate	10	7 ZealotRush \| 3 Unknown
PvT_9/9proxygate	10	8 Unknown \| 2 CannonRush
PvT_bulldog	10	10 Unknown
PvT_dtdrop	10	10 Unknown
PvT_proxydt	9	9 Unknown
PvT_stove	10	10 Unknown

Except for a couple cases of CannonRush, the builds that Ecgberht recognized were named correctly. I imagine that it interpreted CannonRush as “something proxied.”

ecgberht as seen by bananabrain

ecgberht played	#	bananabrain recognized
14CC	31	21 T_fastexpand \| 6 T_unknown \| 4 T_2rax
FullMech	28	21 T_unknown \| 6 T_1fac \| 1 T_2fac
JoyORush	28	23 T_2fac \| 3 T_unknown \| 2 T_1fac
MechGreedyFE	28	25 T_unknown \| 3 T_2rax
ProxyEightRax	35	35 T_unknown

As we’ve seen before, BananaBrain has little skill in recognizing terran builds.

AIIDE 2020 - what Ecgberht learned

I added parsing code for Ecgberht’s JSON format learning files. I had to refactor for generality, and it added complexity, but I can use the parser for more than one purpose. Today I summarize the contents of its history files.

Ecgberht I think is a complex and interesting bot. It played up to 5 different strategies in each matchup, though the selection of the 5 varied by matchup. Sometimes it played fewer. Against most opponents Ecgberht played its strategies at roughly equal rates—except for the strategies it didn’t play at all. Ecgberht uses UCB with a high exploration rate. The strategy manager in the source lists 15 strategies (plus one more played only on the map Plasma and named PlasmaWraithHell), so it did not play everything it knows. I made a quick scan through the source for opponent-specific preparation, and did find some, but for bots in the tournament only ZZZKBot is affected (it is flagged by a zergling rush check; some bots that always zealot rush are flagged for that). I didn’t dig deep enough to find out why Ecgberht ignores so many of its available strategies.

Ecgberht tries to recognize the opponent’s strategy, but often finds itself unsure. It recorded a high rate of Unknown enemy plans. The ones it does recognize are drawn from a small set that seems to me well-chosen.

Ecgberht recorded fewer than 150 games for 5 of its 11 opponents, although it completed all games with no crashes. In total, 7 games do not appear in the game records of the history files. Maybe it has a cleanup bug that bites occasionally?

#1 stardust

opening	games	wins	first	last
14CC	31	0%	3	147
FullMech	28	0%	0	148
JoyORush	27	0%	2	143
MechGreedyFE	27	0%	4	146
ProxyEightRax	36	6%	1	141
5 openings	149	1%

enemy	games	wins
Unknown	149	1%
1 opening	149	1%

A couple wins against the top player is not bad.

#2 purplewave

opening	games	wins	first	last
14CC	35	3%	3	148
FullMech	29	0%	0	149
JoyORush	28	0%	2	146
MechGreedyFE	28	0%	4	147
ProxyEightRax	30	0%	1	142
5 openings	150	1%

enemy	games	wins
ProtossFE	7	0%
Unknown	143	1%
2 openings	150	1%

#3 bananabrain

opening	games	wins	first	last
14CC	31	0%	3	146
FullMech	28	0%	0	144
JoyORush	28	0%	2	147
MechGreedyFE	28	0%	4	148
ProxyEightRax	35	6%	1	149
5 openings	150	1%

enemy	games	wins
CannonRush	2	0%
ProtossFE	9	0%
Unknown	125	2%
ZealotRush	14	0%
4 openings	150	1%

#4 dragon

opening	games	wins	first	last
14CC	28	0%	3	148
BioMechGreedyFE	28	4%	4	144
FullMech	25	0%	0	146
ProxyBBS	44	18%	2	149
ProxyEightRax	25	0%	1	147
5 openings	150	6%

enemy	games	wins
Unknown	150	6%
1 opening	150	6%

#5 mcrave

opening	games	wins	first	last
14CC	28	7%	7	147
BioGreedyFE	51	29%	0	145
ProxyEightRax	47	26%	21	140
TwoPortWraith	22	5%	3	146
4 openings	148	20%

enemy	games	wins
FastHatch	61	16%
NinePool	13	31%
Unknown	74	22%
3 openings	148	20%

Ecgberht put up its strongest fight against zerg.

#6 microwave

opening	games	wins	first	last
14CC	32	9%	4	145
BioGreedyFE	21	0%	0	148
FullBioFE	24	4%	3	146
ProxyEightRax	52	27%	1	147
TwoPortWraith	20	0%	2	138
5 openings	149	12%

enemy	games	wins
FastHatch	99	4%
NinePool	5	40%
Unknown	45	27%
3 openings	149	12%

#7 steamhammer

opening	games	wins	first	last
14CC	34	12%	8	147
BioGreedyFE	36	17%	0	142
ProxyEightRax	36	14%	1	141
TwoPortWraith	43	23%	4	148
4 openings	149	17%

enemy	games	wins
EarlyPool	4	0%
FastHatch	22	32%
NinePool	81	14%
Unknown	42	17%
4 openings	149	17%

#8 daqin

opening	games	wins	first	last
14CC	32	0%	8	148
FullMech	29	0%	0	149
JoyORush	28	0%	4	144
MechGreedyFE	28	0%	43	147
ProxyEightRax	33	3%	1	141
5 openings	150	1%

enemy	games	wins
Unknown	150	1%
1 opening	150	1%

#9 zzzkbot

opening	games	wins	first	last
FullBio	150	71%	0	149
1 opening	150	71%

enemy	games	wins
EarlyPool	150	71%
1 opening	150	71%

Ecgberht upset ZZZKBot, possibly aided by its hardcoded knowledge of how ZZZKBot plays.

#10 ualbertabot

opening	games	wins	first	last
FullBio	58	43%	0	144
FullMech	52	38%	2	145
ProxyBBS	40	32%	1	149
3 openings	150	39%

enemy	games	wins
BioPush	11	91%
EarlyPool	12	50%
MechRush	9	33%
Unknown	104	24%
ZealotRush	14	100%
5 openings	150	39%

#11 willyt

opening	games	wins	first	last
14CC	31	3%	68	148
FullMech	34	9%	0	147
ProxyEightRax	85	41%	2	149
3 openings	150	26%

enemy	games	wins
BioPush	34	15%
Unknown	116	29%
2 openings	150	26%

#13 eggbot

opening	games	wins	first	last
FullMech	148	94%	0	147
1 opening	148	94%

enemy	games	wins
CannonRush	94	95%
Unknown	54	93%
2 openings	148	94%

AIIDE 2020 - Microwave versus BananaBrain

This is the last matchup I can analyze this way without writing more parsing code. McRave did ask for more in a comment, though, so I may do that. All the matchups have featured BananaBrain.

Microwave plays a large number of strategies, so I put it on the left side. Blue is good for Microwave, red is good for BananaBrain.

microwave strategies versus bananabrain strategies

	overall	PvZ_10/12gate	PvZ_1basespeedzeal	PvZ_2basespeedzeal	PvZ_4gate2archon	PvZ_5gategoon	PvZ_9/9gate	PvZ_9/9proxygate	PvZ_bisu	PvZ_neobisu	PvZ_sairdt	PvZ_sairgoon	PvZ_sairreaver	PvZ_stove
overall	58/150 39%	5/17 29%	3/19 16%	4/11 36%	4/9 44%	4/7 57%	5/11 45%	5/12 42%	4/14 29%	4/10 40%	5/10 50%	6/11 55%	4/9 44%	5/10 50%
10Hatch9Pool9gas	0/2 0%	-	-	-	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-
10HatchMain9Pool9Gas	0/1 0%	-	-	-	-	-	-	-	0/1 0%	-	-	-	-	-
11HatchTurtleHydra	0/1 0%	-	-	-	-	-	-	-	-	0/1 0%	-	-	-	-
12Hatch	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
12PoolMain	22/43 51%	0/5 0%	0/9 0%	2/2 100%	3/3 100%	3/3 100%	0/1 0%	1/3 33%	2/2 100%	3/3 100%	0/3 0%	2/3 67%	4/4 100%	2/2 100%
12PoolMuta	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
1HatchMuta_Sparkle	0/1 0%	-	-	-	-	-	-	0/1 0%	-	-	-	-	-	-
2HatchMuta	1/5 20%	-	-	1/1 100%	-	-	0/1 0%	-	0/1 0%	-	-	-	0/1 0%	0/1 0%
3HatchHydraBust	0/1 0%	-	-	-	-	-	-	-	0/1 0%	-	-	-	-	-
3HatchHydra_BHG	0/1 0%	-	-	0/1 0%	-	-	-	-	-	-	-	-	-	-
3HatchLingBust	2/6 33%	-	0/1 0%	0/1 0%	-	-	1/1 100%	0/1 0%	-	-	-	1/1 100%	-	0/1 0%
3HatchMuta	0/1 0%	-	-	-	-	-	-	-	-	0/1 0%	-	-	-	-
3HatchPoolHydraExpo	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
4HatchBeforeGas	0/1 0%	-	-	-	-	-	-	-	-	-	-	0/1 0%	-	-
4HatchPoolHydra	0/2 0%	-	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-
4PoolHard	2/6 33%	-	1/1 100%	0/1 0%	-	-	1/1 100%	-	0/1 0%	-	-	-	-	0/2 0%
4PoolSoft	0/1 0%	-	0/1 0%	-	-	-	-	-	-	-	-	-	-	-
6Pool	0/1 0%	-	0/1 0%	-	-	-	-	-	-	-	-	-	-	-
7Pool	0/1 0%	-	-	-	-	-	-	-	-	-	0/1 0%	-	-	-
8Pool	0/1 0%	-	-	-	-	-	-	-	-	0/1 0%	-	-	-	-
8PoolHydraRush8D	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
9PoolGasHatchSpeed8D	12/18 67%	2/2 100%	2/2 100%	-	1/2 50%	0/1 0%	1/1 100%	0/2 0%	1/1 100%	1/1 100%	1/1 100%	1/2 50%	0/1 0%	2/2 100%
9PoolHatchGasSpeed7D	0/1 0%	-	-	-	0/1 0%	-	-	-	-	-	-	-	-	-
9PoolHatchGasSpeed8D	17/32 53%	3/4 75%	0/1 0%	1/1 100%	0/1 0%	0/1 0%	2/4 50%	4/5 80%	1/5 20%	0/1 0%	4/4 100%	2/2 100%	0/2 0%	0/1 0%
9PoolSpeed	0/3 0%	0/1 0%	-	-	0/1 0%	-	-	-	-	-	-	0/1 0%	-	-
9PoolSpeedLing	1/5 20%	-	-	-	-	-	0/1 0%	-	0/1 0%	-	-	0/1 0%	0/1 0%	1/1 100%
9PoolSunkHatch	0/1 0%	-	-	0/1 0%	-	-	-	-	-	-	-	-	-	-
Overpool	0/1 0%	0/1 0%	-	-	-	-	-	-	-	-	-	-	-	-
OverpoolSpeed	0/3 0%	-	0/1 0%	0/1 0%	-	-	-	-	0/1 0%	-	-	-	-	-
ZvP_10Hatch9Pool	1/3 33%	-	0/1 0%	0/1 0%	-	1/1 100%	-	-	-	-	-	-	-	-
ZvP_11Hatch10Pool	0/1 0%	-	-	-	-	-	-	-	-	0/1 0%	-	-	-	-
ZvZ_Overgas9Pool	0/1 0%	-	-	-	-	-	-	-	-	0/1 0%	-	-	-	-
ZvZ_Overpool11Gas	0/2 0%	-	-	-	-	-	0/1 0%	-	-	-	0/1 0%	-	-	-

This table looks even more scattered than yesterday’s BananaBrain-Dragon table, but to me it tells a story of duelling learning algorithms. Microwave found a few builds that countered BananaBrain’s preferred play, and BananaBrain did not shift its responses far enough to entirely squelch them.

microwave as seen by bananabrain

microwave played	#	bananabrain recognized
10Hatch9Pool9gas	2	2 Z_10hatch
10HatchMain9Pool9Gas	1	1 Z_10hatch
11HatchTurtleHydra	1	1 Z_12hatch
12Hatch	1	1 Z_12hatch
12PoolMain	43	36 Z_12pool \| 5 Z_10hatch \| 2 Z_unknown
12PoolMuta	1	1 Z_10hatch
1HatchMuta_Sparkle	1	1 Z_unknown
2HatchMuta	5	5 Z_12hatch
3HatchHydraBust	1	1 Z_12hatch
3HatchHydra_BHG	1	1 Z_10hatch
3HatchLingBust	6	6 Z_12hatch
3HatchMuta	1	1 Z_12hatch
3HatchPoolHydraExpo	1	1 Z_12hatch
4HatchBeforeGas	1	1 Z_12hatch
4HatchPoolHydra	2	2 Z_12hatch
4PoolHard	6	6 Z_4/5pool
4PoolSoft	1	1 Z_4/5pool
6Pool	1	1 Z_4/5pool
7Pool	1	1 Z_9pool
8Pool	1	1 Z_9pool
8PoolHydraRush8D	1	1 Z_9pool
9PoolGasHatchSpeed8D	18	15 Z_9pool \| 3 Z_overpool
9PoolHatchGasSpeed7D	1	1 Z_9pool
9PoolHatchGasSpeed8D	32	29 Z_9pool \| 3 Z_overpool
9PoolSpeed	3	2 Z_9poolspeed \| 1 Z_9pool
9PoolSpeedLing	5	5 Z_9poolspeed
9PoolSunkHatch	1	1 Z_9pool
Overpool	1	1 Z_overpool
OverpoolSpeed	3	3 Z_overpool
ZvP_10Hatch9Pool	3	3 Z_10hatch
ZvP_11Hatch10Pool	1	1 Z_12hatch
ZvZ_Overgas9Pool	1	1 Z_12pool
ZvZ_Overpool11Gas	2	2 Z_overpool

BananaBrain was accurate at reading Microwave’s initial build. Lumping 11 hatch with 12 hatch is fine, they’re very similar. 12 pool can be difficult to distinguish from 10 hatch, if you scout it late after the second hatchery finishes. It would be useful to better separate 9 pool from overpool, which are significantly different in effect, but it requires close attention to detail. Overall, highly accurate readings with only one wide miss, seeing the overgas 9 pool as 12 pool—and that is a ZvZ build that is extremely rare in ZvP.

It makes quite a contrast with yesterday’s BananaBrain-Dragon analysis, where BananaBrain barely recognized terran builds.

bananabrain as seen by microwave

bananabrain played	#	microwave recognized
PvZ_10/12gate	17	13 HeavyRush \| 3 Unknown \| 1 NakedExpand
PvZ_1basespeedzeal	19	14 Unknown \| 5 HeavyRush
PvZ_2basespeedzeal	11	4 NakedExpand \| 3 Turtle \| 3 SafeExpand \| 1 HeavyRush
PvZ_4gate2archon	9	4 NakedExpand \| 4 SafeExpand \| 1 HeavyRush
PvZ_5gategoon	7	6 NakedExpand \| 1 HeavyRush
PvZ_9/9gate	11	9 HeavyRush \| 2 Unknown
PvZ_9/9proxygate	12	6 HeavyRush \| 6 Unknown
PvZ_bisu	14	6 SafeExpand \| 4 NakedExpand \| 2 Turtle \| 1 HeavyRush \| 1 Unknown
PvZ_neobisu	10	4 NakedExpand \| 3 SafeExpand \| 2 Turtle \| 1 HeavyRush
PvZ_sairdt	10	8 Unknown \| 2 HeavyRush
PvZ_sairgoon	11	7 NakedExpand \| 1 SafeExpand \| 1 Turtle \| 1 Unknown \| 1 HeavyRush
PvZ_sairreaver	9	4 SafeExpand \| 3 NakedExpand \| 2 Turtle
PvZ_stove	10	7 Unknown \| 3 HeavyRush

Microwave borrowed Steamhammer’s rather crude classification of enemy plans (which was still far in the future when Microwaved forked from Steamhammer). It was intended to be minimal, just enough to allow for basic reactions, to hold the fort until I could raise enough troops to make a sally. Microwave’s recognitions look similar to Steamhammer’s, with the right general tendency but many sloppy variations (which I think are due mostly to weak scouting, with a contribution from overlapping recognition rules).

It’s striking that some recognitions—of dubious accuracy—are dark blue in stark contrast to their neighbors. It gives me the impression that Microwave makes use of the recognized enemy plan, in some cases to good effect. It suggests that more accurate recognition, if the reactions are also good, could be a major improvement.

AIIDE 2020 - BananaBrain versus Dragon

Of the 4 bots I’m prepared to run this analysis on, this is the only pairing involving Dragon. Dragon did not record all 150 games against either McRave or Microwave. Like yesterday, all win rates and coloring are from the point of view of BananaBrain: Blue is good for BananaBrain, red is good for Dragon.

bananabrain strategies versus dragon strategies

	overall	1rax fe	2rax bio	2rax mech	bio	dirty worker rush	mass vulture	siege expand
overall	67/150 45%	6/14 43%	6/11 55%	8/15 53%	15/37 41%	3/3 100%	22/56 39%	7/14 50%
PvT_10/12gate	12/17 71%	2/3 67%	-	2/3 67%	4/4 100%	-	3/6 50%	1/1 100%
PvT_10/15gate	5/12 42%	-	2/2 100%	1/5 20%	1/3 33%	-	1/2 50%	-
PvT_12nexus	1/8 12%	1/2 50%	-	-	0/1 0%	-	0/3 0%	0/2 0%
PvT_1gatedtexpo	3/7 43%	1/2 50%	-	-	0/1 0%	-	2/4 50%	-
PvT_1gatereaver	0/5 0%	-	0/1 0%	-	0/2 0%	-	0/2 0%	-
PvT_28nexus	5/11 45%	0/2 0%	0/1 0%	0/2 0%	1/1 100%	-	4/5 80%	-
PvT_2gatedt	3/9 33%	0/1 0%	-	1/1 100%	0/2 0%	-	0/3 0%	2/2 100%
PvT_2gaterngexpo	2/7 29%	-	0/1 0%	-	1/1 100%	1/1 100%	0/4 0%	-
PvT_32nexus	2/8 25%	-	-	-	1/4 25%	1/1 100%	0/2 0%	0/1 0%
PvT_9/9gate	14/18 78%	-	2/3 67%	-	4/4 100%	1/1 100%	7/9 78%	0/1 0%
PvT_9/9proxygate	8/14 57%	1/1 100%	1/1 100%	3/3 100%	0/2 0%	-	2/6 33%	1/1 100%
PvT_bulldog	0/6 0%	0/1 0%	-	-	0/3 0%	-	0/1 0%	0/1 0%
PvT_dtdrop	2/8 25%	-	1/1 100%	-	0/4 0%	-	1/2 50%	0/1 0%
PvT_proxydt	10/14 71%	1/1 100%	-	1/1 100%	3/3 100%	-	2/5 40%	3/4 75%
PvT_stove	0/6 0%	0/1 0%	0/1 0%	-	0/2 0%	-	0/2 0%	-

Not one table cell has more than 9 games in it. Neither bot successfully predicted what the other would play, if it even tried: BananaBrain is unpredictable and Dragon changes its choice frequently when losing, and besides BananaBrain is poor at recognizing terran plans. So the strategy x strategy cross is a hash. To me the table means that, at least for this pairing, reactions during the game were more important than the initial choice of strategy. Neither side had a way to choose a counter beforehand.

bananabrain as seen by dragon

Dragon does not record a recognized opponent strategy. Its history files have only its own strategy and whether it won.

dragon as seen by bananabrain

dragon played	#	bananabrain recognized
1rax fe	14	13 T_unknown \| 1 T_fastexpand
2rax bio	11	8 T_unknown \| 2 T_fastexpand \| 1 T_1fac
2rax mech	15	14 T_unknown \| 1 T_1fac
bio	37	35 T_unknown \| 1 T_1fac \| 1 T_fastexpand
dirty worker rush	3	3 T_unknown
mass vulture	56	30 T_1fac \| 26 T_unknown
siege expand	14	9 T_unknown \| 5 T_1fac

We knew that BananaBrain struggles to recognize terran strategies. Maybe the author has not spent effort on it because it doesn’t affect results much? In any case, given how Dragon plays, with its love of fast expansions and mixed tech, the terran builds that are recognized probably represent truths about the games. It’s not clear that they are helpful truths, though, because they say so little about what happened.

From the coloring, it looks as though there was little relationship between whether BananaBrain recognized Dragon’s build and whether BananaBrain won. That is consistent with the theory that the author decided it didn’t matter.

AIIDE 2020 - BananaBrain versus McRave

If both bots in a pairing write history files, and both record all 150 games of the tournament, then the history files can be aligned and we can compare what the bots were thinking in each game. So far, between the limitations of the data and the limitations of my script, I’m only ready to do that for a few pairings. Dragon in particular often did not record all 150 games, and I’d rather not try to align game records when there are gaps in the histories (there is enough data to do it programmatically, but it’s a pain and risks errors). Also my script depends on parsing out data into a specific format, and it is only implemented for 4 bots so far (#3 BananaBrain, #4 Dragon, #5 McRave, #6 Microwave—alphabetical order and their finishing order were the same).

Today is BananaBrain versus McRave. The first BananaBrain line in its file about McRave:

2020-10-09 20:56:04,2,(2)Destination.scx,PvZ_9/9proxygate,Z_overpool,7.6,1

The first McRave line in its history file about BananaBrain (we’re told it doesn’t use this data in games, but it’s there and we can analyze it):

Lost,Destination,7:30,2Gate,Proxy,ZealotRush,PoolHatch,Overpool,2HatchSpeedling,1:21,1:21,1:21,5,Zerg_Larva,30,Zerg_Zergling,15,Zerg_Drone,3,Zerg_Overlord,24,Protoss_Probe,16,Protoss_Zealot,1,Protoss_Corsair Lost,HeartbreakRidge,17:40,2Gate,Main,Corsair,PoolHatch,Overpool,2HatchMuta,2:01,2:01,5:10,2,Zerg_Larva,16,Zerg_Zergling,42,Zerg_Drone,10,Zerg_Overlord,28,Zerg_Mutalisk,18,Zerg_Scourge,89,Protoss_Probe,54,Protoss_Zealot,23,Protoss_Dragoon,1,Protoss_High_Templar,1,Protoss_Shuttle,33,Protoss_Corsair,5,Protoss_Dark_Templar,1,Protoss_Reaver,8,Protoss_Scarab

My script extracts key info from each line so we can compare. BananaBrain played PvZ_9/9proxygate and concluded that McRave answered with Z_overpool, while McRave played PoolHatch,Overpool,2HatchSpeedling and classified what it saw from BananaBrain as 2Gate,Proxy,ZealotRush. In this game, both sides agreed pretty well about what was going on.

bananabrain strategies versus mcrave strategies

This table shows which BananaBrain strategies were successful against which three-part McRave strategies. All the winning rates are from BananaBrain’s point of view. The intersection of the overall row and the overall column says that BananaBrain won 82 out of 150 games throughout the tournament, which can be checked against the official crosstable. The overall row tells how BananaBrain fared against each of McRave’s strategies, which can be checked against my tables of what McRave learned. The overall column tells how each BananaBrain strategy performed, which can be checked against what BananaBrain learned. (Spoiler: All the numbers match.) The center cells are the meat, and show what countered what.

	overall	PoolHatch,Overpool,2HatchMuta	PoolHatch,Overpool,2HatchSpeedling	PoolHatch,Overpool,3HatchSpeedling
overall	82/150 55%	72/131 55%	5/10 50%	5/9 56%
PvZ_10/12gate	9/13 69%	4/4 100%	-	5/9 56%
PvZ_1basespeedzeal	6/12 50%	6/12 50%	-	-
PvZ_2basespeedzeal	3/9 33%	3/9 33%	-	-
PvZ_4gate2archon	1/6 17%	1/6 17%	-	-
PvZ_5gategoon	9/16 56%	9/16 56%	-	-
PvZ_9/9gate	26/27 96%	26/27 96%	-	-
PvZ_9/9proxygate	5/10 50%	-	5/10 50%	-
PvZ_bisu	1/5 20%	1/5 20%	-	-
PvZ_neobisu	5/11 45%	5/11 45%	-	-
PvZ_sairdt	3/10 30%	3/10 30%	-	-
PvZ_sairgoon	11/17 65%	11/17 65%	-	-
PvZ_sairreaver	1/5 20%	1/5 20%	-	-
PvZ_stove	2/9 22%	2/9 22%	-	-

The table makes it plain that 2HatchSpeedling and 3HatchSpeedling were reactions to specific protoss builds, as the author pointed out in a comment. The counter to 10/12 gate at least seems to have been valuable, because McRave lost all 4 games where the 10/12 gate was played but not countered. The 9/9 gate crushed because no counter was played against it; the zealots are a McRave weakness.

bananabrain as seen by mcrave

But wait, there’s more. Both bots recorded not only their own strategy, but the recognized opponent strategy, so we can compare the known strategy of one bot with how the other bot recognized it. Note well: If the recognized strategy looks different than the actual strategy, it is not necessarily a mistake or a scouting miss. The bots may simply be noting different aspects of the game. Only some differences indicate mistakes.

The coloring is from the point of view of BananaBrain. For McRave, red is good and blue is bad.

bananabrain played	#	mcrave recognized
PvZ_10/12gate	13	12 2Gate,Main,Corsair \| 1 2Gate,Main,DT
PvZ_1basespeedzeal	12	8 1GateCore,2Zealot,DT \| 2 2Gate,Main,DT \| 1 2Gate,Main,4Gate \| 1 1GateCore,2Zealot,Corsair
PvZ_2basespeedzeal	9	8 FFE,Forge,Speedlot \| 1 FFE,Gateway,Speedlot
PvZ_4gate2archon	6	2 FFE,Forge,NeoBisu \| 2 FFE,Forge,5GateGoon \| 1 FFE,Forge,ZealotArchon \| 1 FFE,Nexus,NeoBisu
PvZ_5gategoon	16	14 FFE,Forge,5GateGoon \| 2 FFE,Nexus,5GateGoon
PvZ_9/9gate	27	26 2Gate,Main,Corsair \| 1 2Gate,Main,DT
PvZ_9/9proxygate	10	10 2Gate,Proxy,ZealotRush
PvZ_bisu	5	3 FFE,Forge,NeoBisu \| 2 FFE,Nexus,NeoBisu
PvZ_neobisu	11	5 FFE,Forge,NeoBisu \| 4 FFE,Forge,Speedlot \| 2 FFE,Nexus,NeoBisu
PvZ_sairdt	10	10 1GateCore,2Zealot,Corsair
PvZ_sairgoon	17	9 FFE,Forge,NeoBisu \| 2 FFE,Nexus,5GateGoon \| 2 FFE,Nexus,NeoBisu \| 2 FFE,Forge,Unknown \| 1 FFE,Forge,Speedlot \| 1 FFE,Forge,5GateGoon
PvZ_sairreaver	5	5 FFE,Forge,NeoBisu
PvZ_stove	9	9 1GateCore,2Zealot,Corsair

Only 2 games have an Unknown element. Without watching replays, I can’t say that any of McRave’s recognitions are wrong. Seeing PvZ_sairgoon as FFE,Forge,Speedlot could be correct if BananaBrain followed up with zealots in that one game.

I’m not sure what the difference is between FFE,Forge and FFE,Gateway and FFE,Nexus. FFE stands for forge fast expand, which means a forge and a nexus, and then you need a gateway if you’re ever going to make a mobile army, so all three buildings are required. Maybe it’s whatever building McRave saw first.

mcrave as seen by bananabrain

Again, the coloring is from the point of view of BananaBrain.

mcrave played	#	bananabrain recognized
PoolHatch,Overpool,2HatchMuta	131	101 Z_overpool \| 27 Z_9pool \| 3 Z_unknown
PoolHatch,Overpool,2HatchSpeedling	10	9 Z_overpool \| 1 Z_unknown
PoolHatch,Overpool,3HatchSpeedling	9	8 Z_overpool \| 1 Z_9pool

BananaBrain remembered far less detail about the game than McRave. Overpool is only an initial build order which reaches its end at 9 supply and can be followed up with any tech or unit mix whatsoever. If all you know is that the opponent will start with overpool, the only conclusions you can draw are limits on the opponent’s tech timings and economy. On the other hand, if you do know more about the opponent’s play, can you use the information productively?

more?

I could generate more tables. Various tables showing recognized strategies might make sense. If at least one bot of the pair records the map for each game, it would be easy to break down strategies by map. Is there any particular breakdown you’d like to see?

Update: I added coloring to the “as seen by” tables, to show how win rates vary depending on what the bots recognized.

AIIDE 2020 - what DaQin learned

Holdover bot DaQin is based on Locutus and writes game records in a format close to old-style Steamhammer game records. The same script parses both Locutus and DaQin files, and is only slightly different from my original Steamhammer code. But DaQin plays a more restricted set of builds.

#1 stardust

opening	games	wins
2GateDT	6	0%
3GateDT	133	8%
4GateGoon	11	0%
3 openings	150	7%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	112	75%	7%	89	59%	3%	58%	1%
Fast rush	11	7%	0%	16	11%	6%	0%	0%
Not fast rush	26	17%	12%	44	29%	16%	23%	0%
Unknown	1	1%	0%	1	1%	0%	0%	0%

timing	#	median	early	late
gas steal attempt	63	1:43	0:46	1:47
gas steal success	2	-	-	-
enemy scout	150	1:55	1:17	2:34
enemy combat units	150	3:07	2:22	5:42
enemy air units	126	8:02	5:35	12:26
enemy cloaked units	11	8:13	7:06	22:38

It’s interesting that DaQin settled on a dark templar strategy. DaQin seems poor at recognizing the enemy strategy. Given these classes, I think all games would have been best classified as Not fast rush.

#2 purplewave

opening	games	wins
2GateDT	7	43%
3GateDT	3	0%
4GateGoon	140	11%
3 openings	150	13%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	12	8%	33%	13	9%	23%	50%	0%
Fast rush	137	91%	11%	136	91%	11%	95%	0%
Naked expand		-	-	1	1%	100%	0%	0%
Unknown	1	1%	0%		-	-	0%	0%

timing	#	median	early	late
gas steal attempt	8	0:46	0:45	0:47
gas steal success	5	-	-	-
enemy scout	150	2:07	1:18	3:47
enemy combat units	150	3:02	2:21	7:50
enemy air units	14	6:07	3:41	19:42
enemy cloaked units	93	6:47	5:14	20:05

BananaBrain chose 4 gate goon as best against both Stardust and PurpleWave. DaQin liked the dragoons only against PurpleWave. This table doesn’t say so, but 2GateDT was tried on the tournament’s 3rd round out of 150 and won then, while 4GateGoon lost on its first outing. I would have to read the code to decipher the strange seeming preference for dragoons.

#3 bananabrain

opening	games	wins
2GateDT	14	36%
3GateDT	89	40%
4GateGoon	47	21%
3 openings	150	34%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	65	43%	34%	72	48%	40%	43%	2%
Fast rush	79	53%	35%	63	42%	25%	41%	3%
Naked expand	2	1%	0%	3	2%	0%	0%	0%
Not fast rush	1	1%	0%	4	3%	75%	0%	100%
Proxy	2	1%	50%	3	2%	0%	0%	0%
Safe expand		-	-	1	1%	100%	0%	0%
Unknown	1	1%	0%	4	3%	50%	0%	0%

timing	#	median	early	late
gas steal attempt	60	1:42	0:46	1:47
gas steal success	9	-	-	-
enemy scout	150	1:58	0:53	3:11
enemy combat units	150	2:57	2:18	5:50
enemy air units	103	7:45	3:45	15:31
enemy cloaked units	38	6:11	5:29	22:03

#4 dragon

opening	games	wins
12NexusCarriers	106	58%
3GateDT	9	33%
4GateGoon	27	15%
DTDrop	8	25%
4 openings	150	47%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	41	27%	56%	29	19%	62%	44%	15%
Naked expand	10	7%	30%	15	10%	67%	10%	10%
Not fast rush	35	23%	63%	24	16%	12%	3%	20%
Proxy	26	17%	15%	24	16%	21%	0%	8%
Safe expand	35	23%	46%	31	21%	48%	26%	20%
Unknown	2	1%	100%	24	16%	67%	0%	50%
Worker rush	1	1%	0%	3	2%	100%	0%	0%

timing	#	median	early	late
gas steal attempt	122	2:18	0:46	3:18
gas steal success	9	-	-	-
enemy scout	141	2:15	1:06	19:29
enemy combat units	148	2:59	2:30	6:37
enemy air units	132	9:29	7:58	14:01
enemy cloaked units	127	9:55	8:18	17:26

The carriers were successful versus Dragon. DaQin attempted to steal gas in almost all games, and rarely succeeded. The tables don’t show enough information to tell whether the attempts were worth it.

#5 mcrave

opening	games	wins
ForgeExpand5GateGoon	3	0%
ForgeExpandSpeedlots	147	36%
2 openings	150	35%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush	14	9%	43%	28	19%	57%	14%	7%
Not fast rush	132	88%	36%	117	78%	31%	78%	2%
Unknown	4	3%	0%	5	3%	20%	0%	25%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	150	2:39	0:53	5:35
enemy combat units	150	3:02	2:25	6:17
enemy air units	150	6:21	5:23	7:01
enemy cloaked units	131	11:02	9:30	15:13

#6 microwave

opening	games	wins
4GateGoon	11	73%
ForgeExpand5GateGoon	3	0%
ForgeExpandSpeedlots	136	12%
3 openings	150	17%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	1	1%	0%	1	1%	100%	0%	0%
Heavy rush	13	9%	8%	30	20%	17%	15%	0%
Not fast rush	124	83%	13%	103	69%	17%	69%	6%
Proxy	11	7%	73%	8	5%	12%	18%	0%
Unknown	1	1%	0%	8	5%	12%	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	146	2:42	1:23	10:18
enemy combat units	150	3:56	2:47	6:45
enemy air units	133	6:17	4:43	16:55
enemy cloaked units	50	10:06	5:49	11:58

Another puzzling choice of a seemingly less-successful strategy...

#7 steamhammer

opening	games	wins
ForgeExpand5GateGoon	136	79%
ForgeExpandSpeedlots	14	71%
2 openings	150	78%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush		-	-	4	3%	100%	0%	0%
Heavy rush	27	18%	89%	33	22%	85%	15%	4%
Hydra bust	8	5%	62%	2	1%	100%	0%	0%
Not fast rush	114	76%	77%	104	69%	74%	68%	5%
Unknown	1	1%	0%	7	5%	86%	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	127	2:11	0:59	36:17
enemy combat units	150	3:15	1:55	6:29
enemy air units	33	8:06	4:39	27:42
enemy cloaked units	33	12:02	4:54	23:23

... but not against Steamhammer. Steamhammer is far more skillful at fighting zealots than dragoons, and DaQin’s choice was correct here. (I expect that Steamhammer would have scored well against the speed zealots if DaQin had stuck to them.) Was Steamhammer’s poor showing relative to Microwave partly due to a mistake by DaQin that only happened versus Microwave?

#9 zzzkbot

opening	games	wins
ForgeExpand5GateGoon	147	10%
ForgeExpandSpeedlots	3	0%
2 openings	150	9%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush		-	-	5	3%	100%	0%	0%
Heavy rush	143	95%	6%	144	96%	6%	99%	1%
Unknown	7	5%	71%	1	1%	0%	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	147	3:45	0:51	5:39
enemy combat units	150	2:47	1:46	3:49
enemy air units	9	7:47	7:43	8:21
enemy cloaked units	0	-	-	-

#10 ualbertabot

opening	games	wins
12NexusCarriers	1	0%
3GateDT	34	68%
4GateGoon	32	53%
DTDrop	1	0%
ForgeExpand5GateGoon	81	78%
5 openings	149	69%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	36	24%	61%	35	23%	60%	22%	0%
Factory	1	1%	100%	16	11%	94%	100%	0%
Fast rush	81	54%	78%	52	35%	69%	37%	1%
Heavy rush	3	2%	67%	6	4%	83%	0%	0%
Hydra bust		-	-	1	1%	100%	0%	0%
Naked expand		-	-	1	1%	100%	0%	0%
Not fast rush	19	13%	68%	26	17%	73%	5%	5%
Proxy	8	5%	25%	10	7%	30%	0%	0%
Unknown	1	1%	0%	2	1%	100%	0%	0%

timing	#	median	early	late
gas steal attempt	39	1:42	0:46	1:48
gas steal success	11	-	-	-
enemy scout	138	1:41	1:14	8:25
enemy combat units	147	3:57	1:34	6:54
enemy air units	7	6:54	5:50	15:14
enemy cloaked units	19	5:07	4:31	5:15

#11 willyt

opening	games	wins
12NexusCarriers	3	67%
3GateDT	147	97%
2 openings	150	96%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory		-	-	2	1%	100%	0%	0%
Heavy rush	11	7%	82%	8	5%	100%	0%	27%
Not fast rush	10	7%	100%	8	5%	100%	0%	40%
Safe expand	123	82%	98%	90	60%	94%	60%	27%
Unknown	6	4%	83%	42	28%	98%	0%	33%

timing	#	median	early	late
gas steal attempt	139	1:43	1:39	2:21
gas steal success	53	-	-	-
enemy scout	149	1:53	1:43	3:55
enemy combat units	150	3:05	2:38	5:06
enemy air units	65	17:30	8:27	29:58
enemy cloaked units	37	15:07	11:43	21:49

#12 ecgberht

opening	games	wins
12NexusCarriers	2	50%
3GateDT	148	100%
2 openings	150	99%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	88	59%	100%	54	36%	100%	23%	14%
Fast rush	24	16%	100%	27	18%	96%	17%	0%
Naked expand		-	-	2	1%	100%	0%	0%
Not fast rush	5	3%	100%	14	9%	100%	20%	0%
Proxy		-	-	1	1%	100%	0%	0%
Safe expand	31	21%	100%	37	25%	100%	19%	10%
Unknown	2	1%	50%	15	10%	100%	0%	0%

timing	#	median	early	late
gas steal attempt	126	1:43	1:38	2:21
gas steal success	29	-	-	-
enemy scout	140	1:35	0:34	3:55
enemy combat units	150	3:43	2:05	5:45
enemy air units	46	7:29	5:59	9:31
enemy cloaked units	15	8:41	7:35	9:54

#13 eggbot

opening	games	wins
3GateDT	16	94%
4GateGoon	134	97%
2 openings	150	97%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	104	69%	95%	90	60%	98%	58%	0%
Proxy	42	28%	100%	60	40%	95%	36%	0%
Unknown	4	3%	100%		-	-	0%	0%

timing	#	median	early	late
gas steal attempt	58	0:46	0:46	0:49
gas steal success	54	-	-	-
enemy scout	121	0:46	0:35	3:41
enemy combat units	29	6:34	4:39	22:57
enemy air units	0	-	-	-
enemy cloaked units	0	-	-	-

overall

	total		PvT		PvP		PvZ		PvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
12NexusCarriers	112	57%	111	58%					1	0%
2GateDT	27	30%			27	30%
3GateDT	579	65%	304	96%	241	26%			34	68%
4GateGoon	402	46%	27	15%	332	47%	11	73%	32	53%
DTDrop	9	22%	8	25%					1	0%
ForgeExpand5GateGoon	370	50%					289	42%	81	78%
ForgeExpandSpeedlots	300	27%					300	27%
total	1799	50%	450	81%	600	38%	600	35%	149	69%
openings played	7		4		3		3		5

AIIDE 2020 - what BananaBrain learned

#3 BananaBrain had pre-trained data, 3000 (!) games versus Stardust and 100 each against PurpleWave, Dragon, McRave, Microwave, and DaQin. Other opponents did not rate. The main tables here include only tournament games, not pre-training games. BananaBrain scored higher in training games than in tournament games against every opponent except the carryover DaQin (training 61% of 100 games, tournament 66% of 150 games, statistically close enough). It looks as though BananaBrain might have won the tournament if it had played against the same versions of opponents that it trained against. I take it as a sign that secret tournament improvements may be worth it.

A couple conclusions: 1. Enemy strategy recognition seemed to have some misfires with terran opponents. 2. BananaBrain would have scored slightly better if it had never played the Stove. It’s a for-fun build more than a for-real build—though bots are often poor at adaptation, so perhaps on BASIL it is more effective than here.

#1 stardust

opening	games	wins	first	last
PvP_10/12gate	10	40%	12	119
PvP_12nexus	2	0%	11	66
PvP_2gatedt	6	17%	28	145
PvP_2gatedtexpo	7	0%	0	140
PvP_2gatereaver	29	24%	3	149
PvP_3gaterobo	4	25%	50	106
PvP_3gatespeedzeal	3	0%	4	65
PvP_4gategoon	26	65%	31	144
PvP_9/9gate	4	0%	29	146
PvP_9/9proxygate	17	41%	8	142
PvP_nzcore	2	0%	2	68
PvP_zcore	1	0%	118	118
PvP_zcorez	5	40%	42	117
PvP_zzcore	34	53%	5	148
14 openings	150	38%

enemy	games	wins
P_1gatecore	23	35%
P_2gate	4	75%
P_2gatefast	10	30%
P_4gategoon	94	44%
P_unknown	19	11%
5 openings	150	38%

BananaBrain was the only bot to leave a dent in Stardust. Its plan recognition was not precise enough to pin down Stardust’s 4 gate goon strategy consistently, likely because Stardust ejected the bananascout before the build was complete. Still, BananaBrain’s 4 gate goon opener defeated Stardust’s 4 gate goons 65% of the time in 26 games. BananaBrain is deliberately unpredictable, but against this opponent consistency may have been better: Stardust played without learning, so its opponents should have sought the single best opening as an answer rather than the best mix of openings. It’s possible that #3 BananaBrain could have upset Stardust if it had done that, though it would not have gained enough wins to pass #2 PurpleWave.

opening	games	wins	first	last
PvP_10/12gate	60	3%	11	2934
PvP_12nexus	56	0%	4	2915
PvP_2gatedt	112	31%	8	2978
PvP_2gatedtexpo	278	53%	5	2921
PvP_2gatereaver	1110	66%	6	2988
PvP_3gaterobo	60	3%	2	2935
PvP_3gatespeedzeal	116	26%	7	2916
PvP_4gategoon	56	0%	3	2917
PvP_9/9gate	136	36%	12	2995
PvP_9/9proxygate	437	61%	13	2936
PvP_nzcore	56	0%	1	2918
PvP_zcore	102	21%	0	2989
PvP_zcorez	72	12%	9	2986
PvP_zzcore	349	67%	10	2999
14 openings	3000	51%

This is the table of pre-trained games. It looks different from the tournament table; the overall score and the individual results by strategy do not match up. The training may have been against an older version of Stardust on the Starcraft AI Ladder, or it may have been against Locutus, which was wrapped around an encrypted Stardust binary on SSCAIT and played instead of Stardust if it didn’t have its encryption key.

The misleading training cannot have helped BananaBrain’s results. It did adapt, but notice that BananaBrain’s most effective counter of 4 gate goon was only played 26 times in 150 games, less often than P_2gatereaver and P_zzcore which scored highest in training.

#2 purplewave

opening	games	wins	first	last
PvP_10/12gate	7	14%	1	146
PvP_12nexus	7	14%	2	125
PvP_2gatedt	11	36%	22	142
PvP_2gatedtexpo	10	40%	36	141
PvP_2gatereaver	9	22%	20	144
PvP_3gaterobo	13	23%	3	148
PvP_3gatespeedzeal	9	33%	9	108
PvP_4gategoon	16	69%	4	137
PvP_9/9gate	9	22%	13	128
PvP_9/9proxygate	9	22%	10	147
PvP_nzcore	14	43%	7	149
PvP_zcore	7	14%	29	115
PvP_zcorez	12	42%	0	143
PvP_zzcore	17	65%	5	139
14 openings	150	37%

enemy	games	wins
P_1gatecore	106	38%
P_2gate	5	80%
P_2gatefast	3	67%
P_4gategoon	14	36%
P_ffe	1	0%
P_unknown	21	24%
6 openings	150	37%

Again, 4 gate goon was BananaBrain’s best, though zealot-zealot-core was statistically equal. PurpleWave learns, so mixing it up was likely correct. Judging by the names, the last 4 builds here, the ones that mention “core”, are nonspecific tech builds that might aim for any tech and unit mix in the midgame. Do the names accurately describe them?

Again the training data was somewhat misleading (52% win rate in 100 training games and entirely different best counters), but BananaBrain tried its strategies in a fairly even distribution so I think it made little difference in this case. Presumably 100 games of training provide a smaller bias than 3000 games.

#4 dragon

opening	games	wins	first	last
PvT_10/12gate	17	71%	8	145
PvT_10/15gate	12	42%	15	147
PvT_12nexus	8	12%	16	132
PvT_1gatedtexpo	7	43%	36	134
PvT_1gatereaver	5	0%	10	118
PvT_28nexus	11	45%	25	149
PvT_2gatedt	9	33%	2	128
PvT_2gaterngexpo	7	29%	32	131
PvT_32nexus	8	25%	12	124
PvT_9/9gate	18	78%	7	146
PvT_9/9proxygate	14	57%	4	148
PvT_bulldog	6	0%	5	120
PvT_dtdrop	8	25%	0	117
PvT_proxydt	14	71%	1	140
PvT_stove	6	0%	11	125
15 openings	150	45%

enemy	games	wins
T_1fac	38	42%
T_fastexpand	4	75%
T_unknown	108	44%
3 openings	150	45%

BananaBrain could not recognize most of Dragon’s builds. It looks like Dragon was vulnerable to mass zealots, an important weakness, and to hidden dark templar, which any terran might die to if unscouted. But what is the Stove doing there? Does it work against decent terran bots? Maybe the key is that there are few mid-rank terran bots, they are mostly in the upper and lower tiers.

67% overall win rate in training, but this time BananaBrain’s best counters matched between training and the tournament. The training helped, even though it was against a weaker version of Dragon.

#5 mcrave

opening	games	wins	first	last
PvZ_10/12gate	13	69%	24	144
PvZ_1basespeedzeal	12	50%	8	133
PvZ_2basespeedzeal	9	33%	25	148
PvZ_4gate2archon	6	17%	29	132
PvZ_5gategoon	16	56%	7	129
PvZ_9/9gate	27	96%	1	145
PvZ_9/9proxygate	10	50%	0	147
PvZ_bisu	5	20%	19	87
PvZ_neobisu	11	45%	5	149
PvZ_sairdt	10	30%	4	141
PvZ_sairgoon	17	65%	3	146
PvZ_sairreaver	5	20%	26	91
PvZ_stove	9	22%	2	140
13 openings	150	55%

enemy	games	wins
Z_9pool	28	82%
Z_overpool	118	47%
Z_unknown	4	75%
3 openings	150	55%

According to McRave’s tables, McRave played overpool every game against BananaBrain. BananaBrain was able to correctly recognize that most of the time, but strangely won at a higher rate when recognition failed. Does BananaBrain perhaps have a reaction to overpool which was detrimental in this case? Or was BananaBrain perfectly right, and McRave sometimes slips up in its build order and goes 9 pool instead of overpool? I think it’s more likely that BananaBrain misrecognized it.

#6 microwave

opening	games	wins	first	last
PvZ_10/12gate	17	71%	6	148
PvZ_1basespeedzeal	19	84%	2	141
PvZ_2basespeedzeal	11	64%	1	145
PvZ_4gate2archon	9	56%	4	147
PvZ_5gategoon	7	43%	20	125
PvZ_9/9gate	11	55%	19	123
PvZ_9/9proxygate	12	58%	0	130
PvZ_bisu	14	71%	5	138
PvZ_neobisu	10	60%	11	121
PvZ_sairdt	10	50%	8	146
PvZ_sairgoon	11	45%	3	149
PvZ_sairreaver	9	56%	7	143
PvZ_stove	10	50%	17	136
13 openings	150	61%

enemy	games	wins
Z_10hatch	13	77%
Z_12hatch	20	85%
Z_12pool	37	49%
Z_4/5pool	8	75%
Z_9pool	50	52%
Z_9poolspeed	7	86%
Z_overpool	12	58%
Z_unknown	3	67%
8 openings	150	61%

BananaBrain’s openings scored roughly similarly against Microwave, only a couple below 50% and only one above 75%. I think that argues that Microwave is well balanced in its skills.

#7 steamhammer

opening	games	wins	first	last
PvZ_10/12gate	17	100%	4	148
PvZ_1basespeedzeal	13	77%	8	124
PvZ_2basespeedzeal	11	73%	11	130
PvZ_4gate2archon	9	33%	6	140
PvZ_5gategoon	9	44%	19	147
PvZ_9/9gate	17	100%	27	149
PvZ_9/9proxygate	16	94%	5	142
PvZ_bisu	8	38%	2	145
PvZ_neobisu	6	0%	18	129
PvZ_sairdt	14	93%	0	137
PvZ_sairgoon	11	64%	7	146
PvZ_sairreaver	10	40%	14	144
PvZ_stove	9	56%	1	139
13 openings	150	71%

enemy	games	wins
Z_10hatch	34	53%
Z_12hatch	87	69%
Z_12pool	11	91%
Z_4/5pool	2	100%
Z_9pool	8	100%
Z_overpool	3	100%
Z_unknown	5	100%
7 openings	150	71%

The 2 gate zealot openings whomped Steamhammer. Steamhammer survives the zealot attack in many cases, but usually by spending more than it can afford so that it falls far behind in economy.

#8 daqin

opening	games	wins	first	last
PvP_10/12gate	10	70%	15	140
PvP_12nexus	7	57%	49	146
PvP_2gatedt	16	88%	8	147
PvP_2gatedtexpo	14	71%	9	133
PvP_2gatereaver	16	81%	3	149
PvP_3gaterobo	13	62%	4	138
PvP_3gatespeedzeal	7	29%	11	121
PvP_4gategoon	8	50%	32	118
PvP_9/9gate	16	94%	16	148
PvP_9/9proxygate	10	60%	1	135
PvP_nzcore	11	64%	10	139
PvP_zcore	7	43%	0	130
PvP_zcorez	7	29%	2	137
PvP_zzcore	8	50%	31	145
14 openings	150	66%

enemy	games	wins
P_1gatecore	66	68%
P_4gategoon	68	60%
P_ffe	1	100%
P_unknown	15	80%
4 openings	150	66%

#9 zzzkbot

opening	games	wins	first	last
PvZ_10/12gate	12	100%	6	133
PvZ_1basespeedzeal	12	100%	25	148
PvZ_2basespeedzeal	14	100%	2	149
PvZ_4gate2archon	11	91%	3	144
PvZ_5gategoon	12	83%	0	142
PvZ_9/9gate	12	100%	21	135
PvZ_9/9proxygate	9	78%	4	146
PvZ_bisu	11	91%	17	145
PvZ_neobisu	12	100%	10	143
PvZ_sairdt	12	100%	5	147
PvZ_sairgoon	12	100%	18	136
PvZ_sairreaver	10	80%	1	132
PvZ_stove	11	91%	12	141
13 openings	150	94%

enemy	games	wins
Z_4/5pool	99	98%
Z_9pool	48	85%
Z_overpool	2	100%
Z_unknown	1	100%
4 openings	150	94%

ZZZKBot is the first opponent that BananaBrain outclassed. It did not much matter what protoss played.

#10 ualbertabot

opening	games	wins	first	last
PvU_10/12gate	21	90%	2	145
PvU_9/9gate	22	91%	4	147
PvU_9/9proxygate	18	72%	1	133
PvU_ffe	23	91%	0	146
PvU_nzcore	20	85%	10	139
PvU_zcore	26	96%	3	149
PvU_zzcore	20	85%	9	148
7 openings	150	88%

enemy	games	wins
P_1gatecore	21	95%
P_2gate	5	80%
P_2gatefast	17	94%
P_4gategoon	2	100%
P_unknown	2	0%
T_2fac	22	100%
T_2rax	18	94%
T_unknown	15	100%
T_wallin	1	100%
Z_12hatch	14	93%
Z_4/5pool	32	66%
Z_unknown	1	100%
12 openings	150	88%

It’s interesting that BananaBrain recognized a terran wall in one game. UAlbertaBot does not know how to build a wall, and in fact places its early buildings near the command center. Terran is also the opponent race with the highest number of unrecognized (“unknown”) builds. BananaBrain suffered recognition trouble against Dragon too.

#11 willyt

opening	games	wins	first	last
PvT_10/12gate	12	100%	6	148
PvT_10/15gate	11	100%	12	129
PvT_12nexus	9	78%	8	146
PvT_1gatedtexpo	10	80%	7	139
PvT_1gatereaver	10	80%	14	143
PvT_28nexus	8	62%	5	144
PvT_2gatedt	11	73%	1	145
PvT_2gaterngexpo	9	78%	29	127
PvT_32nexus	12	100%	0	142
PvT_9/9gate	12	100%	2	149
PvT_9/9proxygate	13	100%	3	147
PvT_bulldog	9	78%	4	141
PvT_dtdrop	9	78%	19	108
PvT_proxydt	9	67%	25	133
PvT_stove	6	50%	22	131
15 openings	150	84%

enemy	games	wins
T_1fac	8	88%
T_2rax	52	96%
T_fastexpand	26	77%
T_unknown	64	77%
4 openings	150	84%

The Stove did not work well against WillyT.

#12 ecgberht

opening	games	wins	first	last
PvT_10/12gate	10	100%	2	127
PvT_10/15gate	10	100%	6	137
PvT_12nexus	10	100%	24	129
PvT_1gatedtexpo	10	100%	9	148
PvT_1gatereaver	10	100%	11	131
PvT_28nexus	10	100%	15	139
PvT_2gatedt	11	100%	12	142
PvT_2gaterngexpo	10	100%	1	140
PvT_32nexus	10	100%	10	125
PvT_9/9gate	10	100%	18	147
PvT_9/9proxygate	10	100%	0	145
PvT_bulldog	10	100%	25	149
PvT_dtdrop	10	100%	8	143
PvT_proxydt	9	78%	17	146
PvT_stove	10	100%	3	144
15 openings	150	99%

enemy	games	wins
T_1fac	8	100%
T_2fac	24	100%
T_2rax	7	100%
T_fastexpand	21	100%
T_unknown	90	98%
5 openings	150	99%

The Stove did as well as anything against Ecgberht. But then, only one opening had any losses (it had 2 losses).

#13 eggbot

opening	games	wins	first	last
PvP_10/12gate	12	100%	1	145
PvP_12nexus	11	100%	16	142
PvP_2gatedt	12	100%	19	148
PvP_2gatedtexpo	12	92%	10	137
PvP_2gatereaver	10	100%	27	122
PvP_3gaterobo	9	89%	12	104
PvP_3gatespeedzeal	12	100%	0	146
PvP_4gategoon	12	100%	3	149
PvP_9/9gate	12	100%	2	143
PvP_9/9proxygate	8	75%	22	136
PvP_nzcore	9	89%	13	130
PvP_zcore	11	100%	5	147
PvP_zcorez	11	100%	8	138
PvP_zzcore	9	89%	11	131
14 openings	150	96%

enemy	games	wins
P_cannonrush	147	96%
P_proxygate	3	100%
2 openings	150	96%

AIIDE 2020 - what Steamhammer learned

Steamhammer’s tables are fuller than others, partly because Steamhammer records more information than most bots and partly because I put more effort into analyzing my own bot’s results. Even so, the game records contain far more information than is summarized here. I have many features in mind that I’d like to add to my analysis scripts.

Some bottom line findings: 1. My preparation for specific opponents was largely successful, as far as it went. All prepared openings had at least a fair win rate, and most were among the most successful openings Steamhammer discovered throughout the tournament. 2. Steamhammer’s gas steal skill had value against many opponents, even though my first analysis of the skill suggested that it might not. (I have known for a long time that my first analysis had missed the truth.) It was able to judge pretty accurately whether stealing gas was effective versus a given opponent, so it could exploit it when successful and reduce the cost of trying it when unsuccessful.

#1 stardust

opening	games	wins	first	last
10HatchBurrow	18	6%	96	146
10HatchHydra	1	0%	49	49
10Pool9Hatch	1	0%	34	34
11HatchTurtleHydra	7	0%	3	79
11HatchTurtleMuta	1	0%	111	111
12-11HatchLing	1	0%	60	60
12HatchTurtle	4	0%	13	29
2x10HatchAllIn	2	0%	53	147
2x10HatchSlow	1	0%	38	38
3HatchHydra	1	0%	0	0
3HatchHydraBust	1	0%	58	58
3HatchLateHydras+1	1	0%	59	59
3HatchLingExpo	1	0%	54	54
3HatchLurker	1	0%	104	104
4HatchBeforeLair	1	0%	101	101
4PoolHard	1	0%	1	1
4Scout	2	0%	43	48
5HatchBeforeGas	1	0%	70	70
5HatchPoolLing	1	0%	148	148
5HatchPoolLingBurrow	1	0%	120	120
5PoolHard2Player	2	0%	36	118
5Scout	1	0%	45	45
7HatchSpeed	3	0%	41	117
8DroneGas	1	0%	32	32
8Gas7PoolLurker B	1	0%	51	51
8Hatch7Pool	2	0%	47	139
8Hatch7PoolBurrow	1	0%	63	63
8Hatch7PoolSpeed	1	0%	35	35
9Hatch8Pool	2	0%	67	71
9HatchMain9Pool9Gas	2	0%	50	62
9PoolHatch	1	0%	99	99
9PoolHatchBurrow	1	0%	46	46
9PoolHatchSpeed	1	0%	69	69
9PoolHatchSpeedSpire	2	0%	37	73
9PoolSpeedAllIn	1	0%	33	33
9PoolSpireSlowlings	1	0%	55	55
AntiTyrLurker	1	0%	66	66
AntiZeal_12Hatch	7	0%	8	138
GuardianRush	1	0%	40	40
Over10Hatch	2	0%	6	83
Over10Hatch1Sunk	3	0%	2	24
Over10Hatch2Sunk	4	0%	5	16
Over10Hatch2SunkHard	2	0%	14	52
Over10HatchBurrow	1	0%	105	105
Over10HatchBust	29	7%	74	145
Over10HatchHydra	2	0%	86	91
Over10HatchSlowLings	2	0%	4	20
OverhatchLateGas	2	0%	77	95
OverpoolHydra	1	0%	39	39
OverpoolSpeed	1	0%	126	126
OverpoolSunk	1	0%	56	56
OverpoolTurtle	5	0%	11	134
Overpool_3HatchLing	1	0%	149	149
Overpool_4HatchLing	2	0%	57	68
PurpleSwarmBuild	2	0%	129	140
QueenRush	1	0%	144	144
Sparkle 3HatchMuta	1	0%	137	137
ZvP_2HatchFakeMuta	1	0%	130	130
ZvP_4HatchPoolHydra	2	0%	44	64
ZvP_Overpool3Hatch	1	0%	72	72
ZvT_2HatchMuta	1	0%	65	65
ZvZ_12HatchMain	2	0%	61	142
ZvZ_12PoolLingB	1	0%	31	31
63 openings	150	2%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush	148	99%	2%	101	67%	2%	68%	32%
Unknown	2	1%	0%	49	33%	2%	0%	50%

timing	#	median	early	late
my combat unit	150	3:02	1:51	5:35
my gas	148	3:23	1:15	7:50
enemy scout	150	1:57	1:17	2:41
enemy combat unit	150	2:38	2:21	4:47
enemy gas	150	4:16	3:59	8:20
enemy air unit	5	9:49	7:47	11:21
enemy cloaked unit	5	9:49	7:47	11:21
game duration	150	8:14	6:16	18:36

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	15	2:13	1:48	3:35	0%	4:25
gas steal success	11	2:09	2:00	2:27	0%	4:27
none or failed	139	-	-	-	2%	4:15
gas steal killed	11	2:44	2:38	2:59

Steamhammer managed 3 wins with 2 different openings, but basically nothing worked despite trying the full range from 4 pool to hive rush. Stealing gas did not help, but the situation was desperate so Steamhammer tried it 10% of the time. An interesting number is the enemy scout timing: The scout probe arrived at the zerg base with more consistent timing (arriving in a narrower time window) than against any other opponent. Part of Stardust’s recipe is good scouting.

#2 purplewave

opening	games	wins	first	last
11Gas10PoolLurker	1	0%	43	43
11Gas10PoolMuta	1	0%	65	65
11HatchTurtleHydra	3	0%	1	89
11HatchTurtleMuta	2	0%	14	19
12-11Hatch	2	0%	46	56
12Hatch12Pool	1	0%	81	81
12HatchTurtle	13	15%	10	147
2HatchLingAllInSpire	1	0%	69	69
2HatchLurkerAllIn	2	0%	25	84
2HatchLurkerPure	13	15%	85	148
2HatchMutaPure	1	0%	128	128
2x10HatchSlow	1	0%	111	111
3HatchHydra	1	0%	140	140
3HatchHydraExpo	2	0%	24	67
3HatchLing	3	0%	0	75
3HatchLingBust2	1	0%	55	55
3HatchLingExpo	2	0%	74	138
3HatchLurker	1	0%	60	60
5HatchBeforeGas	2	0%	116	144
5HatchPoolLing	1	0%	68	68
5PoolHard2Player	1	0%	146	146
6PoolHide	1	0%	118	118
6PoolSpeed	2	0%	58	62
6Scout	1	0%	149	149
7DroneHatch	1	0%	141	141
8-8HydraRush	1	0%	109	109
8Gas7PoolLurker B	1	0%	108	108
8Hatch7Pool	15	13%	37	127
973HydraBust	1	0%	104	104
9Hatch8Pool	1	0%	72	72
9HatchExpo9Pool9Gas	7	14%	51	143
9PoolExpo	1	0%	48	48
9PoolHatch	3	0%	34	119
9PoolHatchSpeed7DroneB	1	0%	66	66
9PoolHatchSpeedAllInB	2	0%	88	96
9PoolHatchSpeedSpire	1	0%	31	31
9PoolSpeedAllIn	1	0%	53	53
9PoolSunkHatch	1	0%	98	98
AntiFact_13Pool	1	0%	45	45
AntiFact_2Hatch	1	0%	61	61
AntiFactoryHydra	1	0%	73	73
AntiWraith_2Hatch	1	0%	114	114
AntiZeal_12Hatch	11	0%	4	112
Over10Hatch	4	0%	5	18
Over10Hatch+1	1	0%	71	71
Over10Hatch11Pool	1	0%	79	79
Over10Hatch1Sunk	1	0%	9	9
Over10Hatch2Sunk	1	0%	16	16
Over10Hatch2SunkHard	4	0%	3	100
Over10HatchBust	1	0%	23	23
Over10HatchHydra	1	0%	80	80
OverhatchExpoLing	1	0%	121	121
OverhatchExpoMuta	4	0%	8	28
OverhatchLateGas	1	0%	29	29
Overpool+1	1	0%	115	115
Overpool2HatchLurker	1	0%	110	110
OverpoolHatch	1	0%	64	64
OverpoolHide	1	0%	93	93
OverpoolSpeed	3	0%	33	82
OverpoolTurtle	3	0%	2	13
OverpoolTurtle 0	1	0%	42	42
Overpool_3HatchLing	1	0%	106	106
ZvP_2HatchMuta	1	0%	63	63
ZvP_3BaseSpire+Den	1	0%	102	102
ZvP_3HatchMuta	1	0%	49	49
ZvP_3HatchPoolHydra	1	0%	101	101
ZvT_13Pool	1	0%	99	99
ZvZ_Overgas8Pool	1	0%	97	97
ZvZ_Overpool11Gas	1	0%	70	70
69 openings	150	5%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush		-	-	6	4%	0%	0%	0%
Heavy rush	133	89%	5%	71	47%	4%	47%	26%
Naked expand		-	-	2	1%	100%	0%	0%
Safe expand	15	10%	7%	22	15%	0%	7%	27%
Turtle	1	1%	0%	10	7%	0%	0%	0%
Unknown	1	1%	0%	39	26%	5%	0%	0%

timing	#	median	early	late
my combat unit	150	3:06	1:53	7:55
my gas	150	3:25	1:14	11:39
enemy scout	150	2:11	1:19	14:11
enemy combat unit	149	2:55	2:18	6:22
enemy gas	147	5:11	3:51	8:40
enemy air unit	102	11:07	4:39	16:47
enemy cloaked unit	87	11:42	7:15	16:58
game duration	150	13:09	5:46	25:41

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	15	2:13	1:58	3:00	0%	5:10
gas steal success	10	2:22	2:18	2:41	0%	5:16
none or failed	140	-	-	-	5%	5:11
gas steal killed	10	3:05	2:53	5:08

Steamhammer again desperately tried every tech and timing known to zerg, and this time found a little success with a few of them—a sunken build, a lurker build, and two 2-hatch zergling wave builds. PurpleWave largely stuck with 2 gates. Stealing gas barely delayed Protoss from taking gas, but Steamhammer still tried 10% of the time—if you’re losing almost all games, it doesn’t hurt.

#3 bananabrain

opening	games	wins	first	last
11HatchTurtleHydra	27	30%	2	149
12-12Hatch	1	0%	63	63
2HatchHydra	1	0%	15	15
3HatchHydraExpo	1	0%	55	55
3HatchLateHydras+1	1	0%	1	1
3HatchLingBurrow	18	28%	5	139
3HatchLingBust2	29	38%	4	140
4HatchBeforeGas	1	0%	9	9
4PoolHard	1	0%	10	10
5PoolHard	1	0%	143	143
973HydraBust	1	0%	0	0
9PoolHatchSpeedAllInB	1	0%	57	57
AntiFact_13Pool	24	38%	3	137
DefilerRush	3	33%	144	147
OverhatchExpoLing	1	0%	61	61
OverhatchExpoMuta	1	0%	94	94
OverhatchLateGas	1	0%	98	98
OverpoolSunk	1	0%	110	110
Overpool_3HatchLing	1	0%	39	39
Sparkle 2HatchMuta	1	0%	48	48
ZvP_3HatchPoolHydra	10	20%	26	120
ZvT_12PoolMuta	8	12%	46	146
ZvT_13Pool	15	47%	108	148
ZvT_3HatchMuta	1	0%	81	81
24 openings	150	29%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	12	8%	33%	7	5%	0%	17%	42%
Heavy rush	42	28%	21%	30	20%	10%	19%	26%
Naked expand	4	3%	25%	5	3%	80%	0%	25%
Proxy		-	-	5	3%	0%	0%	0%
Safe expand	71	47%	31%	37	25%	38%	21%	31%
Turtle	21	14%	38%	18	12%	67%	10%	43%
Unknown		-	-	48	32%	23%	0%	0%

timing	#	median	early	late
my combat unit	150	3:11	1:47	4:11
my gas	145	3:21	2:13	6:42
enemy scout	150	1:45	1:15	6:55
enemy combat unit	150	3:09	2:19	6:37
enemy gas	139	5:22	2:47	9:41
enemy air unit	131	6:02	2:47	11:18
enemy cloaked unit	96	7:11	3:26	14:06
game duration	150	10:22	4:41	32:00

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	65	2:13	1:48	3:23	37%	5:19
gas steal success	46	2:19	1:54	2:40	39%	6:04
none or failed	104	-	-	-	25%	5:07
gas steal killed	46	5:47	2:45	20:00

BananaBrain was the first opponent weak enough that Steamhammer was able to find answers and did not feel a need to throw in the kitchen sink. 2 hatch muta builds were the most successful (and the best was one specialized for play against terran), but zergling builds and hydra builds were also OK. I love that the defiler rush won a game (it’s not at all a practical opening, but against a bot...). Unlike Stardust and PurpleWave, unpredictable BananaBrain played a variety of builds—for example, the appearance of an enemy cloaked unit at 3:26 in one game suggests a dark templar rush. The gas steal was successful and delayed protoss from taking gas for nearly an entire minute, so Steamhammer went with it frequently.

#4 dragon

opening	games	wins	first	last
2HatchLingAllInSpire	8	50%	35	140
3HatchLurker	16	56%	0	148
4PoolSoft	5	20%	17	47
5HatchPool	59	85%	5	146
5PoolHard2Player	1	0%	1	1
6PoolSpeed	1	0%	4	4
9PoolSpeedAllIn	2	0%	2	3
UltraRush	3	33%	142	149
ZvT_3HatchMuta	7	71%	121	147
ZvT_3HatchMutaExpo	40	78%	49	145
ZvT_7Pool	8	50%	53	84
11 openings	150	70%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	56	37%	70%	36	24%	53%	46%	25%
Heavy rush	2	1%	50%	1	1%	100%	0%	50%
Naked expand	17	11%	82%	15	10%	93%	18%	35%
Safe expand	12	8%	33%	12	8%	83%	8%	58%
Turtle	1	1%	100%	1	1%	100%	0%	0%
Unknown		-	-	46	31%	74%	0%	0%
Worker rush	62	41%	74%	39	26%	67%	42%	29%

timing	#	median	early	late
my combat unit	141	3:17	1:47	7:58
my gas	113	3:47	1:47	10:51
enemy scout	149	2:15	1:05	6:47
enemy combat unit	121	2:39	2:21	8:45
enemy gas	118	6:02	2:43	11:45
enemy air unit	108	9:15	4:22	17:01
enemy cloaked unit	85	9:54	5:43	16:34
game duration	150	15:07	3:01	37:21

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	39	2:21	1:56	4:20	72%	6:02
gas steal success	17	2:22	2:14	3:06	65%	7:07
none or failed	133	-	-	-	71%	5:52
gas steal killed	17	4:26	3:18	5:08

Steamhammer upset Dragon. The success of 5 hatcheries before spawning pool means that Dragon was not aggressive early, to its detriment—or that Steamhammer recognized any early aggression as a rush and reacted correctly. 3 hatchery mutalisk builds were also good. Even 3 hatchery lurker was not bad, though on the face it appears unsuited to counter Dragon’s play style. The gas steal was not successful in terms of win rate, but Steamhammer noticed that it delayed Dragon’s gas for a long time so it tried anyway. Fewer than half the attempts to steal gas ended with the extractor successfully made, though. The 37 minute game (here 37:21, officially 37:22) was Steamhammer’s longest game of the tournament, except for a couple games versus EggBot that went to the 60 minute limit and had to be adjudicated on points.

#5 mcrave

opening	games	wins	first	last
12PoolLurker	1	0%	55	55
3HatchLingBurrow	5	20%	97	126
8DroneGas	11	64%	106	148
9HatchMain9Pool9Gas	2	0%	52	138
9PoolHatchSpeedAllInB	1	0%	19	19
9PoolSpire	2	0%	59	95
Over10HatchBust	19	42%	62	149
Over10PoolLing	1	0%	0	0
OverpoolSpeed	15	20%	5	135
OverpoolSunk	21	38%	30	128
OverpoolTurtle	23	48%	66	139
ZvP_3HatchMuta	1	0%	134	134
ZvZ_12HatchExpo	1	0%	7	7
ZvZ_Overgas9Pool	2	0%	1	3
ZvZ_OverpoolTurtle	45	58%	2	146
15 openings	150	43%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush	20	13%	35%	8	5%	50%	0%	70%
Naked expand	118	79%	45%	33	22%	42%	21%	69%
Turtle	9	6%	44%	4	3%	25%	0%	78%
Unknown	2	1%	0%	104	69%	42%	0%	50%
Worker rush	1	1%	0%	1	1%	100%	0%	0%

timing	#	median	early	late
my combat unit	150	2:26	2:13	3:15
my gas	146	2:32	1:31	3:30
enemy scout	150	2:47	0:37	5:51
enemy combat unit	150	2:34	1:11	6:03
enemy gas	130	3:23	2:32	11:09
enemy air unit	123	4:31	3:37	10:10
enemy cloaked unit	5	15:06	11:45	24:23
game duration	150	10:58	3:41	29:42

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	10	2:00	1:54	2:48	30%	3:15
gas steal success	0	-	-	-	-	-
none or failed	150	-	-	-	43%	3:23
gas steal killed	0	-	-	-

Steamhammer failed to recognize McRave’s build in 2/3 of games, though we know from yeterday’s post exactly what they were, and that they should not have been hard to understand. I put more effort into recognizing rushes and proxies that change the whole course of the game than into regular builds, and the result is that Steamhammer’s understanding and prediction are sometimes spectacularly unhelpful. The 8DroneGas build is actually a 9 pool which makes a second hatchery; it’s called that because it ends up with 8 drones while flooding zerglings (it’s not the Styx build, but related). 5 games had cloaked units; I think that means that McRave researched burrow, because it does use burrow and does not seem to ever make lurkers. Steamhammer was unable to steal gas in 10 attempts, so it gave up trying.

#6 microwave

opening	games	wins	first	last
6PoolBurrow	1	0%	50	50
8-8HydraRush	1	0%	131	131
9Hatch8Pool	5	20%	68	87
9PoolHatchSpeedSpire	1	0%	60	60
OverhatchLing	1	0%	2	2
OverpoolBurrow	1	0%	121	121
ZvZ_12HatchExpo	5	40%	117	141
ZvZ_12PoolLing	11	64%	0	149
ZvZ_12PoolMain	3	33%	1	71
ZvZ_Overpool11Gas	44	73%	3	147
ZvZ_Overpool9Gas	64	89%	8	148
ZvZ_OverpoolTurtle	13	54%	9	140
12 openings	150	71%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	12	8%	67%	8	5%	88%	8%	67%
Heavy rush	44	29%	75%	32	21%	91%	20%	32%
Naked expand	94	63%	70%	42	28%	67%	23%	45%
Turtle		-	-	4	3%	75%	0%	0%
Unknown		-	-	64	43%	62%	0%	0%

timing	#	median	early	late
my combat unit	150	2:26	1:57	3:14
my gas	150	2:06	1:38	2:45
enemy scout	150	2:25	1:17	4:53
enemy combat unit	150	2:41	1:49	5:25
enemy gas	129	5:21	2:34	9:37
enemy air unit	115	7:34	3:46	15:33
enemy cloaked unit	0	-	-	-
game duration	150	13:32	3:41	30:36

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	6	1:58	1:50	2:20	0%	5:04
gas steal success	1	2:21	2:21	2:21	0%	6:40
none or failed	149	-	-	-	72%	5:16
gas steal killed	1	3:16	3:16	3:16

Steamhammer found a variety of overpool openings to be best. Overpool is flexible; those openings play out quite differently. Microwave didn’t play 5 pool often because it didn’t work: Steamhammer’s rush recognition is improved this year, and it reacted well. The burrow openings and 8-8 dawn hydra rush are not appropriate for ZvZ; those choices could be improved. Unlike McRave, Microwave did not use burrow. Stealing gas failed, so Steamhammer gave it up quickly.

#8 daqin

opening	games	wins	first	last
10HatchLing	1	0%	139	139
11Gas10PoolLurker	1	0%	63	63
12-12Hatch	1	0%	42	42
12Hatch_4HatchLing	2	0%	82	126
2.5HatchMuta	1	0%	99	99
2HatchHydraBust	2	0%	5	131
3HatchHydra	2	0%	20	113
3HatchHydraBust	3	0%	4	97
3HatchHydraExpo	1	0%	53	53
3HatchLateHydras+1	1	0%	107	107
3HatchLing	59	44%	0	147
3HatchLingBust2	2	0%	22	69
4HatchBeforeGas	25	20%	13	144
4HatchBeforeLair	1	0%	142	142
5HatchBeforeGas	2	0%	1	2
5HatchPool	1	0%	128	128
5PoolHard2Player	1	0%	66	66
5Scout	1	0%	93	93
973HydraBust	4	0%	3	73
9Pool8GasLurker	1	0%	91	91
9PoolHatchSpeed	1	0%	38	38
9PoolHatchSpeedSpire2	1	0%	114	114
9PoolHatchSpire	1	0%	67	67
9PoolSpireSlowlings	1	0%	31	31
9PoolSunkHatch	1	0%	92	92
AntiFact_2Hatch	1	0%	87	87
AntiFact_Overpool9Gas	1	0%	141	141
AntiFactory2	1	0%	116	116
Over10Hatch1Sunk	1	0%	76	76
OverhatchExpoMuta	3	0%	17	47
OverpoolSpeed	1	0%	72	72
OverpoolTurtle 0	2	0%	106	146
Proxy8HatchNatural	1	0%	41	41
Sparkle 3HatchMuta	6	17%	120	136
ZvP_2HatchMuta	1	0%	25	25
ZvP_3BaseSpire+Den	1	0%	115	115
ZvP_3HatchPoolHydra	7	14%	133	149
ZvT_2HatchMuta	1	0%	57	57
ZvT_3HatchMuta	1	0%	43	43
ZvT_7Pool	1	0%	59	59
ZvZ_12PoolLing	1	0%	77	77
ZvZ_12PoolLingB	2	0%	103	129
ZvZ_Overpool11Gas	1	0%	88	88
43 openings	150	22%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush		-	-	10	7%	20%	0%	0%
Naked expand		-	-	5	3%	80%	0%	0%
Safe expand	35	23%	11%	48	32%	21%	37%	6%
Turtle	114	76%	25%	85	57%	20%	59%	0%
Unknown	1	1%	100%	2	1%	0%	0%	0%

timing	#	median	early	late
my combat unit	150	3:11	1:51	5:01
my gas	150	3:22	1:47	5:53
enemy scout	149	1:30	1:15	11:34
enemy combat unit	150	4:49	4:06	6:51
enemy gas	148	5:42	4:43	8:31
enemy air unit	38	13:15	9:10	26:30
enemy cloaked unit	48	12:11	4:43	17:37
game duration	150	11:16	6:22	36:26

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	31	2:14	1:58	3:06	35%	6:16
gas steal success	31	2:22	2:08	3:13	35%	6:16
none or failed	119	-	-	-	18%	5:32
gas steal killed	31	2:53	2:38	13:06

DaQin upset Steamhammer badly, and zerg fell back on exploring widely. One ling bust and a few macro builds were able to save some games against DaQin’s consistent forge expand (which Steamhammer often misrecognized as Turtle because it did not actively look for the expansion nexus). The gas steal is measured to increase the overall win rate from 18% to 22%, not bad.

#9 zzzkbot

opening	games	wins	first	last
8Hatch7Pool	5	60%	103	133
9PoolHatchSpire	4	25%	14	148
9PoolSunkHatch	30	67%	3	149
9PoolSunkSpeed	17	71%	0	136
OverhatchLing	14	64%	108	140
OverpoolSunk	20	50%	1	143
ZvZ_12Pool	1	0%	79	79
ZvZ_OverpoolTurtle	59	97%	13	147
8 openings	150	75%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	40	27%	80%	10	7%	100%	25%	68%
Turtle	110	73%	73%	27	18%	70%	22%	78%
Unknown		-	-	113	75%	73%	0%	0%

timing	#	median	early	late
my combat unit	150	2:25	2:14	3:06
my gas	148	2:38	2:01	7:07
enemy scout	150	2:38	0:39	4:01
enemy combat unit	150	2:41	1:53	4:31
enemy gas	121	5:27	2:24	8:21
enemy air unit	62	6:34	6:23	8:35
enemy cloaked unit	0	-	-	-
game duration	150	7:54	4:26	15:23

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	8	2:37	2:01	3:01	75%	5:27
gas steal success	0	-	-	-	-	-
none or failed	150	-	-	-	75%	5:27
gas steal killed	0	-	-	-

ZZZKBot switches between its famous 4 pool (recognized as Fast rush; see the 100% win rate when the strategy was recognized) and a build with sunkens, two hatcheries, and 6 sudden mutalisks (recognized as Turtle). It seems that part of Steamhammer’s difficulty with ZZZKBot (I consider a 75% win rate to be low in this case) is due to poor scouting: See the rate of Unknown enemy strategies. A bigger part of it may be a problem with the exploration policy. ZvZ_OverpoolTurtle scored 97% and was discovered early, but was played in only 59 games out of 150.

Steamhammer does not use enemy scout timing as a clue to the enemy strategy, and it should. When a scout appears at your base in 39 seconds it must have been sent almost immediately. Either the enemy is rushing, or is terrified that you might. A comparison of past scout timings versus recognized strategies for this opponent could be strong evidence.

ZZZKBot takes gas only in its muta strategy, and then it has sunkens so that a gas steal cannot succeed. Steamhammer figured that out fairly quickly.

#10 ualbertabot

opening	games	wins	first	last
973HydraBust	1	0%	129	129
AntiZeal_12Hatch	7	71%	0	142
Over10Hatch	4	75%	2	141
OverpoolTurtle	137	99%	1	148
4 openings	149	96%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	10	7%	100%	17	11%	100%	0%	50%
Fast rush	28	19%	93%	23	15%	96%	14%	36%
Heavy rush	111	74%	96%	60	40%	93%	44%	22%
Naked expand		-	-	10	7%	100%	0%	0%
Unknown		-	-	39	26%	97%	0%	0%

timing	#	median	early	late
my combat unit	149	2:26	2:22	3:15
my gas	147	2:58	2:54	4:46
enemy scout	140	2:02	1:18	7:41
enemy combat unit	118	2:39	1:42	4:33
enemy gas	127	3:38	2:37	13:12
enemy air unit	33	14:31	10:46	16:33
enemy cloaked unit	32	14:16	2:37	16:33
game duration	149	8:27	3:42	20:54

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	0	-	-	-	-	-
gas steal success	0	-	-	-	-	-
none or failed	149	-	-	-	96%	3:38
gas steal killed	0	-	-	-

Steamhammer recorded only 149 games against UAlbertaBot, because it crashed one game, its only crash in the tournament. Against every other opponent, Steamhammer recorded all 150 games.

There was no need to test the gas steal. If you’re winning nearly all games, spending a drone to vary your play is not likely to gain anything. Another point is that UAlbertaBot often takes gas quite late, so stealing gas early is unlikely to pay off.

#11 willyt

opening	games	wins	first	last
11Gas10PoolLurker	47	74%	4	149
12-12Hatch	1	0%	46	46
2HatchLurker	1	0%	115	115
4PoolHard	5	20%	2	58
9HatchMain9Pool9Gas	16	38%	7	148
9PoolSpeed	33	58%	1	141
9PoolSpeedAllIn	28	54%	3	145
GuardianRush	5	40%	123	143
Over10Hatch2Sunk	1	0%	146	146
Overpool_3HatchLing	1	0%	113	113
Sparkle 3HatchMuta	11	45%	65	132
ZvT_3HatchMuta	1	0%	0	0
12 openings	150	55%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory		-	-	1	1%	0%	0%	0%
Fast rush		-	-	1	1%	0%	0%	0%
Heavy rush	16	11%	62%	15	10%	27%	12%	50%
Naked expand	86	57%	53%	35	23%	100%	17%	52%
Safe expand	48	32%	56%	30	20%	77%	25%	31%
Unknown		-	-	68	45%	31%	0%	0%

timing	#	median	early	late
my combat unit	150	2:42	1:46	3:38
my gas	145	1:49	1:45	5:17
enemy scout	142	2:11	1:38	7:19
enemy combat unit	150	2:59	2:05	6:41
enemy gas	112	5:13	4:04	10:07
enemy air unit	11	17:13	10:45	30:58
enemy cloaked unit	22	14:23	7:10	19:33
game duration	150	8:48	4:42	31:49

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	27	2:13	1:48	2:43	67%	5:45
gas steal success	23	2:21	1:56	2:57	70%	6:19
none or failed	127	-	-	-	53%	5:11
gas steal killed	23	4:06	3:00	4:43

WillyT was a tough opponent for Steamhammer. The choice of zerg builds does not seem strong. I have no idea how the guardian rush won 2 games out of 5.

#12 ecgberht

opening	games	wins	first	last
11HatchTurtleLurker	1	0%	1	1
11HatchTurtleMuta	7	57%	8	148
12HatchTurtle	1	0%	119	119
9PoolLurker	47	91%	0	149
9PoolSpeed	25	76%	3	142
AntiStyx_9Pool	4	75%	121	141
HiveRush	8	75%	89	140
Over10HatchBust	22	82%	2	135
OverpoolLurker	35	91%	4	147
9 openings	150	83%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	39	26%	79%	24	16%	79%	28%	26%
Fast rush	30	20%	80%	21	14%	90%	10%	40%
Heavy rush	56	37%	89%	23	15%	87%	18%	39%
Naked expand	6	4%	67%	6	4%	83%	17%	33%
Proxy	8	5%	75%	5	3%	60%	0%	50%
Safe expand	8	5%	100%	8	5%	75%	0%	50%
Unknown		-	-	55	37%	87%	0%	0%
Worker rush	3	2%	67%	8	5%	62%	0%	33%

timing	#	median	early	late
my combat unit	150	2:18	2:11	3:17
my gas	150	1:55	1:45	7:07
enemy scout	144	1:27	0:30	5:18
enemy combat unit	148	3:01	1:55	6:53
enemy gas	98	5:05	2:51	8:43
enemy air unit	49	5:58	4:01	19:55
enemy cloaked unit	31	5:54	5:21	7:15
game duration	150	8:36	3:52	22:29

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	22	2:17	1:48	5:45	86%	5:10
gas steal success	13	2:23	2:00	5:51	85%	5:24
none or failed	137	-	-	-	83%	5:03
gas steal killed	13	3:27	2:17	6:10

Ecgberht is a tricky opponent. If it knew how to defeat fast lurker builds, it would score a lot higher against Steamhammer.

#13 eggbot

opening	games	wins	first	last
2HatchHydraBust	3	67%	32	86
3HatchHydra	7	86%	70	105
3HatchHydraBust	3	67%	52	103
3HatchHydraExpo	8	100%	15	137
3HatchLateHydras+1	6	83%	13	75
3HatchLingBust2	6	100%	12	121
4HatchBeforeGas	6	100%	7	149
4HatchBeforeLair	11	100%	16	142
5HatchBeforeGas	8	100%	1	118
6PoolHide	19	100%	5	145
973HydraBust	9	100%	6	130
9PoolHide	10	100%	27	115
9PoolSunkHatch	11	100%	0	138
9PoolSunkSpeed	7	100%	18	148
AntiStyx_9Pool	6	100%	8	129
OverpoolHide	14	100%	2	146
ZvP_3BaseSpire+Den	10	100%	11	141
ZvP_3HatchPoolHydra	6	83%	30	147
18 openings	150	97%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Contain	56	37%	98%	33	22%	100%	23%	43%
Proxy	35	23%	97%	25	17%	80%	14%	37%
Turtle	59	39%	95%	37	25%	100%	20%	31%
Unknown		-	-	55	37%	100%	0%	0%

timing	#	median	early	late
my combat unit	150	2:26	1:58	4:09
my gas	139	3:43	2:38	8:31
enemy scout	79	1:29	0:31	10:17
enemy combat unit	32	6:50	4:31	8:59
enemy gas	0	-	-	-
enemy air unit	0	-	-	-
enemy cloaked unit	0	-	-	-
game duration	150	6:50	4:04	60:00

gas steal	#	median	early	late	wins	enemy gas
gas steal decision	0	-	-	-	-	-
gas steal success	0	-	-	-	-	-
none or failed	150	-	-	-	97%	-
gas steal killed	0	-	-	-

EggBot is a cannon bot. Its builds were recognized as Proxy for close cannons, Contain for cannons farther away from zerg bases, or Turtle for distant cannons. Steamhammer tried a variety of openings, discarding one as soon as it had lost a single game because there were others that still scored 100%. Steamhammer understood that EggBot never took gas, so gas steal would gain nothing.

overall

	total		ZvT		ZvP		ZvZ		ZvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
10HatchBurrow	18	6%			18	6%
10HatchHydra	1	0%			1	0%
10HatchLing	1	0%			1	0%
10Pool9Hatch	1	0%			1	0%
11Gas10PoolLurker	49	71%	47	74%	2	0%
11Gas10PoolMuta	1	0%			1	0%
11HatchTurtleHydra	37	22%			37	22%
11HatchTurtleLurker	1	0%	1	0%
11HatchTurtleMuta	10	40%	7	57%	3	0%
12-11Hatch	2	0%			2	0%
12-11HatchLing	1	0%			1	0%
12-12Hatch	3	0%	1	0%	2	0%
12Hatch12Pool	1	0%			1	0%
12HatchTurtle	18	11%	1	0%	17	12%
12Hatch_4HatchLing	2	0%			2	0%
12PoolLurker	1	0%					1	0%
2.5HatchMuta	1	0%			1	0%
2HatchHydra	1	0%			1	0%
2HatchHydraBust	5	40%			5	40%
2HatchLingAllInSpire	9	44%	8	50%	1	0%
2HatchLurker	1	0%	1	0%
2HatchLurkerAllIn	2	0%			2	0%
2HatchLurkerPure	13	15%			13	15%
2HatchMutaPure	1	0%			1	0%
2x10HatchAllIn	2	0%			2	0%
2x10HatchSlow	2	0%			2	0%
3HatchHydra	11	55%			11	55%
3HatchHydraBust	7	29%			7	29%
3HatchHydraExpo	12	67%			12	67%
3HatchLateHydras+1	9	56%			9	56%
3HatchLing	62	42%			62	42%
3HatchLingBurrow	23	26%			18	28%	5	20%
3HatchLingBust2	38	45%			38	45%
3HatchLingExpo	3	0%			3	0%
3HatchLurker	18	50%	16	56%	2	0%
4HatchBeforeGas	32	34%			32	34%
4HatchBeforeLair	13	85%			13	85%
4PoolHard	7	14%	5	20%	2	0%
4PoolSoft	5	20%	5	20%
4Scout	2	0%			2	0%
5HatchBeforeGas	13	62%			13	62%
5HatchPool	60	83%	59	85%	1	0%
5HatchPoolLing	2	0%			2	0%
5HatchPoolLingBurrow	1	0%			1	0%
5PoolHard	1	0%			1	0%
5PoolHard2Player	5	0%	1	0%	4	0%
5Scout	2	0%			2	0%
6PoolBurrow	1	0%					1	0%
6PoolHide	20	95%			20	95%
6PoolSpeed	3	0%	1	0%	2	0%
6Scout	1	0%			1	0%
7DroneHatch	1	0%			1	0%
7HatchSpeed	3	0%			3	0%
8-8HydraRush	2	0%			1	0%	1	0%
8DroneGas	12	58%			1	0%	11	64%
8Gas7PoolLurker B	2	0%			2	0%
8Hatch7Pool	22	23%			17	12%	5	60%
8Hatch7PoolBurrow	1	0%			1	0%
8Hatch7PoolSpeed	1	0%			1	0%
973HydraBust	16	56%			15	60%			1	0%
9Hatch8Pool	8	12%			3	0%	5	20%
9HatchExpo9Pool9Gas	7	14%			7	14%
9HatchMain9Pool9Gas	20	30%	16	38%	2	0%	2	0%
9Pool8GasLurker	1	0%			1	0%
9PoolExpo	1	0%			1	0%
9PoolHatch	4	0%			4	0%
9PoolHatchBurrow	1	0%			1	0%
9PoolHatchSpeed	2	0%			2	0%
9PoolHatchSpeed7DroneB	1	0%			1	0%
9PoolHatchSpeedAllInB	4	0%			3	0%	1	0%
9PoolHatchSpeedSpire	4	0%			3	0%	1	0%
9PoolHatchSpeedSpire2	1	0%			1	0%
9PoolHatchSpire	5	20%			1	0%	4	25%
9PoolHide	10	100%			10	100%
9PoolLurker	47	91%	47	91%
9PoolSpeed	58	66%	58	66%
9PoolSpeedAllIn	32	47%	30	50%	2	0%
9PoolSpire	2	0%					2	0%
9PoolSpireSlowlings	2	0%			2	0%
9PoolSunkHatch	43	72%			13	85%	30	67%
9PoolSunkSpeed	24	79%			7	100%	17	71%
AntiFact_13Pool	25	36%			25	36%
AntiFact_2Hatch	2	0%			2	0%
AntiFact_Overpool9Gas	1	0%			1	0%
AntiFactory2	1	0%			1	0%
AntiFactoryHydra	1	0%			1	0%
AntiStyx_9Pool	10	90%	4	75%	6	100%
AntiTyrLurker	1	0%			1	0%
AntiWraith_2Hatch	1	0%			1	0%
AntiZeal_12Hatch	25	20%			18	0%			7	71%
DefilerRush	3	33%			3	33%
GuardianRush	6	33%	5	40%	1	0%
HiveRush	8	75%	8	75%
Over10Hatch	10	30%			6	0%			4	75%
Over10Hatch+1	1	0%			1	0%
Over10Hatch11Pool	1	0%			1	0%
Over10Hatch1Sunk	5	0%			5	0%
Over10Hatch2Sunk	6	0%	1	0%	5	0%
Over10Hatch2SunkHard	6	0%			6	0%
Over10HatchBurrow	1	0%			1	0%
Over10HatchBust	71	39%	22	82%	30	7%	19	42%
Over10HatchHydra	3	0%			3	0%
Over10HatchSlowLings	2	0%			2	0%
Over10PoolLing	1	0%					1	0%
OverhatchExpoLing	2	0%			2	0%
OverhatchExpoMuta	8	0%			8	0%
OverhatchLateGas	4	0%			4	0%
OverhatchLing	15	60%					15	60%
Overpool+1	1	0%			1	0%
Overpool2HatchLurker	1	0%			1	0%
OverpoolBurrow	1	0%					1	0%
OverpoolHatch	1	0%			1	0%
OverpoolHide	15	93%			15	93%
OverpoolHydra	1	0%			1	0%
OverpoolLurker	35	91%	35	91%
OverpoolSpeed	20	15%			5	0%	15	20%
OverpoolSunk	43	42%			2	0%	41	44%
OverpoolTurtle	168	87%			8	0%	23	48%	137	99%
OverpoolTurtle 0	3	0%			3	0%
Overpool_3HatchLing	4	0%	1	0%	3	0%
Overpool_4HatchLing	2	0%			2	0%
Proxy8HatchNatural	1	0%			1	0%
PurpleSwarmBuild	2	0%			2	0%
QueenRush	1	0%			1	0%
Sparkle 2HatchMuta	1	0%			1	0%
Sparkle 3HatchMuta	18	33%	11	45%	7	14%
UltraRush	3	33%	3	33%
ZvP_2HatchFakeMuta	1	0%			1	0%
ZvP_2HatchMuta	2	0%			2	0%
ZvP_3BaseSpire+Den	12	83%			12	83%
ZvP_3HatchMuta	2	0%			1	0%	1	0%
ZvP_3HatchPoolHydra	24	33%			24	33%
ZvP_4HatchPoolHydra	2	0%			2	0%
ZvP_Overpool3Hatch	1	0%			1	0%
ZvT_12PoolMuta	8	12%			8	12%
ZvT_13Pool	16	44%			16	44%
ZvT_2HatchMuta	2	0%			2	0%
ZvT_3HatchMuta	10	50%	8	62%	2	0%
ZvT_3HatchMutaExpo	40	78%	40	78%
ZvT_7Pool	9	44%	8	50%	1	0%
ZvZ_12HatchExpo	6	33%					6	33%
ZvZ_12HatchMain	2	0%			2	0%
ZvZ_12Pool	1	0%					1	0%
ZvZ_12PoolLing	12	58%			1	0%	11	64%
ZvZ_12PoolLingB	3	0%			3	0%
ZvZ_12PoolMain	3	33%					3	33%
ZvZ_Overgas8Pool	1	0%			1	0%
ZvZ_Overgas9Pool	2	0%					2	0%
ZvZ_Overpool11Gas	46	70%			2	0%	44	73%
ZvZ_Overpool9Gas	64	89%					64	89%
ZvZ_OverpoolTurtle	117	77%					117	77%
total	1799	54%	450	70%	750	31%	450	63%	149	96%
openings played	151		29		130		30		4

This is not all of Steamhammer’s zerg openings! The tournament wasn’t long enough for it to try everything.

AIIDE 2020 - what McRave learned

I got distracted, but I’m back. These tables aren’t what McRave learned exactly, they are what it recorded in its Info files, more like a summary of how McRave played given what it had learned. History files like these are rich with information and I wanted to extract more from them, but this will do for now.

The “opening” tables give McRave’s strategy as it represents it, build-opener-transition (see last post). The “enemy” tables represent the recognized enemy strategy in the same format. I like that the strategies of both sides are represented the same way, it’s elegant. The hierarchical representation has advantages for reacting to enemy strategies: McRave may be able to react to an aspect of its enemy’s plan, or to an enemy strategy that it only partially recognized.

Here’s a sample line from the file ZvZ Microwave Info.txt, the first game of 150, to give a taste of how much information there is.

Won,Destination,28:43,HatchPool,9Pool,LingRush,PoolLair,9Pool,1HatchMuta,4:07,2:45,3:31,1,Zerg_Larva,92,Zerg_Zergling,105,Zerg_Drone,18,Zerg_Overlord,73,Zerg_Mutalisk,36,Zerg_Scourge,30,Zerg_Larva,114,Zerg_Zergling,102,Zerg_Drone,12,Zerg_Overlord,34,Zerg_Mutalisk,14,Zerg_Scourge,4,Zerg_Cocoon,11,Zerg_Devourer

#1 stardust

opening	games	wins	first	last
PoolHatch,Overpool,2HatchMuta	150	2%	0	149
1 openings	150	2%

enemy	games	wins
1GateCore,2Zealot,4Gate	14	7%
2Gate,Main,4Gate	135	1%
2Gate,Main,DT	1	0%
3 openings	150	2%

I think Stardust does not make dark templar, so the one DT game may be a strategy inference miss. It’s interesting that most of McRave’s wins came when Stardust was recognized as opening with gate-cybercore instead of with two gates.

#2 purplewave

opening	games	wins	first	last
PoolHatch,Overpool,2HatchMuta	149	5%	0	149
PoolHatch,Overpool,3HatchSpeedling	1	0%	45	45
2 openings	150	5%

enemy	games	wins
2Gate,Main,4Gate	144	6%
2Gate,Main,Corsair	3	0%
2Gate,Main,ZealotRush	3	0%
3 openings	150	5%

PurpleWave chose to play similarly to Stardust, not quite as successfully.

#3 bananabrain

opening	games	wins	first	last
PoolHatch,Overpool,2HatchMuta	131	45%	1	149
PoolHatch,Overpool,2HatchSpeedling	10	50%	0	147
PoolHatch,Overpool,3HatchSpeedling	9	44%	24	131
3 openings	150	45%

enemy	games	wins
1GateCore,2Zealot,Corsair	20	75%
1GateCore,2Zealot,DT	8	62%
2Gate,Main,4Gate	1	0%
2Gate,Main,Corsair	38	11%
2Gate,Main,DT	4	25%
2Gate,Proxy,ZealotRush	10	50%
FFE,Forge,5GateGoon	17	47%
FFE,Forge,NeoBisu	24	62%
FFE,Forge,Speedlot	13	54%
FFE,Forge,Unknown	2	0%
FFE,Forge,ZealotArchon	1	100%
FFE,Gateway,Speedlot	1	0%
FFE,Nexus,5GateGoon	4	25%
FFE,Nexus,NeoBisu	7	86%
14 openings	150	45%

McRave’s plans were about equally successful, but it chose to go with mutalisks much more often than zerglings. I think that’s strategically correct, because BananaBrain is susceptible to the occasional ling bust but will play safely if zerg repeats it too often. Otherwise, BananaBrain is unpredictable as always. As I understand it, McRave does not try to directly predict the enemy strategy but only reacts to what it scouts, so unpredictability can confuse UCB but nothing worse.

#4 dragon

opening	games	wins	first	last
HatchPool,12Hatch,2HatchMuta	60	18%	0	148
HatchPool,12Hatch,2HatchSpeedling	3	0%	24	129
PoolHatch,12Pool,2HatchMuta	19	0%	3	149
PoolHatch,Overpool,2HatchMuta	67	28%	1	147
PoolHatch,Overpool,2HatchSpeedling	1	100%	47	47
5 openings	150	21%

enemy	games	wins
2Rax,Expand,Unknown	2	0%
2Rax,Main,1FactTanks	2	0%
2Rax,Main,Academy	3	0%
2Rax,Main,Unknown	27	22%
2Rax,Proxy,1FactTanks	1	0%
2Rax,Proxy,Unknown	2	0%
RaxCC,1RaxFE,1FactTanks	10	10%
RaxCC,1RaxFE,5FactGoliath	45	18%
RaxCC,1RaxFE,Unknown	12	58%
RaxFact,Unknown,5FactGoliath	14	36%
RaxFact,Unknown,Unknown	2	50%
Unknown,Unknown,2Fact	6	17%
Unknown,Unknown,Unknown	23	4%
Unknown,Unknown,WorkerRush	1	100%
14 openings	150	21%

Dragon is the first opponent to push McRave into exploring most of its available strategies. Notice McRave only played PoolHatch,Overpool,2HatchSpeedling once (about a third of the way through the tournament) even though it won, and even though PoolHatch and Overpool was its preferred stem. It shows a strong preference for mutalisk play.

#6 microwave

opening	games	wins	first	last
PoolHatch,12Pool,2HatchMuta	19	63%	1	127
PoolHatch,12Pool,2HatchSpeedling	14	57%	21	118
PoolLair,9Pool,1HatchMuta	114	84%	0	146
3 openings	147	79%

enemy	games	wins
HatchPool,12Pool,1HatchLurker	1	100%
HatchPool,12Pool,2HatchLing	6	100%
HatchPool,12Pool,3HatchMuta	6	67%
HatchPool,12Pool,Unknown	13	77%
HatchPool,4Pool,LingRush	2	50%
HatchPool,9Pool,LingRush	28	57%
HatchPool,9Pool,Unknown	1	0%
HatchPool,Unknown,2HatchLing	21	86%
HatchPool,Unknown,3HatchLing	1	100%
HatchPool,Unknown,Unknown	6	100%
PoolHatch,12Pool,2HatchLing	2	100%
PoolHatch,12Pool,3HatchMuta	1	0%
PoolHatch,12Pool,Unknown	12	100%
PoolHatch,4Pool,LingRush	2	100%
PoolHatch,9Pool,2HatchLing	1	100%
PoolHatch,9Pool,LingRush	2	100%
PoolHatch,Unknown,2HatchLing	3	100%
PoolHatch,Unknown,Unknown	4	75%
PoolLair,9Pool,1HatchMuta	3	100%
PoolLair,9Pool,Unknown	3	100%
PoolLair,Unknown,Unknown	1	100%
Unknown,12Pool,3HatchLing	1	100%
Unknown,12Pool,3HatchMuta	4	50%
Unknown,12Pool,Unknown	9	67%
Unknown,4Pool,LingRush	1	100%
Unknown,9Pool,LingRush	3	67%
Unknown,9Pool,Unknown	4	75%
Unknown,Unknown,1HatchHydra	1	100%
Unknown,Unknown,3HatchMuta	1	100%
Unknown,Unknown,Unknown	4	100%
30 openings	147	79%

McRave’s play was similar against Microwave and Steamhammer: It chose the same builds in roughly similar proportions, and it found the opponent’s strategies to be highly diverse and difficult to recognize (notice all the Unknown values).

#7 steamhammer

opening	games	wins	first	last
PoolHatch,12Pool,2HatchMuta	33	48%	2	149
PoolHatch,12Pool,2HatchSpeedling	22	55%	1	144
PoolLair,9Pool,1HatchMuta	95	61%	0	148
3 openings	150	57%

enemy	games	wins
HatchPool,12Pool,1HatchHydra	1	0%
HatchPool,12Pool,1HatchLurker	2	100%
HatchPool,12Pool,1HatchMuta	1	100%
HatchPool,12Pool,2HatchLing	4	100%
HatchPool,12Pool,3HatchMuta	2	50%
HatchPool,12Pool,Unknown	8	25%
HatchPool,9Pool,2HatchLing	6	17%
HatchPool,9Pool,3HatchMuta	2	0%
HatchPool,9Pool,LingRush	12	33%
HatchPool,9Pool,Unknown	8	75%
HatchPool,Unknown,1HatchHydra	2	100%
HatchPool,Unknown,2HatchLing	8	88%
HatchPool,Unknown,2HatchMuta	1	100%
HatchPool,Unknown,Unknown	4	100%
PoolHatch,12Pool,1HatchLurker	1	100%
PoolHatch,12Pool,2HatchLing	1	100%
PoolHatch,12Pool,Unknown	5	80%
PoolHatch,9Pool,2HatchLing	2	50%
PoolHatch,9Pool,LingRush	7	86%
PoolHatch,9Pool,Unknown	1	0%
PoolHatch,Unknown,2HatchLing	1	100%
PoolHatch,Unknown,Unknown	2	0%
PoolLair,12Pool,1HatchMuta	6	50%
PoolLair,9Pool,1HatchMuta	10	50%
PoolLair,9Pool,Unknown	1	100%
PoolLair,Unknown,1HatchMuta	7	14%
PoolLair,Unknown,Unknown	3	100%
Unknown,12Pool,1HatchHydra	5	40%
Unknown,12Pool,3HatchMuta	2	50%
Unknown,12Pool,Unknown	8	75%
Unknown,9Pool,1HatchHydra	2	50%
Unknown,9Pool,LingRush	1	100%
Unknown,Unknown,1HatchHydra	3	100%
Unknown,Unknown,3HatchMuta	3	67%
Unknown,Unknown,Unknown	18	44%
35 openings	150	57%

#8 daqin

opening	games	wins	first	last
PoolHatch,Overpool,2HatchMuta	150	65%	0	149
1 openings	150	65%

enemy	games	wins
FFE,Forge,5GateGoon	5	80%
FFE,Forge,Speedlot	121	59%
FFE,Forge,ZealotArchon	2	100%
FFE,Gateway,Speedlot	1	100%
FFE,Nexus,Speedlot	21	90%
5 openings	150	65%

McRave never varied against DaQin. Why is that? It doesn’t seem to be due to elitism (that is, always choosing a plan that has shown itself “elite” aka good enough), because McRave tried two plans versus UAlbertaBot and both had a higher win rate.

#9 zzzkbot

opening	games	wins	first	last
PoolHatch,12Pool,2HatchMuta	16	62%	1	127
PoolHatch,12Pool,2HatchSpeedling	10	50%	40	105
PoolLair,9Pool,1HatchMuta	124	88%	0	149
3 openings	150	83%

enemy	games	wins
HatchPool,4Pool,LingRush	2	100%
HatchPool,9Pool,LingRush	5	80%
HatchPool,Unknown,2HatchHydra	2	100%
HatchPool,Unknown,2HatchMuta	1	100%
PoolHatch,9Pool,LingRush	1	0%
PoolHatch,Unknown,Unknown	2	100%
Unknown,4Pool,LingRush	35	86%
Unknown,9Pool,LingRush	68	87%
Unknown,9Pool,Unknown	9	67%
Unknown,Unknown,1HatchHydra	3	100%
Unknown,Unknown,Unknown	22	68%
11 openings	150	83%

#10 ualbertabot

opening	games	wins	first	last
PoolHatch,Overpool,2HatchMuta	99	80%	0	149
PoolHatch,Overpool,2HatchSpeedling	51	98%	3	147
2 openings	150	86%

enemy	games	wins
1GateCore,0Zealot,4Gate	30	93%
1GateCore,0Zealot,DT	10	100%
1GateCore,Unknown,DT	6	100%
2Gate,Main,ZealotRush	16	100%
2Rax,Main,MarineRush	9	89%
2Rax,Main,Unknown	22	41%
HatchPool,Unknown,1HatchHydra	6	100%
HatchPool,Unknown,2HatchHydra	4	100%
HatchPool,Unknown,2HatchLing	8	100%
HatchPool,Unknown,2HatchMuta	4	100%
HatchPool,Unknown,Unknown	4	100%
PoolHatch,Unknown,Unknown	1	100%
RaxCC,8Rax,Unknown	8	38%
RaxFact,Unknown,Unknown	6	100%
Unknown,4Pool,LingRush	9	100%
Unknown,9Pool,LingRush	1	100%
Unknown,Unknown,1HatchHydra	1	100%
Unknown,Unknown,Unknown	5	100%
18 openings	150	86%

#11 willyt

opening	games	wins	first	last
HatchPool,12Hatch,2HatchMuta	102	67%	0	149
PoolHatch,12Pool,2HatchMuta	6	0%	6	142
PoolHatch,Overpool,2HatchMuta	41	63%	3	146
PoolHatch,Overpool,2HatchSpeedling	1	0%	4	4
4 openings	150	63%

enemy	games	wins
2Rax,Main,1FactTanks	1	0%
2Rax,Main,Academy	3	100%
2Rax,Main,MarineRush	2	50%
RaxCC,1RaxFE,1FactTanks	93	49%
RaxCC,1RaxFE,5FactGoliath	1	0%
RaxCC,1RaxFE,Unknown	7	86%
RaxCC,Unknown,Unknown	20	100%
Unknown,Unknown,2Fact	3	33%
Unknown,Unknown,Unknown	20	85%
9 openings	150	63%

#12 ecgberht

opening	games	wins	first	last
HatchPool,12Hatch,2HatchMuta	83	87%	0	149
HatchPool,12Hatch,2HatchSpeedling	29	79%	12	142
PoolHatch,12Pool,2HatchMuta	6	0%	11	133
PoolHatch,12Pool,2HatchSpeedling	3	100%	30	32
PoolHatch,Overpool,2HatchMuta	16	50%	1	113
PoolHatch,Overpool,2HatchSpeedling	13	100%	27	101
6 openings	150	79%

enemy	games	wins
2Rax,Expand,Unknown	18	83%
2Rax,Main,Academy	1	0%
2Rax,Main,MarineRush	1	0%
2Rax,Main,Unknown	19	47%
2Rax,Proxy,Unknown	7	86%
RaxCC,1RaxFE,1FactTanks	1	100%
RaxCC,1RaxFE,Unknown	39	85%
RaxCC,8Rax,Unknown	20	90%
RaxCC,Unknown,Unknown	21	81%
RaxFact,Unknown,2Fact	1	100%
RaxFact,Unknown,2PortWraith	10	100%
RaxFact,Unknown,Unknown	7	100%
Unknown,Unknown,Unknown	5	40%
13 openings	150	79%

Ecgberht is a tricky opponent. I think McRave tried every plan that was enabled—mostly with success, to be sure.

#13 eggbot

opening	games	wins	first	last
PoolHatch,Overpool,2HatchMuta	10	100%	18	148
PoolHatch,Overpool,2HatchSpeedling	140	99%	0	149
2 openings	150	99%

enemy	games	wins
2Gate,Proxy,ZealotRush	19	89%
CannonRush,Unknown,Unknown	16	100%
Unknown,Unknown,Unknown	115	100%
3 openings	150	99%

AIIDE 2020 - McRave’s learning algorithm

I meant to summarize McRave’s learning data today, but to know what to put in the tables I had to understand how the numbers are used. Yesterday I examined McRave’s strategy representation with three elements, like “PoolHatch,Overpool,2HatchMuta”. In the code, the elements are named “build” (like PoolHatch), “opener” (like Overpool) and “transition” (like 2HatchMuta). Today I read the code to see what the numbers in the learning files are and how they are used.

Here’s a sample data file, showing McRave doing well versus Steamhammer. The first two numbers are the overall wins and losses. After that, delimited by dashes, is a section for the first build, followed by a section for the openers of that build and a section for the transitions of the build. Then more sections for the other two builds and their appendages. Each element has an independent count of wins and losses.

86 64
-
HatchPool 0 0
-
12Hatch 0 0
-
2HatchMuta 0 0
2HatchSpeedling 0 0
-
PoolHatch 28 27
-
4Pool 0 0
9Pool 0 0
Overpool 0 0
12Pool 28 27
-
2HatchMuta 16 17
2HatchSpeedling 12 10
3HatchSpeedling 0 0
-
PoolLair 58 37
-
9Pool 58 37
-
1HatchMuta 58 37

The code calls a function to check which triples are allowed and deals with other minor details, but even with the fiddly bits it’s simple: It picks the build with the highest UCB value, then given that build the corresponding opener with the highest UCB value, then given that build and opener the transition with the highest UCB value. Because of how the data file is organized, this can be done in one pass. The code is in the file LearningManager.cpp in the nested function parseLearningFile().

In theory, this three-level hierarchy could speed up learning. For example, you might be able to conclude that PoolHatch is better than PoolLair against some opponent, even if you don’t have enough data to know which PoolHatch opener or transition is best. My intuition is that the hierarchical scheme should on average work better than a flat scheme, but that there will be perverse situations where it does worse. Many of the triples are not allowed, which limits the value of the hierarchy. There should be enough data from this tournament to judge whether the hierarchy brought an advantage; it would be interesting to do the analysis.

Next: OK, now I know what tables to generate. I have to add some features to my script, but soon I should be able to post the summary tables.

AIIDE 2020 - what Dragon learned

Dragon’s learning file format is spare, one line for each game giving strategy name and win or loss, nothing more. Dragon has 7 strategies, and against most opponents tried all of them. Its habit is to keep with a winning strategy, trying others sporadically but generally switching when the current plan starts to fail.

Dragon calls its worker rush “dirty worker rush”. Perhaps we should get it together with Stone so it can learn a nice clean worker rush.

#1 stardust

opening	games	wins	first	last
1rax fe	16	6%	6	147
2rax bio	18	6%	2	143
2rax mech	14	0%	0	148
bio	26	8%	4	149
dirty worker rush	23	13%	1	114
mass vulture	40	10%	3	144
siege expand	13	0%	5	145
7 openings	150	7%

As you can see in the “first” column (the first game each strategy was played), Dragon tried all 7 strategies in the first 7 games because they all lost on their first tries. Worker rush turned out to be the most successful plan, as far as that goes, which is very interesting. Mass vultures were the most-played plan despite not having the highest win rate, apparently because the worker rush had a string of losses so that vultures looked better in recent games. (Maybe Dragon figured that Stardust had learned how to deal with the worker rush.)

How did mass vultures have any chance against Stardust’s dragoons? I located a couple of the “mass vulture” wins and watched them. In fact, tanks were the core of Dragon’s army and the vultures acted as buffer. It looked like regular tank-vulture unit mix with regular tank pushes.

#2 purplewave

opening	games	wins	first	last
1rax fe	36	64%	9	83
2rax bio	7	43%	4	101
2rax mech	3	0%	19	129
bio	16	50%	3	100
dirty worker rush	3	0%	0	102
mass vulture	32	62%	20	146
siege expand	50	58%	1	128
7 openings	147	56%

Against PurpleWave, and BananaBrain below, most Dragon strategies worked about equally well. Apparently it has well-balanced play against protoss. Actually I think the explanation may be different: Once the opening is over, Dragon quickly adapts to the enemy, playing against the units it sees. If I guess right, then its goal in the opening is to survive in a good position, and after that Dragon will produce whatever units it needs, so the opening doesn’t much affect the outcome. Obviously the worker rush doesn’t leave much room for adaptation, so it is an exception.

This hypothesis explains why Dragon can do well though it records so little data about each game: The openings often don’t much matter.

#3 bananabrain

opening	games	wins	first	last
1rax fe	14	57%	35	143
2rax bio	11	45%	22	139
2rax mech	15	47%	20	149
bio	37	59%	0	137
dirty worker rush	3	0%	23	146
mass vulture	56	61%	15	144
siege expand	14	50%	1	140
7 openings	150	55%

#5 mcrave

opening	games	wins	first	last
1rax fe	90	87%	11	146
2rax bio	14	64%	48	93
2rax mech	15	67%	51	82
bio	22	68%	0	66
dirty worker rush	1	0%	47	47
mass vulture	1	0%	7	7
siege expand	4	50%	8	12
7 openings	147	78%

Fast expand works versus McRave...

#6 microwave

opening	games	wins	first	last
1rax fe	3	0%	4	144
2rax bio	98	66%	0	148
2rax mech	5	20%	7	50
bio	13	46%	2	143
dirty worker rush	11	55%	8	31
mass vulture	3	0%	1	68
siege expand	16	44%	5	47
7 openings	149	57%

2 barracks is good against Microwave...

#7 steamhammer

opening	games	wins	first	last
1rax fe	37	32%	12	149
2rax bio	6	0%	11	108
2rax mech	12	33%	1	141
bio	9	11%	6	106
dirty worker rush	39	33%	10	140
mass vulture	30	37%	7	104
siege expand	17	24%	0	116
7 openings	150	30%

... but against Steamhammer, again, most strategies look about the same. Watching games, I think Dragon converges on a diverse unit mix fairly quickly after the opening.

I checked out a “mass vulture” game against Steamhammer, and it looked different from the same strategy against Stardust. Dragon made a modest number of vultures and researched spider mines, but added tanks and wraiths and soon the unit mix looked like most Dragon-Steamhammer games.

#8 daqin

opening	games	wins	first	last
1rax fe	49	67%	14	133
2rax bio	10	30%	6	132
2rax mech	23	57%	1	148
bio	5	20%	2	77
dirty worker rush	3	0%	0	78
mass vulture	45	53%	3	137
siege expand	14	43%	7	76
7 openings	149	54%

#9 zzzkbot

opening	games	wins	first	last
1rax fe	5	20%	0	71
2rax bio	10	40%	2	83
2rax mech	35	49%	24	99
bio	13	38%	25	113
dirty worker rush	4	0%	6	102
mass vulture	26	54%	4	149
siege expand	57	53%	27	148
7 openings	150	47%

Most curious: Against ZZZKBot, factory openings predominate. Checking the game durations, most games that ZZZKBot won were short, meaning that it played its 4 pool with success. Most games that Dragon won were longer, so either ZZZKBot did not 4 pool or else Dragon was slow to counterattack after surviving.

#10 ualbertabot

opening	games	wins	first	last
1rax fe	32	81%	94	147
2rax bio	5	60%	4	8
2rax mech	13	69%	9	105
bio	29	83%	66	137
dirty worker rush	2	50%	10	11
mass vulture	63	86%	13	125
siege expand	4	50%	0	3
7 openings	148	80%

#11 willyt

opening	games	wins	first	last
1rax fe	5	100%	56	148
2rax mech	143	94%	0	145
mass vulture	1	100%	59	59
3 openings	149	94%

#12 ecgberht

opening	games	wins	first	last
1rax fe	6	83%	49	140
bio	144	94%	0	149
2 openings	150	94%

#13 eggbot

opening	games	wins	first	last
2rax mech	146	99%	0	148
siege expand	3	67%	5	50
2 openings	149	98%

AIIDE 2020 - what Microwave learned 2

Microwave’s history files include both pre-training games and tournament games. I removed the pre-training games, and these tables show only tournament results. I looked at it both ways and decided this way was more informative. Yesterday’s table includes both prepared data and tournament games.

The enemy strategies listed in the form “HeavyRush -> SafeExpand” are the initially predicted and the later recognized enemy play, as explained by MicroDK in a comment. When they’re the same, the prediction was correct.

#1 stardust

opening	games	wins	first	last
10Hatch9Pool9gas	3	0%	60	107
10HatchMain9Pool9Gas	1	0%	133	133
12HatchMain	2	0%	14	49
12Pool	1	0%	110	110
12PoolMain	1	0%	121	121
12PoolMuta	2	0%	46	142
2HatchMuta	7	0%	20	98
3Hatch	3	0%	17	112
3HatchExpo	2	0%	43	57
3HatchHydra	1	0%	139	139
3HatchHydra_BHG	1	0%	38	38
3HatchLingBust	9	11%	24	144
3HatchMuta	36	0%	0	143
3HatchPoolHydra	7	0%	27	147
3HatchPoolHydraExpo	1	0%	114	114
4PoolHard	1	0%	123	123
4PoolSoft	2	0%	16	70
5HatchPoolHydra	18	0%	5	149
5Pool	2	0%	8	81
6Pool	2	0%	40	41
6PoolSpeed	4	0%	28	146
7Pool	1	0%	148	148
9Hatch9Pool9Gas	1	0%	134	134
9HatchMain8Pool8Gas	1	0%	117	117
9Pool	1	0%	15	15
9PoolGasHatchSpeed7D	1	0%	132	132
9PoolGasHatchSpeed8D	1	0%	3	3
9PoolHatchGasSpeed7D	1	0%	34	34
9PoolHatchGasSpeed8D	12	0%	6	138
9PoolHydra	1	0%	118	118
9PoolLurker	1	0%	95	95
9PoolSpeed	1	0%	137	137
9PoolSpeedLing	1	0%	58	58
9PoolSunkHatch	2	0%	71	105
9PoolSunken	1	0%	140	140
OverpoolLurker	1	0%	73	73
OverpoolSpeed	1	0%	116	116
OverpoolTurtle	1	0%	104	104
ZvP_10Hatch9Pool	2	0%	77	109
ZvP_11Hatch10Pool	1	0%	80	80
ZvP_2HatchHydra	2	0%	87	129
ZvP_9Hatch9Pool	2	0%	127	145
ZvZ_Overgas11Pool	2	0%	33	61
ZvZ_Overgas9Pool	2	0%	2	128
ZvZ_Overpool11Gas	2	0%	67	82
ZvZ_Overpool9Gas	1	0%	48	48
ZvZ_OverpoolTurtle	1	0%	122	122
47 openings	150	1%

enemy	games	wins
HeavyRush -> HeavyRush	127	1%
HeavyRush -> Unknown	21	0%
SafeExpand -> HeavyRush	2	0%
3 openings	150	1%

Stardust always plays the same strategy, so it’s no wonder that Microwave was able to predict it. Not that it helped. 3HatchMuta was tried repeatedly because it scored some wins in training.

#2 purplewave

opening	games	wins	first	last
10Hatch9Pool9gas	3	0%	4	122
11HatchTurtleHydra	3	0%	72	139
11HatchTurtleLurker	1	0%	57	57
12HatchMain	1	0%	44	44
12PoolMuta	11	18%	20	121
2HatchLurkerAllIn	1	0%	47	47
3HatchHydraBust	1	0%	40	40
3HatchHydra_BHG	1	0%	16	16
3HatchMuta	9	33%	26	149
3HatchMutaExpo	1	0%	17	17
3HatchPoolHydra	1	0%	94	94
4HatchPoolHydra	1	0%	56	56
4PoolHard	8	12%	7	147
6Pool	1	0%	55	55
6PoolSpeed	10	30%	32	135
7Pool	3	0%	38	86
7PoolHydraLingRush7D	1	0%	108	108
8PoolHydraRush8D	2	0%	19	49
9Hatch9Pool9Gas	1	0%	124	124
9HatchMain8Pool8Gas	8	25%	15	128
9Pool	2	0%	9	102
9PoolGasHatchSpeed7D	28	50%	0	142
9PoolHatchGasSpeed7D	17	65%	11	141
9PoolHatchGasSpeed8D	9	56%	114	146
9PoolSpeed	4	25%	1	54
9PoolSpeedLing	2	0%	60	119
9PoolSunkHatch	1	0%	109	109
9PoolSunken	1	0%	145	145
OverpoolSpeed	7	14%	23	143
OverpoolTurtle	6	17%	14	123
ZvP_10Hatch9Pool	1	0%	66	66
ZvP_2HatchHydra	1	0%	92	92
ZvP_9Hatch9Pool	2	0%	130	148
ZvZ_Overpool9Gas	1	0%	43	43
34 openings	150	29%

enemy	games	wins
HeavyRush -> HeavyRush	103	23%
HeavyRush -> NakedExpand	2	50%
HeavyRush -> SafeExpand	2	0%
HeavyRush -> Turtle	5	40%
HeavyRush -> Unknown	16	31%
NakedExpand -> HeavyRush	2	0%
SafeExpand -> HeavyRush	1	100%
SafeExpand -> SafeExpand	3	33%
SafeExpand -> Turtle	2	50%
Turtle -> HeavyRush	3	33%
Turtle -> NakedExpand	4	100%
Turtle -> SafeExpand	3	33%
Turtle -> Turtle	4	75%
13 openings	150	29%

PurpleWave opened with 2 gate most games. Microwave was able to predict it, but as we saw in UAlbertaBot’s table, the zealots are a Microwave weakness and PurpleWave was able to exploit it. Nevertheless, Microwave was no pushover. The more successful zerg tries were zergling openings, especially variants of the Styx build (9PoolHatchGasSpeed).

#3 bananabrain

opening	games	wins	first	last
10Hatch9Pool9gas	2	0%	4	20
10HatchMain9Pool9Gas	1	0%	5	5
11HatchTurtleHydra	1	0%	83	83
12Hatch	1	0%	60	60
12PoolMain	43	51%	37	139
12PoolMuta	1	0%	68	68
1HatchMuta_Sparkle	1	0%	65	65
2HatchMuta	5	20%	30	80
3HatchHydraBust	1	0%	109	109
3HatchHydra_BHG	1	0%	122	122
3HatchLingBust	6	33%	12	130
3HatchMuta	1	0%	11	11
3HatchPoolHydraExpo	1	0%	49	49
4HatchBeforeGas	1	0%	3	3
4HatchPoolHydra	2	0%	1	27
4PoolHard	6	33%	55	145
4PoolSoft	1	0%	108	108
6Pool	1	0%	81	81
7Pool	1	0%	13	13
8Pool	1	0%	53	53
8PoolHydraRush8D	1	0%	31	31
9PoolGasHatchSpeed8D	18	67%	70	149
9PoolHatchGasSpeed7D	1	0%	34	34
9PoolHatchGasSpeed8D	32	53%	0	146
9PoolSpeed	3	0%	25	147
9PoolSpeedLing	5	20%	7	123
9PoolSunkHatch	1	0%	142	142
Overpool	1	0%	127	127
OverpoolSpeed	3	0%	79	92
ZvP_10Hatch9Pool	3	33%	29	110
ZvP_11Hatch10Pool	1	0%	121	121
ZvZ_Overgas9Pool	1	0%	106	106
ZvZ_Overpool11Gas	2	0%	21	134
33 openings	150	39%

enemy	games	wins
HeavyRush -> HeavyRush	22	32%
HeavyRush -> NakedExpand	14	86%
HeavyRush -> SafeExpand	12	0%
HeavyRush -> Turtle	6	17%
HeavyRush -> Unknown	25	32%
NakedExpand -> HeavyRush	14	43%
NakedExpand -> NakedExpand	14	79%
NakedExpand -> SafeExpand	5	0%
NakedExpand -> Turtle	2	0%
NakedExpand -> Unknown	12	33%
SafeExpand -> HeavyRush	8	25%
SafeExpand -> NakedExpand	4	75%
SafeExpand -> SafeExpand	4	25%
SafeExpand -> Turtle	2	0%
SafeExpand -> Unknown	5	40%
Turtle -> NakedExpand	1	100%
16 openings	150	39%

BananaBrain is not predictable, and Microwave could not predict its play. Again, the more successful zerg builds were zergling openings.

#4 dragon

opening	games	wins	first	last
10HatchTurtleHydra	1	0%	131	131
11HatchTurtleLurker	1	0%	76	76
12PoolMain	1	0%	141	141
2HatchMuta	68	53%	1	148
3HatchHydraExpo	3	33%	122	135
3HatchMutaExpo	38	47%	0	144
4HatchPoolHydra	8	25%	73	136
4PoolSoft	18	28%	38	147
5HatchPoolHydra	5	60%	126	149
5PoolSpeed	1	0%	118	118
7PoolHydraLingRush7D	1	0%	78	78
9PoolHatchGasSpeed8D	1	0%	128	128
9PoolSunkHatch	1	0%	115	115
Overpool	1	0%	53	53
OverpoolLurker	1	0%	107	107
OverpoolTurtle	1	0%	62	62
16 openings	150	43%

enemy	games	wins
Factory -> Factory	18	56%
Factory -> HeavyRush	13	46%
Factory -> SafeExpand	2	0%
Factory -> Unknown	14	71%
Factory -> WorkerRush	3	33%
HeavyRush -> Factory	15	27%
HeavyRush -> HeavyRush	28	54%
HeavyRush -> NakedExpand	1	0%
HeavyRush -> SafeExpand	4	0%
HeavyRush -> Turtle	2	0%
HeavyRush -> Unknown	31	26%
NakedExpand -> HeavyRush	1	100%
SafeExpand -> HeavyRush	1	100%
SafeExpand -> Unknown	2	50%
WorkerRush -> Factory	2	50%
WorkerRush -> HeavyRush	2	100%
WorkerRush -> Unknown	4	50%
WorkerRush -> WorkerRush	7	43%
18 openings	150	43%

Microwave was moderately successful in predicting Dragon’s play, because Dragon tends to stick with a successful strategy as long as it remains successful. Look at that mix of zerg openings! 4 pool, hydra builds, and mutalisk builds.

#5 mcrave

opening	games	wins	first	last
10Hatch9Pool9gas	1	0%	133	133
10HatchMain9Pool9Gas	4	25%	42	72
10HatchTurtleHydra	1	0%	83	83
11HatchTurtleLurker	1	0%	36	36
12Hatch	1	0%	15	15
12Pool	17	18%	4	147
12PoolMain	2	0%	110	139
2HatchLurker	1	0%	32	32
3Hatch	2	0%	113	137
3HatchLurker	1	0%	95	95
3HatchMuta	1	0%	148	148
3HatchMutaExpo	1	0%	106	106
3HatchPoolHydra	2	0%	92	102
3HatchPoolHydraExpo	12	25%	49	145
4HatchBeforeGas	1	0%	71	71
4PoolSoft	1	0%	38	38
5PoolSpeed	1	0%	73	73
6PoolSpeed	4	25%	128	138
7PoolHydraLingRush7D	1	0%	134	134
8Pool	1	0%	62	62
9HatchMain8Pool8Gas	1	0%	47	47
9Pool	1	0%	104	104
9PoolGasHatchSpeed8D	1	0%	17	17
9PoolSpeed	28	46%	33	146
9PoolSpeedLing	1	0%	76	76
Overpool	1	0%	79	79
OverpoolSpeed	27	22%	0	149
ZvP_2HatchHydra	2	0%	14	54
ZvP_9Hatch9Pool	21	33%	1	143
ZvZ_Overpool11Gas	6	0%	7	108
ZvZ_Overpool9Gas	5	0%	8	67
31 openings	150	23%

enemy	games	wins
FastRush -> HeavyRush	1	0%
HeavyRush -> NakedExpand	3	67%
HeavyRush -> Unknown	1	0%
NakedExpand -> FastRush	1	100%
NakedExpand -> HeavyRush	3	33%
NakedExpand -> NakedExpand	8	38%
NakedExpand -> Turtle	7	14%
NakedExpand -> Unknown	29	7%
Turtle -> FastRush	1	0%
Turtle -> HeavyRush	1	100%
Turtle -> NakedExpand	11	64%
Turtle -> Turtle	23	26%
Turtle -> Unknown	61	16%
13 openings	150	23%

Microwave tried a lot of stuff versus McRave—three hatch before pool hydralisk opening in ZvZ? And it worked sometimes? I should try to find some of those games.

#7 steamhammer

opening	games	wins	first	last
10Hatch9Pool9gas	9	44%	68	142
10HatchMain9Pool9Gas	4	25%	101	113
10HatchTurtleHydra	1	0%	39	39
11HatchTurtleMuta	1	0%	108	108
12HatchMain	1	0%	15	15
12Pool	25	20%	0	144
12PoolMain	5	20%	24	92
2HatchLurker	2	0%	54	83
3HatchHydraBust	1	0%	104	104
3HatchHydraExpo	2	0%	67	86
3HatchPoolHydra	2	0%	7	149
4HatchPoolHydra	1	0%	34	34
5Pool	4	0%	5	96
5PoolSpeed	3	33%	94	133
7Pool	1	0%	36	36
7PoolHydraLingRush7D	1	0%	89	89
9Hatch9Pool9Gas	1	0%	106	106
9HatchTurtleHydra	1	0%	127	127
9PoolGasHatchSpeed8D	1	0%	42	42
9PoolHatch	2	0%	19	29
9PoolSpeed	31	55%	9	138
9PoolSpeedLing	1	0%	117	117
9PoolSunken	7	0%	1	95
OverpoolSpeed	3	33%	47	121
ZvP_11Hatch10Pool	4	50%	135	145
ZvP_2HatchHydra	9	0%	3	84
ZvP_9Hatch9Pool	1	0%	16	16
ZvZ_Overgas11Pool	20	50%	6	147
ZvZ_Overpool11Gas	2	0%	79	93
ZvZ_Overpool9Gas	4	25%	61	148
30 openings	150	29%

enemy	games	wins
HeavyRush -> HeavyRush	2	50%
HeavyRush -> Turtle	4	0%
Turtle -> FastRush	1	100%
Turtle -> HeavyRush	14	57%
Turtle -> NakedExpand	16	38%
Turtle -> Turtle	96	23%
Turtle -> Unknown	17	29%
7 openings	150	29%

Microwave recognizes turtle builds in most games. That will be Steamhammer’s OverpoolTurtle opening, which builds as many sunkens at it can afford (2) without delaying mutalisks. It’s tough for bots to handle, because the build is safe on the ground while giving nothing away in the air. Microwave mainly preferred speed zergling openings in response, taking advantage of its superior zergling-on-zergling micro (which is not really a difference in micro as much as in engagement skills).

#8 daqin

opening	games	wins	first	last
1HatchMuta_Sparkle	62	90%	49	148
3HatchLingBust	17	65%	1	130
3HatchMuta	59	90%	0	149
3HatchMutaExpo	9	56%	12	48
3HatchPoolHydraExpo	1	0%	3	3
9Pool	1	0%	22	22
OverpoolLurker	1	0%	19	19
7 openings	150	83%

enemy	games	wins
HeavyRush -> HeavyRush	4	100%
HeavyRush -> SafeExpand	3	100%
HeavyRush -> Turtle	6	100%
HeavyRush -> Unknown	2	100%
NakedExpand -> Turtle	3	67%
SafeExpand -> NakedExpand	1	100%
SafeExpand -> SafeExpand	2	100%
SafeExpand -> Turtle	5	100%
Turtle -> HeavyRush	20	85%
Turtle -> NakedExpand	16	100%
Turtle -> Proxy	1	0%
Turtle -> SafeExpand	16	62%
Turtle -> Turtle	55	85%
Turtle -> Unknown	16	62%
14 openings	150	83%

#9 zzzkbot

opening	games	wins	first	last
OverpoolSpeed	147	95%	2	149
ZvZ_Overgas11Pool	3	0%	0	3
2 openings	150	93%

enemy	games	wins
FastRush -> FastRush	96	94%
FastRush -> Turtle	1	100%
FastRush -> Unknown	51	96%
Turtle -> FastRush	2	0%
4 openings	150	93%

It looks like ZZZKBot played its 4 pool about 2/3 of the time, and the rest of the time did something that Microwave could not recognize. But no matter, Microwave played overpool nearly all the time, fast enough to stop the rush and, in Microwave’s hands, flexible enough to counter ZZZKBot’s other builds.

#10 ualbertabot

opening	games	wins	first	last
1HatchMuta_Sparkle	2	0%	123	143
3HatchHydraExpo	1	0%	91	91
4PoolSoft	51	75%	0	147
5Pool	7	57%	44	133
5PoolSpeed	22	68%	3	146
7PoolHydraLingRush7D	1	0%	79	79
7PoolHydraRush7D	1	0%	50	50
8PoolHydraRush8D	10	50%	34	85
9PoolGasHatchSpeed8D	2	50%	27	28
9PoolSunkHatch	9	56%	113	135
OverpoolSunken	15	53%	102	148
ZvP_10Hatch9Pool	27	56%	21	120
ZvZ_Overpool11Gas	1	0%	82	82
13 openings	149	61%

enemy	games	wins
Factory -> FastRush	2	50%
Factory -> HeavyRush	2	0%
Factory -> NakedExpand	1	100%
Factory -> Unknown	2	0%
FastRush -> Factory	3	100%
FastRush -> FastRush	4	75%
FastRush -> HeavyRush	9	67%
FastRush -> NakedExpand	2	100%
FastRush -> Unknown	3	67%
HeavyRush -> Factory	2	100%
HeavyRush -> FastRush	22	64%
HeavyRush -> HeavyRush	49	41%
HeavyRush -> NakedExpand	9	100%
HeavyRush -> Unknown	22	68%
NakedExpand -> Factory	4	100%
NakedExpand -> FastRush	2	50%
NakedExpand -> HeavyRush	5	40%
NakedExpand -> NakedExpand	3	100%
NakedExpand -> Unknown	2	100%
Unknown -> HeavyRush	1	100%
20 openings	149	61%

Compare this to UAlbertaBot’s table. Microwave did not do perfectly against any UAlbertaBot race, and suffered badly against the zealot rush. Microwave had neither a universal build that works against all UAlbertaBot plays (which is how Steamhammer succeeded against UAlbertaBot), nor was it able to adapt its build well enough to counter what it saw (compare ZZZKBot above). Still, it found that 4 pool and 5 pool were not bad! Fight fire with fire.

#11 willyt

opening	games	wins	first	last
10Hatch9Pool9gas	15	73%	1	143
11HatchTurtleLurker	6	17%	26	74
11HatchTurtleMuta	4	25%	25	73
12PoolMain	3	33%	121	149
12PoolMuta	2	100%	64	80
2HatchMuta_Sparkle	1	0%	36	36
3HatchExpo	1	0%	77	77
3HatchHydra	1	0%	58	58
3HatchLurker	2	0%	57	141
3HatchMuta	4	75%	76	146
3HatchMutaExpo	8	50%	56	145
9Hatch9Pool9Gas	13	77%	123	144
9PoolExpo	40	78%	9	147
9PoolGasHatchSpeed8D	4	100%	53	142
9PoolHydra	1	0%	91	91
9PoolLurker	10	40%	5	115
9PoolSpeed	21	81%	0	148
9PoolSunkHatch	4	25%	3	43
9PoolSunken	9	78%	51	103
ZvZ_Overgas11Pool	1	0%	84	84
20 openings	150	65%

enemy	games	wins
Factory -> Factory	1	100%
Factory -> NakedExpand	4	100%
Factory -> SafeExpand	1	0%
Factory -> Unknown	4	100%
HeavyRush -> Factory	2	50%
HeavyRush -> HeavyRush	4	100%
HeavyRush -> NakedExpand	5	100%
HeavyRush -> SafeExpand	4	25%
HeavyRush -> Unknown	2	50%
NakedExpand -> Factory	8	62%
NakedExpand -> HeavyRush	10	50%
NakedExpand -> NakedExpand	43	100%
NakedExpand -> SafeExpand	13	46%
NakedExpand -> Unknown	31	23%
SafeExpand -> Factory	2	50%
SafeExpand -> NakedExpand	7	100%
SafeExpand -> SafeExpand	6	33%
SafeExpand -> Unknown	3	0%
18 openings	150	65%

#12 ecgberht

opening	games	wins	first	last
2HatchHydra	147	88%	0	149
9PoolLurker	3	67%	5	25
2 openings	150	88%

enemy	games	wins
Factory -> Factory	1	100%
Factory -> NakedExpand	8	100%
Factory -> SafeExpand	1	0%
Factory -> Unknown	6	100%
FastRush -> Factory	1	100%
FastRush -> Unknown	2	100%
HeavyRush -> Factory	4	100%
HeavyRush -> FastRush	2	50%
HeavyRush -> HeavyRush	1	100%
HeavyRush -> NakedExpand	4	100%
HeavyRush -> SafeExpand	1	100%
HeavyRush -> Unknown	1	100%
NakedExpand -> Factory	13	100%
NakedExpand -> FastRush	4	75%
NakedExpand -> HeavyRush	16	88%
NakedExpand -> NakedExpand	35	100%
NakedExpand -> SafeExpand	2	50%
NakedExpand -> Unknown	45	73%
SafeExpand -> Factory	1	100%
SafeExpand -> HeavyRush	1	100%
SafeExpand -> Unknown	1	100%
21 openings	150	88%

#13 eggbot

opening	games	wins	first	last
9Pool	150	100%	0	149
1 openings	150	100%

enemy	games	wins
Proxy -> Turtle	2	100%
Turtle -> Proxy	15	100%
Turtle -> Turtle	84	100%
Turtle -> Unknown	49	100%
4 openings	150	100%

Microwave did not understand how to recognize EggBot’s cannon play, but it knew from training how to win.

AIIDE 2020 - what Microwave learned 1

I’ll cover Microwave over two days because it writes two files for each opponent, a “results” file giving wins/losses for each strategy and a “history” file of more detailed game records. Each summary is bulky in itself, and I don’t want to pile them up. The history file has all the information in the results file and more. In fact, a quick look at Microwave’s code says that it no longer reads the results file at all, but reconstructs its contents from the history file each game. But different presentations of the data have value in themselves; this view makes it easy to read across the columns and see where a given opening was effective.

Today is the results file, the table of strategies versus each opponent. Wow, that’s a lot of opening builds! I count 73, less than half as many as Steamhammer but still too large a number to explore in a tournament of 150 rounds. I think only bots with combinatorial strategies have more. The numbers include not only games played during the tournament, but also Microwave’s prepared data for each opponent, so they add up to more than 150 games versus each opponent. You can compare the overall win rates per opponent to see which ones Microwave was more successful against in training as opposed to in the tournament—it may indicate whether the opponent was updated for the tournament and became stronger than Microwave expected. In general, for stronger opponents training data overestimated Microwave’s success, while for weaker opponents it was the opposite (that is, the training uncovered mistakes that Microwave could then avoid).

	total	#1 stardust	#2 purplewave	#3 bananabrain	#4 dragon	#5 mcrave	#7 steamhammer	#8 daqin	#9 zzzkbot	#10 ualbertabot	#11 willyt	#12 ecgberht	#13 eggbot
10Hatch9Pool9gas	28-48 37%	0-4 0%	0-12 0%	3-19 14%	-	0-1 0%	4-5 44%	-	-	-	21-7 75%	-	-
10HatchMain9Pool9Gas	2-8 20%	0-1 0%	-	0-1 0%	-	1-3 25%	1-3 25%	-	-	-	-	-	-
10HatchTurtleHydra	0-3 0%	-	-	-	0-1 0%	0-1 0%	0-1 0%	-	-	-	-	-	-
11HatchTurtleHydra	0-11 0%	-	0-10 0%	0-1 0%	-	-	-	-	-	-	-	-	-
11HatchTurtleLurker	11-17 39%	-	0-1 0%	-	0-2 0%	0-1 0%	0-1 0%	-	-	-	11-12 48%	-	-
11HatchTurtleMuta	4-15 21%	-	0-7 0%	0-2 0%	-	-	0-1 0%	-	-	-	4-5 44%	-	-
12Hatch	0-3 0%	-	-	0-2 0%	-	0-1 0%	-	-	-	-	-	-	-
12HatchMain	0-4 0%	0-2 0%	0-1 0%	-	-	-	0-1 0%	-	-	-	-	-	-
12Pool	35-51 41%	0-1 0%	-	-	-	9-17 35%	26-33 44%	-	-	-	-	-	-
12PoolMain	25-34 42%	0-1 0%	-	22-21 51%	0-1 0%	0-2 0%	2-7 22%	-	-	-	1-2 33%	-	-
12PoolMuta	7-20 26%	0-2 0%	2-9 18%	0-1 0%	-	-	-	-	-	-	5-8 38%	-	-
1HatchMuta_Sparkle	56-9 86%	-	-	0-1 0%	-	-	-	56-6 90%	-	0-2 0%	-	-	-
2HatchHydra	161-24 87%	0-1 0%	0-2 0%	-	-	-	-	0-1 0%	-	-	-	161-20 89%	-
2HatchLurker	0-8 0%	0-1 0%	0-1 0%	0-2 0%	-	0-1 0%	0-2 0%	-	-	-	-	0-1 0%	-
2HatchLurkerAllIn	0-2 0%	0-1 0%	0-1 0%	-	-	-	-	-	-	-	-	-	-
2HatchMuta	74-59 56%	1-14 7%	0-1 0%	3-9 25%	60-33 65%	0-1 0%	-	-	-	-	-	-	10-1 91%
2HatchMuta_Sparkle	0-1 0%	-	-	-	-	-	-	-	-	-	0-1 0%	-	-
3Hatch	0-5 0%	0-3 0%	-	-	-	0-2 0%	-	-	-	-	-	-	-
3HatchExpo	0-3 0%	0-2 0%	-	-	-	-	-	-	-	-	0-1 0%	-	-
3HatchHydra	0-2 0%	0-1 0%	-	-	-	-	-	-	-	-	0-1 0%	-	-
3HatchHydraBust	0-14 0%	0-7 0%	0-2 0%	0-2 0%	-	0-1 0%	0-1 0%	0-1 0%	-	-	-	-	-
3HatchHydraExpo	1-5 17%	-	-	-	1-2 33%	-	0-2 0%	-	-	0-1 0%	-	-	-
3HatchHydra_BHG	0-4 0%	0-1 0%	0-1 0%	0-1 0%	-	0-1 0%	-	-	-	-	-	-	-
3HatchLingBust	36-41 47%	2-20 9%	-	2-6 25%	-	-	-	32-15 68%	-	-	-	-	-
3HatchLurker	0-4 0%	-	-	-	-	0-1 0%	-	-	-	-	0-3 0%	-	-
3HatchMuta	90-106 46%	7-58 11%	3-9 25%	3-19 14%	-	0-1 0%	-	72-14 84%	-	-	5-5 50%	-	-
3HatchMutaExpo	48-64 43%	0-1 0%	1-25 4%	0-1 0%	32-22 59%	0-1 0%	-	9-7 56%	-	-	6-7 46%	-	-
3HatchPoolHydra	1-24 4%	1-15 6%	0-2 0%	-	-	0-2 0%	0-2 0%	0-3 0%	-	-	-	-	-
3HatchPoolHydraExpo	3-12 20%	0-1 0%	-	0-1 0%	-	3-9 25%	-	0-1 0%	-	-	-	-	-
4HatchBeforeGas	0-10 0%	0-5 0%	0-1 0%	0-3 0%	-	0-1 0%	-	-	-	-	-	-	-
4HatchPoolHydra	4-25 14%	0-2 0%	0-1 0%	2-15 12%	2-6 25%	-	0-1 0%	-	-	-	-	-	-
4PoolHard	3-13 19%	0-1 0%	1-7 12%	2-4 33%	-	-	-	-	-	-	0-1 0%	-	-
4PoolSoft	61-44 58%	0-3 0%	0-2 0%	0-2 0%	7-14 33%	0-3 0%	-	-	-	54-17 76%	0-3 0%	-	-
5HatchPoolHydra	5-28 15%	2-26 7%	-	-	3-2 60%	-	-	-	-	-	-	-	-
5Pool	7-17 29%	0-2 0%	-	-	-	0-1 0%	3-10 23%	0-1 0%	-	4-3 57%	-	-	-
5PoolSpeed	29-18 62%	-	-	-	0-1 0%	0-1 0%	1-2 33%	-	-	28-14 67%	-	-	-
6Pool	0-6 0%	0-2 0%	0-1 0%	0-2 0%	-	-	0-1 0%	-	-	-	-	-	-
6PoolSpeed	4-14 22%	0-4 0%	3-7 30%	-	-	1-3 25%	-	-	-	-	-	-	-
7Pool	0-6 0%	0-1 0%	0-3 0%	0-1 0%	-	-	0-1 0%	-	-	-	-	-	-
7PoolHydraLingRush7D	0-5 0%	-	0-1 0%	-	0-1 0%	0-1 0%	0-1 0%	-	-	0-1 0%	-	-	-
7PoolHydraRush7D	0-2 0%	-	-	-	-	-	-	0-1 0%	-	0-1 0%	-	-	-
8Pool	0-2 0%	-	-	0-1 0%	-	0-1 0%	-	-	-	-	-	-	-
8PoolHydraRush8D	5-8 38%	-	0-2 0%	0-1 0%	-	-	-	-	-	5-5 50%	-	-	-
9Hatch9Pool9Gas	10-12 45%	0-1 0%	0-6 0%	0-1 0%	-	-	0-1 0%	-	-	-	10-3 77%	-	-
9HatchMain8Pool8Gas	2-8 20%	0-1 0%	2-6 25%	-	-	0-1 0%	-	-	-	-	-	-	-
9HatchTurtleHydra	0-3 0%	0-1 0%	0-1 0%	-	-	-	0-1 0%	-	-	-	-	-	-
9Pool	183-11 94%	0-4 0%	0-3 0%	-	-	0-1 0%	-	0-1 0%	-	-	0-1 0%	0-1 0%	183-0 100%
9PoolExpo	31-9 78%	-	-	-	-	-	-	-	-	-	31-9 78%	-	-
9PoolGasHatchSpeed7D	18-19 49%	0-1 0%	18-18 50%	-	-	-	-	-	-	-	-	-	-
9PoolGasHatchSpeed8D	21-29 42%	0-4 0%	-	12-6 67%	-	0-2 0%	0-1 0%	0-3 0%	-	1-1 50%	8-12 40%	-	-
9PoolHatch	0-3 0%	-	-	-	-	-	0-2 0%	-	0-1 0%	-	-	-	-
9PoolHatchGasSpeed7D	11-8 58%	0-1 0%	11-6 65%	0-1 0%	-	-	-	-	-	-	-	-	-
9PoolHatchGasSpeed8D	36-50 42%	1-16 6%	5-6 45%	30-24 56%	0-1 0%	0-2 0%	0-1 0%	-	-	-	-	-	-
9PoolHydra	0-3 0%	0-1 0%	-	-	-	-	-	0-1 0%	-	-	0-1 0%	-	-
9PoolLurker	15-14 52%	0-2 0%	-	-	-	-	-	-	-	-	6-8 43%	9-4 69%	-
9PoolSpeed	68-71 49%	0-3 0%	1-4 20%	0-4 0%	-	13-16 45%	24-21 53%	0-2 0%	-	-	30-21 59%	-	-
9PoolSpeedLing	2-22 8%	0-3 0%	0-3 0%	2-7 22%	-	0-3 0%	0-2 0%	0-3 0%	-	-	0-1 0%	-	-
9PoolSunkHatch	6-12 33%	0-2 0%	0-1 0%	0-1 0%	0-1 0%	-	-	-	-	5-4 56%	1-3 25%	-	-
9PoolSunken	9-15 38%	0-1 0%	0-1 0%	-	-	0-1 0%	2-10 17%	-	-	-	7-2 78%	-	-
Overpool	0-4 0%	0-1 0%	-	0-1 0%	0-1 0%	0-1 0%	-	-	-	-	-	-	-
OverpoolLurker	0-4 0%	0-1 0%	-	-	0-1 0%	-	-	0-1 0%	-	-	0-1 0%	-	-
OverpoolSpeed	187-60 76%	0-1 0%	1-6 14%	0-3 0%	-	27-37 42%	1-3 25%	-	158-9 95%	-	0-1 0%	-	-
OverpoolSunken	8-7 53%	-	-	-	-	-	-	-	-	8-7 53%	-	-	-
OverpoolTurtle	1-13 7%	0-1 0%	1-11 8%	-	0-1 0%	-	-	-	-	-	-	-	-
ZvP_10Hatch9Pool	17-30 36%	0-5 0%	0-8 0%	2-5 29%	-	-	-	-	-	15-12 56%	-	-	-
ZvP_11Hatch10Pool	2-12 14%	0-2 0%	0-7 0%	0-1 0%	-	-	2-2 50%	-	-	-	-	-	-
ZvP_2HatchHydra	2-28 7%	0-3 0%	0-9 0%	0-4 0%	-	0-2 0%	2-10 17%	-	-	-	-	-	-
ZvP_9Hatch9Pool	13-34 28%	0-2 0%	0-10 0%	0-1 0%	-	13-18 42%	0-1 0%	0-2 0%	-	-	-	-	-
ZvZ_Overgas11Pool	30-24 56%	0-3 0%	-	-	-	-	13-15 46%	-	17-5 77%	-	0-1 0%	-	-
ZvZ_Overgas9Pool	0-5 0%	0-2 0%	-	0-1 0%	-	-	0-2 0%	-	-	-	-	-	-
ZvZ_Overpool11Gas	5-21 19%	0-2 0%	-	0-2 0%	-	5-13 28%	0-3 0%	-	-	0-1 0%	-	-	-
ZvZ_Overpool9Gas	2-13 13%	0-2 0%	0-1 0%	-	-	1-7 12%	1-3 25%	-	-	-	-	-	-
ZvZ_OverpoolTurtle	0-1 0%	0-1 0%	-	-	-	-	-	-	-	-	-	-	-
total	- 51%	14-250 5%	49-216 18%	83-180 32%	105-90 54%	73-161 31%	82-153 35%	169-63 73%	175-15 92%	120-69 63%	146-120 55%	170-26 87%	193-1 99%

Microwave explored widely against top opponents, and concentrated efficiently on a few winning openings against weaker ones. On the other hand, although there is a flag in the configuration file named PlayGoodStrategiesFirst (turned on), Microwave seems to have little idea which strategies are most likely to work. Versus DaQin, 1 hatch mutalisk and 3 hatch mutalisk are successful, but the most natural 2 hatch muta is never tried. Of course that’s a widespread weakness among bots.

The 3 hatch muta strategies were relatively successful overall. That’s interesting.

AIIDE 2020 - what UAlbertaBot learned

Though UAlbertaBot has been surpassed over the years and become a low-end bot, we can still gain insight from its experience. The table summarizes the contents of its learning files. Last year this table had the bots down the left and the strategies across the top, but this year I turned it on its side—I am looking ahead to the table for Microwave, which has many strategies.

Some of the numbers here are slightly different from those in the official crosstable, because of games where UAlbertaBot did not record a result (no doubt due to crashes).

	total	#1 stardust	#2 purplewave	#3 bananabrain	#4 dragon	#5 mcrave	#6 microwave	#7 steamhammer	#8 daqin	#9 zzzkbot	#11 willyt	#12 ecgberht	#13 eggbot
4RaxMarines	33-120 22%	2-18 10%	1-6 14%	0-13 0%	1-8 11%	7-12 37%	3-14 18%	0-9 0%	0-6 0%	0-10 0%	17-19 47%	2-5 29%	-
MarineRush	66-140 32%	0-10 0%	0-5 0%	1-18 5%	6-16 27%	8-12 40%	4-12 25%	4-23 15%	0-5 0%	0-10 0%	0-5 0%	6-9 40%	37-15 71%
TankPush	17-105 14%	0-10 0%	5-18 22%	0-12 0%	0-5 0%	0-3 0%	0-5 0%	0-9 0%	8-23 26%	0-9 0%	4-8 33%	0-3 0%	-
VultureRush	17-82 17%	0-10 0%	0-5 0%	0-12 0%	0-5 0%	0-3 0%	0-5 0%	0-9 0%	0-5 0%	0-9 0%	0-3 0%	17-16 52%	-
DTRush	19-119 14%	1-22 4%	0-13 0%	1-13 7%	11-26 30%	0-16 0%	-	0-14 0%	6-13 32%	-	0-2 0%	-	-
DragoonRush	13-117 10%	0-16 0%	0-13 0%	0-9 0%	0-12 0%	2-28 7%	-	1-20 5%	10-17 37%	-	0-2 0%	-	-
ZealotRush	185-170 52%	0-15 0%	2-24 8%	4-20 17%	0-12 0%	0-16 0%	37-13 74%	1-20 5%	0-5 0%	48-11 81%	25-18 58%	28-15 65%	40-1 98%
2HatchHydra	9-83 10%	0-12 0%	2-21 9%	1-6 14%	6-13 32%	0-11 0%	0-6 0%	0-8 0%	0-2 0%	0-1 0%	0-3 0%	-	-
3HatchMuta	0-63 0%	0-12 0%	0-12 0%	0-4 0%	0-4 0%	0-11 0%	0-6 0%	0-8 0%	0-2 0%	0-1 0%	0-3 0%	-	-
3HatchScourge	0-59 0%	0-11 0%	0-11 0%	0-4 0%	0-4 0%	0-10 0%	0-6 0%	0-7 0%	0-2 0%	0-1 0%	0-3 0%	-	-
ZerglingRush	195-168 54%	0-11 0%	0-11 0%	11-21 34%	5-11 31%	0-10 0%	14-23 38%	1-11 8%	22-23 49%	29-18 62%	21-17 55%	39-10 80%	53-2 96%
total	- 31%	3-147 2%	10-139 7%	18-132 12%	29-116 20%	17-132 11%	58-90 39%	7-138 5%	46-103 31%	77-70 52%	67-83 45%	92-58 61%	130-18 88%

Random UAlbertaBot starts off with its default strategies of marine rush, zealot rush, or zergling rush, and tries alternatives only if the strategy scores poorly. The table shows that the default strategies chosen years ago are still the best choices. The zealot rush even scored well against #6 Microwave. Also constant over the years is that the 3 hatch scourge build, which was designed to counter the carrier bot XIMP, has no other use; UAlbertaBot would have done better without it.

It’s curious that UAlbertaBot’s overall weakest race is terran, but that its terran scored best against many stronger opponents: Terran was UAlbertaBot’s happiest roll versus #7 Steamhammer, #5 McRave, #2 PurpleWave, and #1 Stardust. But these stronger opponents allowed few wins. #1 Stardust (2%), #7 Steamhammer (5%), and #2 PurpleWave (7%) shut down UAlbertaBot hard.

If your bot is ranked above UAlbertaBot, then pink or blue boxes suggest weaknesses that you might benefit from working on. If a weaker bot beats you this way, presumably a stronger one can too. UAlbertaBot benefits from its random race and the big differences between its strategies, so maybe something went wrong in your scouting or reactions. #6 Microwave had trouble with zealots, #5 McRave had trouble with marines, and #4 Dragon had some trouble with 4 different rushes.

AIIDE 2020 - what bots wrote data

I looked in each bot’s final write directory to see what files it wrote, if any, and in its AI directory to see if it had prepared data for any opponents. Standard disclaimers apply: A bot does not necessarily use the data it writes. Preparation for specific opponents is not necessarily in the form of data in the AI directory, it might be in code.

#	bot	info
1	Stardust	Nothing. Stardust relies on its great execution.
2	PurpleWave	The learning files have a sequence of PurpleWave’s strategy choices followed by a sequence of “fingerprinted” enemy strategies. (PurpleWave also has specific preparation for its opponents, but that’s in code rather than data.) There are also debug logs that show some decisions, but are probably only for the author.
3	BananaBrain	The learning files look just like last year’s: One file for each opponent in the form of brief records of results. Each record consists of date+time, map, BananaBrain’s strategy (“PvZ_9/9proxygate”), the opponent’s recognized strategy (“Z_9pool”), a floating point number which we were told last year is the game duration in minutes, and the game result. Pre-learned data for 6 opponents, with the largest file by far for Stardust. Maybe if you have pegged your opponent as having a narrow range of adaptation, you don’t have to leave room for surprises.
4	Dragon	Very simple game records with strategy and game result, like `"siege expand" won`.
5	McRave	Two files for each opponent, named like `ZvU UAlbertaBot.txt` and `ZvU UAlbertaBot Info.txt`. The first file is short and counts wins and losses overall and for each of McRave’s strategies. The info file (now working correctly, unlike last year) has detailed game records with aspects of the opponent’s strategy (`2Gate,Main,ZealotRush`), McRave’s strategy at 3 levels of abstraction (`PoolHatch,Overpool,2HatchMuta`), timings, and unit counts. I want to look more closely at the game records and see how they are used (maybe they are only logs for the author).
6	Microwave	Result and history files for each opponent that look similar to last year’s. The result files count wins and losses for each Microwave strategy, and no longer limit the counts to 10—apparently Microwave no longer deliberately forgets history. The history files have a one-line record of data about each game and look the same as last year. Also pre-learned history files for all 12 opponents.
7	Steamhammer	Steamhammer’s learning file format is documented here.
8	DaQin	Carried over from last year. Learning files straight from its parent Locutus (very similar to the old format Steamhammer files). There is no visible pre-learned data (in a quick check I also found no opponent-specific code).
9	ZZZKBot	Learning files for each opponent that look the same as last year, with detailed but hard-to-interpret information about each game.
10	UAlbertaBot	Carried over from past years. For each opponent, a file listing strategies with win and loss counts for each.
11	WillyT	A single log file with 150 lines apparently giving data for 150 games against various opponents. Each line looks like `20201009,Ecgberht,T,01,0`. The items look like date, opponent, opponent race, a number 01 02 or 03, and win/loss. There were 150 rounds in the tournament, so maybe this is a log of one game per round—the dates seem to back that up, but if so, how is the single game chosen? Is it the last one played? This is either broken, or else it is doing something I can’t fathom.
12	Ecgberht	Two files for each opponent, named like `Dragon_Terran.json` and `Dragon_Terran-History.json`. The plain file counts wins and losses of each of Ecgberht’s strategies separately for each map size (number of starting locations, 2 3 or 4). (The map size breakdown is similar to AIUR’s.) There is also an overall win/loss count, plus flags named `naughty` and `defendHarass`. Of all bots in the tournament, only ZZZKBot is flagged `naughty`, so maybe it means the opponent likes fast rushes. `defendHarass` tells whether the opponent defends its workers if Ecgberht’s scouting SCV attacks them (that way it can exploit weak opponents without risking its SCV against prepared ones). The history file is a list of game records, giving opponent name, opponent race, game outcome, Ecgberht’s strategy, the map, and the opponent’s recognized strategy (which is often `Unknown`).
13	EggBot	Nothing. EggBot is the only entrant other than Stardust to record no data.

In recent years, nearly all top bots have relied on opening learning to adapt to their opponents. The strongest bot without learning was Iron, which came in #1 in AIIDE 2016 and slipped down the ranks until it fell to #8 in AIIDE 2019, scoring under 50%. Stardust is the only high finisher since then to get by without. Stardust plays with a restricted set of units, only zealots and dragoons with observers as needed. On the one hand, that shows the value of specializing and becoming extremely skilled at the most important aspects of the game (the opposite of Steamhammer’s development strategy). On the other hand, it points out how much headroom all bots have to improve.