strategy learning - 4 | Starcraft AI blog

RPS analyzer and game solver

Regardless of any other adaptation skills you may have, if you can predict your enemy’s opening build, you can better counter it. But it’s not simple. Steamhammer tries to distinguish between opponents that play a fixed build, and those that vary their openings. For those that vary their openings, there is much more: Some choose randomly. Some like to repeat the opening that won the last game, switching only on loss. Some stick with an opening that wins more than a given percentage. Some try openings systematically, “A didn’t work, B is next.” Some choose randomly with some probability distribution over the more successful choices. Sometimes openings are treated as black boxes distinguished only by their names, sometimes as strategies which are understood to counter other strategies (Steamhammer does both at different times).

I am wondering whether it makes sense to write a rock-paper-scissors analyzer that tries to tease out exploitable patterns in the opponent’s behavior (there are established techniques), and combine it with a game solver to make better initial opening choices. On the one hand, many bots have exploitable patterns that I know about. If an RPS analyzer can find the patterns too, Steamhammer might seem to gain the mysterious “star sense” to always play the right thing for no visible reason. On the other hand, it’s relatively easy to reduce your exploitability to a low level—use randomness wisely. Also, as Steamhammer gains skills to adapt its strategy during the game, the initial opening choices make less difference. The gain might be little. And by trying to exploit patterns, Steamhammer could itself become more exploitable; it might backfire.

The parts of the system would be:

1. Classify the enemy build. Steamhammer already does this, though it needs improvement.

2. Statistically analyze the sequence of (win/loss, my opening, your opening) under the assumption that the opponent is trying to counter what we’re doing. Knowing what-counters-what may factor in. The output should be a probability distribution over opening classes, “what are they likely to do?”

3. Knowing what-counters-what definitely factors in here: Solve the game. We start with a prior probability of winning for each of our openings against each opening class the opponent might play, and thanks to Bayes we can update it as we learn about the opponent. That gives us a game matrix with uncertainties in the payoffs. (Since Steamhammer knows a huge number of opening builds, making the game matrix too big to fill in, I would classify Steamhammer’s openings too so that the output only decides which class of opening to play.) Without an RPS analyzer, we can solve the game (I expect I would use a Monte Carlo method to handle the uncertainties) and play not far from a Nash equilibrium (i.e., almost perfectly assuming unexploitable play from the opponent). If an RPS analyzer can make a good estimate of the opponent’s plans, in the best case we can do better: We can exploit the opponent’s deviation from the Nash equilibrium to win more than if we played to a Nash equilibrium ourselves.

It’s unclear to me whether either the RPS analyzer or the game solver is worth the effort. Does anybody have an opinion? Perhaps some bot I haven’t looked at has similar ideas?

AIIDE 2019 - what Microwave did

Here’s data from Microwave’s history files, using the same script as for BananaBrain with a little customization. Unlike Microwave’s learning files, which deliberately omit data and include information from pre-learning, the history files tell what Microwave actually did during the games. Microwave didn’t record information about the opponent’s strategy, so that table is left out. That made it look a little sparse, so I added columns giving the first and last games when the opening was tried, where the first game in the history file is game 0. We can see things like when a winning opening was found, and whether it kept winning. If there are fewer than 100 games recorded for an opponent because Microwave crashed, then the game numbers generally do not align with the tournament round numbers.

Against difficult opponents, Microwave experimented widely. Against some opponents that Microwave pre-trained against, it played whatever came out of pre-training. So I don’t have much to say about opponents in the top half of the post. But toward the bottom I’ve made some comments. Especially see the note to AITP.

#1 locutus

opening	games	wins	first	last
10Hatch9Pool9gas	8	12%	1	52
2HatchHydra	7	0%	0	53
2HatchLurker	7	29%	83	89
2HatchLurkerAllIn	2	0%	63	90
2HatchMuta	12	25%	3	56
3HatchHydraBust	3	0%	10	57
3HatchLingBust	3	0%	38	91
3HatchPoolHydra	5	0%	16	92
4HatchBeforeGas	4	0%	27	93
4PoolHard	3	0%	15	58
4PoolSoft	4	0%	21	59
5Pool	2	0%	36	60
5PoolSpeed	3	0%	41	94
6Pool	3	0%	42	95
6PoolSpeed	3	0%	43	96
7Pool	2	0%	37	61
8Pool	3	0%	44	97
9Pool	9	22%	45	78
9PoolLurker	2	0%	46	79
9PoolSpeed	3	0%	11	62
9PoolSpeedLing	2	0%	47	80
ZvP_10Hatch9Pool	4	0%	17	81
ZvZ_Overpool11Gas	4	0%	18	82
23 openings	98	8%

#2 purplewave

opening	games	wins	first	last
10Hatch9Pool9gas	11	9%	20	93
2HatchHydra	6	0%	14	87
2HatchMuta	5	0%	35	94
3HatchHydraBust	9	0%	3	95
3HatchLingBust	14	7%	0	74
4PoolHard	1	0%	80	80
4PoolSoft	7	14%	30	75
5Pool	8	12%	15	90
5PoolSpeed	1	0%	81	81
6Pool	1	0%	82	82
6PoolSpeed	1	0%	83	83
7Pool	8	0%	17	76
8Pool	4	0%	42	91
9Pool	1	0%	84	84
9PoolSpeed	3	0%	52	92
9PoolSpeedLing	14	21%	4	77
ZvP_10Hatch9Pool	1	0%	85	85
ZvZ_Overpool11Gas	1	0%	86	86
18 openings	96	7%

#3 bananabrain

opening	games	wins	first	last
10Hatch9Pool9gas	1	0%	54	54
2HatchHydra	1	0%	51	51
2HatchMuta	1	0%	52	52
3HatchLingBust	37	49%	0	92
4PoolHard	3	0%	29	63
4PoolSoft	4	0%	28	67
5Pool	11	45%	22	76
5PoolSpeed	7	29%	19	78
6Pool	1	0%	62	62
6PoolSpeed	5	20%	20	68
7Pool	1	0%	55	55
8Pool	3	0%	24	69
9Pool	7	43%	56	70
9PoolSpeed	1	0%	53	53
9PoolSpeedLing	3	0%	25	71
ZvZ_Overgas9Pool	4	0%	26	77
ZvZ_Overpool11Gas	3	0%	35	79
17 openings	93	31%

#4 daqin

opening	games	wins	first	last
10Hatch9Pool9gas	11	18%	2	77
2HatchHydra	4	0%	18	78
2HatchLurker	4	0%	23	79
2HatchMuta	13	23%	17	89
3HatchHydraBust	3	0%	20	51
3HatchLingBust	31	39%	16	76
3HatchPoolHydra	3	0%	25	52
4PoolSoft	3	0%	6	53
5Pool	3	0%	7	54
7Pool	3	0%	11	55
9Pool	3	0%	1	56
9PoolSpeed	3	0%	10	57
9PoolSpeedLing	3	0%	0	58
ZvP_10Hatch9Pool	3	0%	5	59
14 openings	90	19%

#5 steamhammer

opening	games	wins	first	last
9PoolSpeed	100	75%	0	99
1 openings	100	75%

#6 zzzkbot

opening	games	wins	first	last
9PoolHatch	1	0%	0	0
ZvZ_Overgas11Pool	70	80%	1	70
2 openings	71	79%

Why are only 71 games recorded? According to the official results, Microwave crashed in 56 games throughout the tournament, and 29 of those crashes happened against ZZZKBot. Microwave recorded every game in which it did not crash. It’s a debugging opportunity. :-/

#8 iron

opening	games	wins	first	last
10Hatch9Pool9gas	2	0%	53	82
2HatchHydra	1	0%	83	83
2HatchLurkerAllIn	2	0%	63	88
2HatchMuta	11	9%	0	72
3HatchHydraBust	15	33%	5	77
3HatchHydraExpo	1	0%	84	84
3HatchPoolHydra	1	0%	85	85
4HatchBeforeGas	4	0%	18	89
4PoolHard	6	0%	13	78
4PoolSoft	7	14%	11	71
5Pool	1	0%	86	86
5PoolSpeed	6	0%	14	79
6Pool	2	0%	54	87
6PoolSpeed	5	20%	35	92
7Pool	10	30%	19	68
8Pool	7	14%	17	80
9Pool	8	12%	1	95
9PoolSpeedLing	4	0%	21	96
OverpoolTurtle	4	0%	22	81
19 openings	97	13%

#9 xiaoyi

opening	games	wins	first	last
10Hatch9Pool9gas	2	0%	42	47
2HatchLurker	1	0%	48	48
2HatchMuta	2	0%	45	46
4PoolSoft	38	63%	1	38
5Pool	2	50%	0	39
7Pool	51	76%	49	99
9Pool	2	50%	40	41
9PoolSpeedLing	2	0%	43	44
8 openings	100	65%

As soon as Microwave found that 7 pool worked, it played 7 pool exclusively.

#10 mcrave

opening	games	wins	first	last
2HatchMuta	40	62%	0	79
3HatchHydraBust	13	92%	86	98
4PoolHard	1	0%	80	80
4PoolSoft	40	62%	1	40
9Pool	1	0%	85	85
ZvZ_Overgas11Pool	4	50%	81	84
6 openings	99	65%

Microwave was late to discover the success of the hydra bust opening. That’s why it was played so little. The example shows the importance of finding good ideas as early as possible. I am adding smarts to Steamhammer to make it better at finding the good tries fast.

It’s interesting that 2HatchMuta and 4PoolSoft have the same numbers, but were given up on at different times.

#11 ualbertabot

opening	games	wins	first	last
4PoolSoft	100	82%	0	99
1 openings	100	82%

The choice against UAlbertaBot was determined by pre-training. From scratch, I expect Microwave would have tried a wider variety.

#12 aitp

opening	games	wins	first	last
9PoolSpeedLing	100	93%	0	99
1 openings	100	93%

If the first try wins, keep it up. What if Microwave had an opening that would have won more than 93%? The theory is that, above some winning rate, the risk of losing by trying alternatives is higher than the risk of losing by sticking with a known good opening. But what winning rate is high enough to stick with? It depends on how much you respect your opponents. If you expect to win nearly every game, like Locutus, maybe you should switch to an alternative as soon as you lose a single game. If you expect to finish near the bottom, maybe you should stick with a strategy that wins 50%.

But more: How much do you respect each opponent? Maybe bots should have a “contempt factor” like chess programs may use to decide whether to aim for a draw: Accept a low winning rate strategy against Locutus, but demand 95% wins against the unknown who you’ve decided is a weak newbie. I would rather call it a respect factor! In a UCB algorithm, a level of respect is implicitly encoded in the exploration rate constant. Does any bot already have a respect factor for specific opponents?

#13 bunkerboxer

opening	games	wins	first	last
5Pool	100	99%	0	99
1 openings	100	99%

Apparently the initial choice against an unknown is random.

AIIDE 2019 - what BananaBrain learned

I wrote a script to analyze BananaBrain’s game history files, which record its experience with each opponent. For now, I had the script summarize the strategies played and the enemy strategies recognized. The history files also record the map and a value that represents the game duration. History files are rich with information, and there are many ways to summarize it. It would be interesting to see how strategy usage and win rate vary by map, among other possibilities.

The same script should work with minor changes to summarize Microwave’s history files.

BananaBrain had prepared history files for the opponents #1 Locutus, #2 PurpleWave, #5 Steamhammer, #6 ZZZKBot, #7 Microwave, and #8 Iron. Data from the prepared history files was not copied into the write directory. That is different from how Steamhammer and Locutus keep their game records, and it has the nice effect that the tables show exactly what happened in the tournament, from BananaBrain’s point of view.

For each opponent, the left table is BananaBrain’s choice. The right table is BananaBrain’s idea of what the opponent did. All the win rates are from BananaBrain’s point of view, so that, for example, when Locutus played P_1gatecore, BananaBrain won 5% of the time. Of course, the opponent’s view of its own strategy is likely to be more fine-grained than BananaBrain’s. To take the extreme case, Steamhammer played 30 different openings against BananaBrain, and BananaBrain recognized them in 8 categories.

#1 locutus

opening	games	wins
PvP_10/12gate	6	17%
PvP_12nexus	11	36%
PvP_2gatedt	10	0%
PvP_2gatedtexpo	9	0%
PvP_3gaterobo	5	0%
PvP_3gatespeedzeal	8	25%
PvP_4gategoon	6	0%
PvP_9/9gate	12	8%
PvP_9/9proxygate	9	0%
PvP_nzcore	8	12%
PvP_zcore	4	0%
PvP_zcorez	6	0%
PvP_zzcore	6	17%
13 openings	100	10%

enemy	games	wins
P_1gatecore	20	5%
P_cannonrush	29	7%
P_fastexpand	1	0%
P_ffe	19	21%
P_unknown	31	10%
5 openings	100	10%

As you might expect against Locutus, the best choice was a fast expansion.

Is the single game of enemy P_fastexpand a misrecognition? I suspect that Locutus played otherwise, and BananaBrain didn’t see everything and wasn’t able to draw the right conclusion. Or maybe it’s a bug somewhere. PurpleWave and McRave also show a single P_fastexpand game.

#2 purplewave

opening	games	wins
PvP_10/12gate	23	70%
PvP_12nexus	2	0%
PvP_2gatedt	6	17%
PvP_2gatedtexpo	3	33%
PvP_3gaterobo	2	0%
PvP_3gatespeedzeal	1	0%
PvP_4gategoon	8	38%
PvP_9/9gate	26	88%
PvP_9/9proxygate	13	62%
PvP_nzcore	3	0%
PvP_zcore	4	25%
PvP_zcorez	5	40%
PvP_zzcore	4	25%
13 openings	100	56%

enemy	games	wins
P_1gatecore	54	56%
P_2gate	25	60%
P_2gatefast	6	33%
P_fastexpand	1	0%
P_ffe	2	50%
P_unknown	12	67%
6 openings	100	56%

Against PurpleWave, different zealot rushes worked best. Maybe it is because zealot rushes depend for their success more on execution than on the enemy’s strategic reaction. PurpleWave is particularly good at reacting to the enemy strategy, and BananaBrain is good at execution.

#4 daqin

opening	games	wins
PvP_10/12gate	8	62%
PvP_12nexus	6	33%
PvP_2gatedt	6	17%
PvP_2gatedtexpo	12	83%
PvP_3gaterobo	7	14%
PvP_3gatespeedzeal	6	33%
PvP_4gategoon	5	0%
PvP_9/9gate	14	93%
PvP_9/9proxygate	9	67%
PvP_nzcore	7	43%
PvP_zcore	6	33%
PvP_zcorez	7	43%
PvP_zzcore	7	43%
13 openings	100	51%

enemy	games	wins
P_1gatecore	82	50%
P_unknown	18	56%
2 openings	100	51%

BananaBrain made quite a variety of tries, and was most successful with... zealot rush and dark templars, which are kind of different. BananaBrain’s varied opening choice is a strength.

#5 steamhammer

opening	games	wins
PvZ_10/12gate	15	100%
PvZ_1basespeedzeal	8	88%
PvZ_2basespeedzeal	11	82%
PvZ_4gate2archon	7	57%
PvZ_5gategoon	7	86%
PvZ_9/9gate	12	92%
PvZ_9/9proxygate	15	100%
PvZ_bisu	4	75%
PvZ_neobisu	2	50%
PvZ_sairdt	7	100%
PvZ_sairgoon	2	0%
PvZ_stove	10	70%
12 openings	100	85%

enemy	games	wins
Z_10hatch	38	76%
Z_12hatch	31	84%
Z_12pool	11	91%
Z_4/5pool	3	100%
Z_9pool	1	100%
Z_9poolspeed	4	100%
Z_overpool	2	100%
Z_unknown	10	100%
8 openings	100	85%

2 gate zealot openings work well against Steamhammer—but only when played by PurpleWave or BananaBrain. Steamhammer can usually defend versus a lesser protoss.

#6 zzzkbot

opening	games	wins
PvZ_10/12gate	17	100%
PvZ_1basespeedzeal	11	91%
PvZ_2basespeedzeal	4	25%
PvZ_4gate2archon	4	50%
PvZ_5gategoon	6	67%
PvZ_9/9gate	15	100%
PvZ_9/9proxygate	3	67%
PvZ_bisu	5	60%
PvZ_neobisu	4	25%
PvZ_sairdt	12	100%
PvZ_sairgoon	6	50%
PvZ_stove	13	100%
12 openings	100	83%

enemy	games	wins
Z_4/5pool	33	85%
Z_9pool	17	100%
Z_9poolspeed	2	100%
Z_overpool	23	65%
Z_unknown	25	84%
5 openings	100	83%

I like that BananaBrain varies its opening choice even when several openings win 100%. (Steamhammer does too; if more than one opening has scored 100% so far, Steamhammer chooses randomly among them.) Playing a strong opening gives the opponent one problem to solve (“how do I survive this?”). Unpredictably playing one of several strong openings sets the opponent two problems (“what is this fiend doing, and then how do I live through it?”) which must both be solved, more than twice as difficult.

#7 microwave

opening	games	wins
PvZ_10/12gate	20	90%
PvZ_1basespeedzeal	11	73%
PvZ_2basespeedzeal	3	33%
PvZ_4gate2archon	6	50%
PvZ_5gategoon	8	75%
PvZ_9/9gate	17	88%
PvZ_9/9proxygate	8	75%
PvZ_bisu	10	60%
PvZ_neobisu	3	33%
PvZ_sairdt	4	50%
PvZ_sairgoon	2	0%
PvZ_stove	8	62%
12 openings	100	71%

enemy	games	wins
Z_10hatch	8	88%
Z_12hatch	38	55%
Z_12pool	2	100%
Z_4/5pool	28	71%
Z_9pool	9	67%
Z_9poolspeed	7	100%
Z_overpool	3	100%
Z_unknown	5	100%
8 openings	100	71%

#8 iron

opening	games	wins
PvT_10/12gate	6	67%
PvT_10/15gate	3	0%
PvT_12nexus	4	25%
PvT_1gatedtexpo	25	84%
PvT_2gatedt	10	60%
PvT_9/9gate	10	60%
PvT_9/9proxygate	4	75%
PvT_bulldog	1	0%
PvT_dtdrop	14	64%
PvT_nzcore	5	40%
PvT_proxydt	2	0%
PvT_stove	4	25%
PvT_zcore	5	40%
PvT_zzcore	7	43%
14 openings	100	58%

enemy	games	wins
T_1fac	30	63%
T_2fac	1	0%
T_fastexpand	29	48%
T_unknown	40	62%
4 openings	100	58%

Bulldog! That involves protoss dropping zealots, typically on cliff tanks, with a simultaneous attack by ground. When successful, a bulldog can abruptly break a terran defense that is sound against any purely ground attack. I don’t think I’ve seen BananaBrain play that; I should watch more games versus terran. Can anybody point out an example?

#9 xiaoyi

opening	games	wins
PvT_10/12gate	10	90%
PvT_10/15gate	7	43%
PvT_12nexus	5	20%
PvT_1gatedtexpo	11	100%
PvT_2gatedt	7	57%
PvT_9/9gate	6	33%
PvT_9/9proxygate	6	17%
PvT_bulldog	5	0%
PvT_dtdrop	9	89%
PvT_nzcore	6	17%
PvT_proxydt	7	71%
PvT_stove	8	75%
PvT_zcore	6	33%
PvT_zzcore	7	57%
14 openings	100	57%

enemy	games	wins
T_1fac	37	57%
T_fastexpand	20	65%
T_unknown	43	53%
3 openings	100	57%

The Stove worked against XiaoYi? Again, XiaoYi shows weakness against tricks. The Stove involves making scouts to harass while teching to dark templar. It should not be hard for a good terran to defend against; notice that Iron dealt with it well enough.

#10 mcrave

opening	games	wins
PvP_10/12gate	7	71%
PvP_12nexus	6	50%
PvP_2gatedt	6	67%
PvP_2gatedtexpo	8	50%
PvP_3gaterobo	9	78%
PvP_3gatespeedzeal	8	62%
PvP_4gategoon	7	57%
PvP_9/9gate	8	75%
PvP_9/9proxygate	6	33%
PvP_nzcore	10	90%
PvP_zcore	7	57%
PvP_zcorez	10	90%
PvP_zzcore	8	88%
13 openings	100	69%

enemy	games	wins
P_1gatecore	34	74%
P_2gate	26	65%
P_2gatefast	29	69%
P_fastexpand	1	0%
P_proxygate	4	100%
P_unknown	6	50%
6 openings	100	69%

It looks like most openings performed similarly against McRave, and BananaBrain struggled to identify what worked. I imagine a fierce learning battle, both trying to keep one step ahead.

#11 ualbertabot

opening	games	wins
PvU_10/12gate	17	94%
PvU_9/9gate	17	100%
PvU_9/9proxygate	13	85%
PvU_flex	12	67%
PvU_nzcore	11	64%
PvU_zcore	16	88%
PvU_zzcore	13	77%
7 openings	99	84%

enemy	games	wins
P_1gatecore	8	100%
P_2gate	6	83%
P_2gatefast	21	71%
P_unknown	3	33%
T_1fac	5	100%
T_2fac	7	100%
T_2rax	10	90%
T_fastexpand	3	100%
T_unknown	5	100%
Z_10hatch	2	100%
Z_12hatch	8	100%
Z_4/5pool	17	71%
Z_unknown	4	75%
13 openings	99	84%

#12 aitp

opening	games	wins
PvT_10/12gate	7	100%
PvT_10/15gate	8	100%
PvT_12nexus	6	100%
PvT_1gatedtexpo	8	100%
PvT_2gatedt	7	100%
PvT_9/9gate	6	100%
PvT_9/9proxygate	7	100%
PvT_bulldog	9	100%
PvT_dtdrop	7	100%
PvT_nzcore	7	100%
PvT_proxydt	7	100%
PvT_stove	9	100%
PvT_zcore	6	100%
PvT_zzcore	6	100%
14 openings	100	100%

enemy	games	wins
T_1fac	4	100%
T_2fac	12	100%
T_fastexpand	24	100%
T_unknown	60	100%
4 openings	100	100%

#13 bunkerboxer

opening	games	wins
PvT_10/12gate	7	100%
PvT_10/15gate	7	100%
PvT_12nexus	7	100%
PvT_1gatedtexpo	7	100%
PvT_2gatedt	7	100%
PvT_9/9gate	6	100%
PvT_9/9proxygate	7	100%
PvT_bulldog	8	100%
PvT_dtdrop	7	100%
PvT_nzcore	6	100%
PvT_proxydt	8	100%
PvT_stove	8	100%
PvT_zcore	7	100%
PvT_zzcore	8	100%
14 openings	100	100%

enemy	games	wins
T_unknown	100	100%
1 openings	100	100%

BananaBrain apparently does not have a bunker rush recognizer.

AIIDE 2019 - what AITP learned

AITP scored zero against over half of the participants, so its learning results are not deeply interesting. Also, its strategies are labeled with opaque sequences of letters and numbers. But it was easy to generate the tables, and they offer a little insight into AITP’s interesting design, so here they are.

Unlike other Steamhammer forks, AITP does not spell out concrete opening builds in the configuration file, at least not beyond 4 x SCV—start by making workers. The strategy names themselves are code sequences that tell what to do throughout the game. The letters A, B, C are stages of the game, and the combinations A1, A2 etc. are “modules” that may be active during the matching stage. Each module has its own update method to decide what to build, and the StrategyManager sometimes checks the current module for other decisions. There is module switching code in case of surprises (StrategyManager::shouldSwitchModule()); it also sets flags and updates other information.

I like it, it’s a flexible way to specify a plan for the whole game, and allows for changing plans on the fly. It’s an abstract strategy system, similar in principle to what I plan for Steamhammer. My implementation will look entirely different, though.

AITP has only 5 strategies configured. I gather that it can switch to other sequences on the fly if circumstances warrant. 5 is not many, though; I think they have only completed the basics. Here is the Steamhammer opening group it assigns to each strategy. It does not use the opening group strings, but they may have some heuristic value:

• A1-B3-C2 AntiRush
• A1-B1-B2-C2 Rush
• A3-B5-C1 NoneBunker
• A3-B7-C1 NoneBunker
• A4-B2-C1 8BB (does that mean BBS?)

#1 locutus

opening	games	wins
A1-B1-B2-C2	7	0%
A1-B3-C2	10	0%
A3-B5-C1	16	0%
A3-B7-C1	27	0%
A4-B2-C1	40	0%
5 openings	100	0%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Naked expand	59	59%	0%	11	11%	0%	7%	83%
Proxy	27	27%	0%	9	9%	0%	11%	67%
Turtle	9	9%	0%	3	3%	0%	0%	67%
Unknown	5	5%	0%	77	77%	0%	0%	80%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	100	2:01	1:17	8:58
enemy combat units	100	3:49	2:42	8:06
enemy air units	35	7:18	6:14	7:57
enemy cloaked units	61	7:34	6:14	11:26

AITP lost every game, but did not explore its possible strategies equally. It seems to have priorities. Maybe later I will look into how that works. AutoGasSteal is set true in the configuration file, but AITP did not record itself as stealing gas against any opponent. Presumably it is turned off in the code.

#2 purplewave

opening	games	wins
A1-B1-B2-C2	7	0%
A1-B3-C2	15	0%
A3-B5-C1	17	0%
A3-B7-C1	20	0%
A4-B2-C1	30	0%
5 openings	89	0%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Naked expand	80	90%	0%	3	3%	0%	2%	98%
Unknown	9	10%	0%	86	97%	0%	0%	89%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	89	2:03	1:19	6:53
enemy combat units	89	3:39	3:11	6:22
enemy air units	83	6:53	6:03	11:38
enemy cloaked units	60	6:53	5:25	14:01

#3 bananabrain

opening	games	wins
A1-B1-B2-C2	11	0%
A1-B3-C2	5	0%
A3-B5-C1	11	0%
A3-B7-C1	37	0%
A4-B2-C1	35	0%
5 openings	99	0%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	9	9%	0%	3	3%	0%	0%	78%
Heavy rush	20	20%	0%	5	5%	0%	10%	70%
Naked expand	30	30%	0%	5	5%	0%	3%	83%
Proxy	31	31%	0%	4	4%	0%	0%	90%
Unknown	9	9%	0%	82	83%	0%	0%	89%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	99	2:01	0:45	6:38
enemy combat units	99	3:39	2:25	8:07
enemy air units	69	6:31	3:39	11:25
enemy cloaked units	78	6:34	3:47	9:46

#4 daqin

opening	games	wins
A1-B1-B2-C2	5	0%
A1-B3-C2	7	0%
A3-B5-C1	17	0%
A3-B7-C1	16	0%
A4-B2-C1	28	0%
5 openings	73	0%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Unknown	73	100%	0%	73	100%	0%	0%	100%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	73	4:03	2:43	10:39
enemy combat units	73	3:42	2:34	7:59
enemy air units	73	7:01	5:55	8:43
enemy cloaked units	33	11:01	10:01	15:01

#5 steamhammer

opening	games	wins
A1-B1-B2-C2	5	0%
A1-B3-C2	17	6%
A3-B5-C1	4	0%
A3-B7-C1	22	5%
A4-B2-C1	21	5%
5 openings	69	4%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Naked expand	67	97%	3%	13	19%	0%	18%	82%
Unknown	2	3%	50%	56	81%	5%	0%	50%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	69	2:27	0:51	5:46
enemy combat units	69	3:31	2:53	7:15
enemy air units	48	7:24	5:22	14:13
enemy cloaked units	7	8:22	7:34	11:21

Steamhammer is the highest-ranked opponent that AITP scored wins against. It looks like a few scattered games, though.

#6 zzzkbot

opening	games	wins
A1-B3-C2	51	47%
A3-B7-C1	3	0%
A4-B2-C1	4	25%
3 openings	58	43%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	57	98%	44%	37	64%	51%	63%	37%
Unknown	1	2%	0%	21	36%	29%	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	58	2:37	0:58	8:29
enemy combat units	58	2:56	2:18	5:11
enemy air units	43	8:13	6:38	11:51
enemy cloaked units	0	-	-	-

It looks like ZZZKBot played its 4 pool in over half the games, and perhaps its guardian rush in the remainder. A1-B3-C2 is the strategy labeled AntiRush. AITP recorded more wins for itself than it actually scored, despite recording fewer games than it played. I suspect that AITP has changed the meaning of the numbers.

#7 microwave

opening	games	wins
A1-B1-B2-C2	5	0%
A1-B3-C2	32	22%
A3-B5-C1	14	0%
A3-B7-C1	17	0%
A4-B2-C1	7	0%
5 openings	75	9%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	59	79%	8%	5	7%	0%	3%	93%
Naked expand	15	20%	13%	3	4%	0%	7%	80%
Unknown	1	1%	0%	67	89%	10%	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	75	2:38	1:43	5:53
enemy combat units	75	3:29	2:46	4:29
enemy air units	13	13:01	11:26	15:46
enemy cloaked units	0	-	-	-

#8 iron

opening	games	wins
A1-B1-B2-C2	19	0%
A1-B3-C2	5	0%
A3-B5-C1	27	0%
A3-B7-C1	12	0%
A4-B2-C1	36	0%
5 openings	99	0%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	98	99%	0%	99	100%	0%	100%	0%
Unknown	1	1%	0%		-	-	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	99	2:18	1:55	7:23
enemy combat units	99	4:14	3:33	4:55
enemy air units	95	6:05	5:37	6:39
enemy cloaked units	95	5:55	5:26	6:38

#9 xiaoyi

opening	games	wins
A1-B1-B2-C2	10	0%
A1-B3-C2	13	0%
A3-B5-C1	25	0%
A3-B7-C1	11	0%
A4-B2-C1	36	0%
5 openings	95	0%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	94	99%	0%	95	100%	0%	100%	0%
Unknown	1	1%	0%		-	-	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	95	1:47	1:22	8:27
enemy combat units	95	3:13	2:39	4:46
enemy air units	92	9:13	7:22	11:14
enemy cloaked units	89	6:26	5:41	11:14

#10 mcrave

opening	games	wins
A1-B1-B2-C2	3	0%
A1-B3-C2	15	27%
A3-B5-C1	14	0%
A3-B7-C1	26	8%
A4-B2-C1	40	30%
5 openings	98	18%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	6	6%	17%	2	2%	0%	17%	67%
Heavy rush	37	38%	19%	4	4%	0%	5%	89%
Naked expand	42	43%	21%	7	7%	14%	10%	86%
Unknown	13	13%	8%	85	87%	20%	0%	92%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	98	1:59	1:14	12:41
enemy combat units	98	4:13	2:55	7:13
enemy air units	36	7:39	6:17	10:39
enemy cloaked units	72	6:34	5:29	11:01

#11 ualbertabot

opening	games	wins
A1-B1-B2-C2	14	21%
A1-B3-C2	30	57%
A3-B5-C1	1	0%
A4-B2-C1	14	21%
4 openings	59	39%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	50	85%	40%	13	22%	54%	20%	74%
Naked expand	7	12%	14%	3	5%	33%	0%	71%
Unknown	2	3%	100%	43	73%	35%	0%	50%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	58	2:41	1:17	12:26
enemy combat units	58	3:29	2:18	7:27
enemy air units	8	7:05	6:46	15:01
enemy cloaked units	0	-	-	-

Again, AITP recorded fewer games and more wins than happened. Is it a bug, or is it intentionally over-recording wins for certain strategies to focus its search? Or what? AITP is interesting, it deserves a closer look into the code.

#13 bunkerboxer

opening	games	wins
A1-B3-C2	56	98%
A3-B5-C1	3	100%
2 openings	59	98%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Unknown	2	3%	100%	58	98%	98%	0%	50%
Worker rush	57	97%	98%	1	2%	100%	0%	100%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	56	2:07	1:50	3:25
enemy combat units	36	7:44	2:58	9:22
enemy air units	0	-	-	-
enemy cloaked units	0	-	-	-

overall

	total		TvT		TvP		TvZ		TvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
A1-B1-B2-C2	86	3%	29	0%	33	0%	10	0%	14	21%
A1-B3-C2	256	42%	74	74%	52	8%	100	32%	30	57%
A3-B5-C1	149	2%	55	5%	75	0%	18	0%	1	0%
A3-B7-C1	191	2%	23	0%	126	2%	42	2%
A4-B2-C1	291	6%	72	0%	173	7%	32	6%	14	21%
total	973	14%	253	23%	459	4%	202	17%	59	39%
openings played	5		5		5		5		4

AIIDE 2019 - what DaQin learned

DaQin is derived from Locutus and also keeps 200 game records. But DaQin did not have pre-learned data. No games were left uncompleted; there are 100 against each opponent.

DaQin plays fewer builds than the other bots I’ve looked at so far.

#1 locutus

opening	games	wins
3GateDT	100	17%
1 openings	100	17%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	89	89%	16%	96	96%	17%	97%	2%
Proxy	6	6%	17%	2	2%	0%	0%	0%
Unknown	5	5%	40%	2	2%	50%	0%	0%

timing	#	median	early	late
gas steal attempt	47	1:43	1:39	2:06
gas steal success	0	-	-	-
enemy scout	99	6:07	1:21	9:07
enemy combat units	100	4:34	2:22	6:47
enemy air units	96	6:30	4:02	18:41
enemy cloaked units	0	-	-	-

DaQin had an enemy-specific strategy configured for Locutus, so it didn’t try anything else. Locutus is the only opponent that DaQin tried to prepare for, as far as I can see.

DaQin incorrectly recognized dark templar rush as Locutus’s strategy in most games, then correctly recorded that no cloaked units were seen during the game. See yesterday for Locutus’s play against DaQin, which did not include any DT build. I assume that the dark templar recognition is deliberately over-cautious, because DTs are dangerous. Locutus does have a fake dark templar build, where it adds a citadel of Adun to fool opponents into expecting dark templar (it works against most UAlbertaBot-derived bots).

#2 purplewave

opening	games	wins
2GateDT	23	22%
3GateDT	3	0%
4GateGoon	74	14%
3 openings	100	15%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	32	32%	16%	35	35%	23%	69%	0%
Fast rush	66	66%	14%	64	64%	11%	80%	0%
Proxy	1	1%	100%	1	1%	0%	0%	0%
Unknown	1	1%	0%		-	-	0%	0%

timing	#	median	early	late
gas steal attempt	29	0:46	0:46	0:50
gas steal success	8	-	-	-
enemy scout	99	2:17	1:18	4:41
enemy combat units	99	2:47	2:21	5:13
enemy air units	41	8:42	4:05	18:10
enemy cloaked units	85	6:07	5:06	15:41

Against PurpleWave, in contrast, DaQin less often foresaw dark templar, but apparently often faced them. (Arbiters can’t get out that fast.)

#3 bananabrain

opening	games	wins
2GateDT	4	25%
3GateDT	68	56%
4GateGoon	28	36%
3 openings	100	49%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	47	47%	53%	55	55%	62%	51%	0%
Fast rush	48	48%	44%	39	39%	33%	35%	0%
Heavy rush	1	1%	0%	2	2%	50%	0%	0%
Not fast rush	1	1%	100%	2	2%	0%	0%	0%
Proxy	1	1%	100%	2	2%	50%	0%	0%
Unknown	2	2%	50%		-	-	0%	0%

timing	#	median	early	late
gas steal attempt	43	1:42	0:46	1:48
gas steal success	9	-	-	-
enemy scout	100	1:59	1:21	3:09
enemy combat units	100	2:57	2:19	5:43
enemy air units	67	8:14	3:58	12:42
enemy cloaked units	28	5:47	4:57	19:38

#5 steamhammer

opening	games	wins
ForgeExpand5GateGoon	100	94%
1 openings	100	94%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush		-	-	1	1%	100%	0%	0%
Heavy rush	29	29%	97%	18	18%	100%	14%	3%
Hydra bust	1	1%	100%	2	2%	100%	0%	0%
Not fast rush	64	64%	92%	72	72%	93%	69%	8%
Proxy		-	-	1	1%	100%	0%	0%
Unknown	6	6%	100%	6	6%	83%	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	97	2:25	0:51	6:03
enemy combat units	100	3:17	1:57	7:03
enemy air units	18	9:23	5:30	16:18
enemy cloaked units	16	5:51	4:57	13:43

#6 zzzkbot

opening	games	wins
ForgeExpand5GateGoon	97	10%
ForgeExpandSpeedlots	3	0%
2 openings	100	10%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	3	3%	33%	5	5%	100%	0%	33%
Heavy rush	90	90%	3%	93	93%	4%	100%	0%
Not fast rush		-	-	1	1%	100%	0%	0%
Unknown	7	7%	86%	1	1%	0%	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	97	2:57	0:59	7:30
enemy combat units	100	2:39	1:47	4:31
enemy air units	7	7:58	7:46	8:25
enemy cloaked units	0	-	-	-

How did ZZZKBot upset DaQin? These numbers suggest zergling bust (it could be hydras, but DaQin does have a hydra bust recognizer which did not fire): Mostly “heavy rush,” few mutalisks, no lurkers. Steamhammer also settled on zergling bust as the best bet, but was much less successful. Microwave tried its zergling bust build versus DaQin without success. Maybe ZZZKBot’s extreme aggression is the key.

#7 microwave

opening	games	wins
ForgeExpand5GateGoon	84	85%
ForgeExpandSpeedlots	16	75%
2 openings	100	83%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	15	15%	93%	15	15%	100%	33%	0%
Heavy rush	32	32%	81%	20	20%	85%	16%	9%
Not fast rush	50	50%	80%	59	59%	76%	66%	4%
Proxy		-	-	1	1%	100%	0%	0%
Unknown	3	3%	100%	5	5%	100%	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	97	2:33	1:10	6:10
enemy combat units	90	3:29	1:50	6:37
enemy air units	41	10:37	5:15	14:07
enemy cloaked units	5	6:31	6:23	10:23

#8 iron

opening	games	wins
12NexusCarriers	92	96%
4GateGoon	8	50%
2 openings	100	92%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	55	55%	96%	95	95%	94%	95%	2%
Proxy	8	8%	50%	3	3%	33%	0%	12%
Unknown	37	37%	95%	2	2%	100%	0%	0%

timing	#	median	early	late
gas steal attempt	92	2:19	2:15	2:35
gas steal success	0	-	-	-
enemy scout	87	2:58	1:41	12:09
enemy combat units	100	4:18	2:50	5:49
enemy air units	36	8:23	6:29	15:43
enemy cloaked units	30	8:25	7:54	15:43

12NexusCarriers seems to be the default build versus terran. Apparently terrans, even Iron, were not able to punish the fast expand. Well, they’re not supposed to be able to without risk, that’s the point of cutting probes for nexus on 12, but it does require good play from protoss to ensure.

#9 xiaoyi

opening	games	wins
12NexusCarriers	93	84%
3GateDT	1	0%
4GateGoon	1	0%
DTDrop	5	80%
4 openings	100	82%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	60	60%	83%	47	47%	94%	48%	42%
Not fast rush	29	29%	76%	10	10%	80%	14%	48%
Proxy	1	1%	0%	2	2%	100%	0%	0%
Safe expand	4	4%	100%	1	1%	0%	0%	0%
Unknown	6	6%	100%	40	40%	70%	0%	17%

timing	#	median	early	late
gas steal attempt	99	2:19	0:46	2:25
gas steal success	3	-	-	-
enemy scout	93	2:23	2:10	19:03
enemy combat units	100	3:24	2:33	7:06
enemy air units	80	8:23	7:09	17:30
enemy cloaked units	11	8:15	7:57	8:27

XiaoYi usually got air tech pretty fast, that’s unusual and interesting. I’m guessing it scouted the carriers coming and prepared wraiths.

#10 mcrave

opening	games	wins
2GateDT	1	0%
3GateDT	62	52%
4GateGoon	37	24%
3 openings	100	41%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	19	19%	53%	16	16%	62%	63%	0%
Fast rush	79	79%	39%	83	83%	36%	96%	0%
Naked expand		-	-	1	1%	100%	0%	0%
Unknown	2	2%	0%		-	-	0%	0%

timing	#	median	early	late
gas steal attempt	13	1:41	0:46	1:46
gas steal success	2	-	-	-
enemy scout	100	2:22	1:25	6:11
enemy combat units	100	3:03	2:21	5:29
enemy air units	21	6:11	3:38	15:57
enemy cloaked units	76	6:23	5:17	8:33

McRave upset DaQin. Dark templar in 3 out of 4 games, and they came out pretty early. PurpleWave showed a similar pattern, but it wasn’t as salient because it wasn’t an upset. The dark templar rush recognizer did not seem to be fully effective, possibly because it was overridden by the fast rush recognizer. DaQin’s best counter was DT-back-atcha.

#11 ualbertabot

opening	games	wins
12NexusCarriers	2	50%
3GateDT	25	88%
4GateGoon	4	50%
DTDrop	2	50%
ForgeExpand5GateGoon	67	78%
5 openings	100	78%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
DarkTemplar rush	12	12%	75%	11	11%	91%	17%	8%
Factory	3	3%	67%	11	11%	100%	0%	0%
Fast rush	67	67%	78%	47	47%	57%	51%	7%
Heavy rush	1	1%	100%	5	5%	100%	0%	0%
Hydra bust		-	-	1	1%	100%	0%	0%
Not fast rush	13	13%	92%	15	15%	100%	8%	15%
Proxy	1	1%	0%	2	2%	50%	0%	0%
Unknown	3	3%	67%	8	8%	100%	0%	0%

timing	#	median	early	late
gas steal attempt	24	1:43	0:46	2:17
gas steal success	2	-	-	-
enemy scout	87	1:47	1:14	9:30
enemy combat units	98	3:01	1:38	6:58
enemy air units	9	7:37	6:07	15:47
enemy cloaked units	4	5:09	4:33	5:19

DaQin had some trouble adapting to random UAlbertaBot. This is a point where preparation for the opponent would have been valuable: Make a build that UAlbertaBot can’t beat and ensure that it is played. It can be a general-purpose build; PurpleWave included a cannon turtle build that is safe against all sorts of rushes.

#12 aitp

opening	games	wins
12NexusCarriers	100	100%
1 openings	100	100%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	79	79%	100%	5	5%	100%	4%	96%
Unknown	21	21%	100%	95	95%	100%	0%	90%

timing	#	median	early	late
gas steal attempt	100	2:19	2:16	2:25
gas steal success	0	-	-	-
enemy scout	11	7:53	2:38	11:45
enemy combat units	100	5:55	2:43	7:29
enemy air units	67	10:07	8:50	14:01
enemy cloaked units	0	-	-	-

#13 bunkerboxer

opening	games	wins
12NexusCarriers	95	98%
4GateGoon	5	100%
2 openings	100	98%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush		-	-	1	1%	100%	0%	0%
Not fast rush	78	78%	97%	35	35%	100%	35%	60%
Proxy	5	5%	100%	5	5%	100%	0%	0%
Unknown	17	17%	100%	59	59%	97%	0%	71%

timing	#	median	early	late
gas steal attempt	93	2:20	2:15	7:19
gas steal success	28	-	-	-
enemy scout	62	2:07	1:47	7:18
enemy combat units	59	2:59	2:09	7:51
enemy air units	0	-	-	-
enemy cloaked units	0	-	-	-

Beating BunkerBoxeR with a build of fast expansion into carriers is... not the intuitive choice. But I guess it worked.

overall

	total		PvT		PvP		PvZ		PvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
12NexusCarriers	382	94%	380	94%					2	50%
2GateDT	28	21%			28	21%
3GateDT	259	42%	1	0%	233	37%			25	88%
4GateGoon	157	25%	14	64%	139	21%			4	50%
DTDrop	7	71%	5	80%					2	50%
ForgeExpand5GateGoon	348	65%					281	62%	67	78%
ForgeExpandSpeedlots	19	63%					19	63%
total	1200	63%	400	93%	400	30%	300	62%	100	78%
openings played	7		4		3		2		5

AIIDE 2019 - what Locutus learned

Locutus’s game records are in almost the same format as Steamhammer’s and can be summarized by the same script. I expect it will also work for DaQin and AITP.

Where Steamhammer was set to keep 100 game records per opponent, Locutus was set to keep 200. Since there were 100 rounds in the tournament, game counts over 100 mean that pre-learned data is included in the table alongside the tournament data. If Locutus was not trained on a near-final version of the opponent, then the two could be significantly different.

#2 purplewave

opening	games	wins
4GateGoon	28	54%
4GateGoonWithObs	16	62%
FakeDTRush	10	20%
ForgeExpand	19	63%
ZealotDrop	127	73%
5 openings	200	66%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Dark templar	20	10%	50%	30	15%	37%	45%	0%
Fast rush	3	2%	100%	5	2%	80%	0%	0%
Heavy rush	6	3%	17%	12	6%	67%	0%	0%
Not fast rush	171	86%	69%	153	76%	71%	81%	0%

timing	#	median	early	late
gas steal attempt	92	1:44	0:44	2:01
gas steal success	15	-	-	-
enemy scout	186	2:27	1:09	16:11
enemy combat units	198	3:29	2:19	7:26
enemy air units	55	6:50	4:50	20:31
enemy cloaked units	93	11:07	5:13	19:54

After seeing a few Locutus-PurpleWave games I got the impression that PurpleWave reacted adequately to Locutus’s trick strategy of cannoning the ramp and then dropping zealots. So I was surprised that Locutus considered it the best choice. But the overall win rate is high compared to the tournament results, so I suspect it is influenced by pre-learned data from games against a weaker version of PurpleWave.

#3 bananabrain

opening	games	wins
4GateGoon	12	83%
ForgeExpand	37	84%
ZealotDrop	151	95%
3 openings	200	92%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Dark templar	5	2%	80%	28	14%	100%	40%	0%
Fast rush	6	3%	100%	12	6%	83%	0%	0%
Heavy rush	1	0%	100%	19	10%	89%	0%	0%
Not fast rush	188	94%	92%	141	70%	91%	70%	0%

timing	#	median	early	late
gas steal attempt	59	1:45	0:46	1:52
gas steal success	6	-	-	-
enemy scout	196	1:57	0:46	10:09
enemy combat units	200	3:30	2:18	7:25
enemy air units	20	15:41	13:05	17:35
enemy cloaked units	31	6:13	5:46	16:11

#4 daqin

opening	games	wins
4GateGoon	11	64%
FakeDTRush	1	0%
ForgeExpand	1	0%
ZealotDrop	87	87%
4 openings	100	83%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Dark templar	9	9%	89%	21	21%	76%	11%	0%
Fast rush	2	2%	100%	3	3%	67%	0%	0%
Not fast rush	88	88%	82%	76	76%	86%	74%	0%
Unknown	1	1%	100%		-	-	0%	0%

timing	#	median	early	late
gas steal attempt	26	1:45	0:45	1:49
gas steal success	4	-	-	-
enemy scout	97	3:02	2:15	18:05
enemy combat units	100	3:31	2:19	5:22
enemy air units	2	17:47	16:23	19:10
enemy cloaked units	91	7:18	6:02	9:17

#5 steamhammer

opening	games	wins
4GateGoon	7	100%
9-9GateDefensive	5	100%
CannonFirst4GateGoon	11	100%
ForgeExpand4Gate2Archon	11	73%
ForgeExpand5GateGoon	155	95%
ForgeExpandSpeedlots	1	100%
PlasmaCorsairsCarriers	9	100%
ProxyHeavyZealotRush2	1	0%
8 openings	200	94%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	14	7%	100%	26	13%	96%	57%	0%
Heavy rush	26	13%	81%	59	30%	92%	38%	0%
Hydra bust	4	2%	100%	23	12%	87%	50%	0%
Not fast rush	156	78%	96%	92	46%	98%	53%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	188	2:21	0:51	18:45
enemy combat units	199	3:10	2:02	7:11
enemy air units	12	5:40	5:01	6:14
enemy cloaked units	7	10:02	5:15	19:39

The numbers in the “recognized” columns of the plan table show how widely Steamhammer cast its net for a solution to Locutus.

Locutus never tried to steal the gas of a zerg. Objectively, that makes sense. In the context of bot play, I’m not so sure; many bots of all races mess up their builds in the face of a gas steal.

#6 zzzkbot

opening	games	wins
9-9GateDefensive	4	100%
CannonAtChokeFirst4GateGoon	13	54%
CannonFirst4GateGoon	178	99%
PlasmaCorsairsCarriers	1	100%
PlasmaProxy2Gate	4	100%
5 openings	200	96%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	115	57%	97%	107	54%	94%	73%	0%
Heavy rush	78	39%	96%	66	33%	100%	54%	0%
Hydra bust	1	0%	100%	2	1%	100%	100%	0%
Not fast rush	6	3%	100%	25	12%	96%	17%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	200	2:25	0:51	5:58
enemy combat units	196	2:28	2:03	7:59
enemy air units	52	7:53	5:26	13:43
enemy cloaked units	0	-	-	-

#7 microwave

opening	games	wins
9-9GateDefensive	2	100%
ForgeExpand4Gate2Archon	3	67%
ForgeExpand5GateGoon	146	99%
ForgeExpandSpeedlots	44	80%
PlasmaCorsairsCarriers	2	100%
PlasmaProxy2Gate	3	100%
6 openings	200	94%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	50	25%	82%	41	20%	98%	32%	0%
Heavy rush	34	17%	100%	50	25%	90%	47%	0%
Hydra bust		-	-	17	8%	100%	0%	0%
Not fast rush	115	57%	98%	91	46%	95%	57%	0%
Proxy	1	0%	100%	1	0%	100%	0%	0%

timing	#	median	early	late
gas steal attempt	0	-	-	-
gas steal success	0	-	-	-
enemy scout	195	2:30	1:07	21:14
enemy combat units	198	3:03	1:47	7:59
enemy air units	69	11:19	5:49	24:25
enemy cloaked units	32	6:37	5:21	13:49

#8 iron

opening	games	wins
CautiousDTDrop	200	98%
1 openings	200	98%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Not fast rush	13	6%	100%	61	30%	100%	46%	0%
Wall-in	187	94%	98%	139	70%	97%	71%	0%

timing	#	median	early	late
gas steal attempt	35	0:46	0:45	0:48
gas steal success	11	-	-	-
enemy scout	190	2:45	1:42	10:46
enemy combat units	200	4:07	2:34	6:39
enemy air units	117	8:18	6:55	13:39
enemy cloaked units	117	8:18	6:55	13:39

Locutus declared an enemy-specific strategy against Iron. I’m not sure why it also had pre-learned data.

#9 xiaoyi

opening	games	wins
10-15GateGoon	1	0%
10Gate25NexusFE	2	50%
DTDrop	1	0%
ForgeExpand	1	0%
Proxy2ZealotsIntoGoons	30	93%
ProxyDTRush	165	95%
6 openings	200	93%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Not fast rush	200	100%	93%	200	100%	93%	100%	0%

timing	#	median	early	late
gas steal attempt	68	1:17	1:12	1:50
gas steal success	12	-	-	-
enemy scout	194	3:01	2:11	15:29
enemy combat units	200	4:20	2:29	6:57
enemy air units	13	12:59	7:54	15:10
enemy cloaked units	4	8:00	7:54	8:18

Proxy DT rush. That tends to confirm my picture of XiaoYi as vulnerable to tricks.

#10 mcrave

opening	games	wins
4GateGoon	5	80%
4GateGoonWithObs	3	100%
FakeDTRush	1	0%
ForgeExpand	2	50%
ZealotDrop	189	94%
5 openings	200	93%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Dark templar	3	2%	100%	16	8%	94%	67%	0%
Fast rush	3	2%	100%	18	9%	100%	0%	0%
Heavy rush	2	1%	50%	5	2%	100%	0%	0%
Not fast rush	192	96%	93%	161	80%	92%	82%	0%

timing	#	median	early	late
gas steal attempt	99	1:46	0:45	1:57
gas steal success	2	-	-	-
enemy scout	193	2:09	1:21	14:38
enemy combat units	200	3:35	2:21	7:26
enemy air units	24	11:44	7:19	20:31
enemy cloaked units	60	11:03	5:15	14:25

#11 ualbertabot

opening	games	wins
CannonFirst4GateGoon	188	99%
PlasmaProxy2Gate	10	100%
Proxy9-9Gate	2	0%
3 openings	200	98%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Dark templar	1	0%	100%	11	6%	100%	0%	0%
Fast rush	23	12%	100%	29	14%	93%	22%	0%
Heavy rush	48	24%	96%	63	32%	100%	29%	0%
Not fast rush	128	64%	99%	97	48%	99%	51%	0%

timing	#	median	early	late
gas steal attempt	95	2:00	1:57	2:03
gas steal success	0	-	-	-
enemy scout	127	2:11	1:18	5:45
enemy combat units	135	3:22	2:01	6:54
enemy air units	7	6:45	6:41	6:53
enemy cloaked units	11	4:34	4:30	5:13

Locutus configured an enemy-specific strategy against UAlbertaBot. Openings other than CannonFirst4GateGoon are from pre-learned data, which was ignored in making the opening decision.

#12 aitp

opening	games	wins
DTDrop	66	100%
ForgeExpand	33	97%
Turtle	1	100%
3 openings	100	99%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	2	2%	100%	7	7%	100%	0%	0%
Heavy rush	1	1%	100%	1	1%	100%	0%	0%
Not fast rush	77	77%	99%	62	62%	100%	62%	0%
Unknown	1	1%	100%		-	-	0%	0%
Wall-in	19	19%	100%	30	30%	97%	42%	0%

timing	#	median	early	late
gas steal attempt	43	0:46	0:45	1:27
gas steal success	15	-	-	-
enemy scout	26	3:19	2:41	6:02
enemy combat units	100	3:48	2:01	7:49
enemy air units	0	-	-	-
enemy cloaked units	0	-	-	-

#13 bunkerboxer

opening	games	wins
10Gate25NexusFE	21	95%
CannonFirst4GateGoon	88	100%
ForgeExpand	79	100%
PlasmaProxy2Gate	10	100%
Proxy9-9Gate	2	0%
5 openings	200	98%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush	17	8%	100%	20	10%	90%	24%	0%
Heavy rush	36	18%	94%	35	18%	100%	31%	0%
Not fast rush	147	74%	99%	144	72%	99%	83%	1%
Unknown		-	-	1	0%	100%	0%	0%

timing	#	median	early	late
gas steal attempt	87	1:29	0:45	2:03
gas steal success	16	-	-	-
enemy scout	104	2:07	1:27	3:18
enemy combat units	113	2:26	2:01	8:14
enemy air units	0	-	-	-
enemy cloaked units	0	-	-	-

overall

	total		PvT		PvP		PvZ		PvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
10-15GateGoon	1	0%	1	0%
10Gate25NexusFE	23	91%	23	91%
4GateGoon	63	68%			56	64%	7	100%
4GateGoonWithObs	19	68%			19	68%
9-9GateDefensive	11	100%					11	100%
CannonAtChokeFirst4GateGoon	13	54%					13	54%
CannonFirst4GateGoon	465	100%					189	99%	276	100%
CautiousDTDrop	200	98%	200	98%
DTDrop	67	99%	67	99%
FakeDTRush	12	17%			12	17%
ForgeExpand	172	90%	113	98%	59	75%
ForgeExpand4Gate2Archon	14	71%					14	71%
ForgeExpand5GateGoon	301	97%					301	97%
ForgeExpandSpeedlots	45	80%					45	80%
PlasmaCorsairsCarriers	12	100%					12	100%
PlasmaProxy2Gate	27	100%					7	100%	20	100%
Proxy2ZealotsIntoGoons	30	93%	30	93%
Proxy9-9Gate	4	0%							4	0%
ProxyDTRush	165	95%	165	95%
ProxyHeavyZealotRush2	1	0%					1	0%
Turtle	1	100%	1	100%
ZealotDrop	554	88%			554	88%
total	2200	92%	600	97%	700	84%	600	95%	300	98%
openings played	22		8		5		10		3

AIIDE 2019 - what Steamhammer learned

Today is Steamhammer. With a mid-rank finish and the widest range of builds, plus informative game records, Steamhammer may give us the best insight into how other bots played.

The tournament was 100 rounds, and Steamhammer was configured to remember the previous 100 game records, because in play there is no reason to remember more (earlier records are increasingly discounted). Steamhammer also had pre-learned game records for many opponents, so when the game record count reached 100, new records added caused old pre-learned records to drop away. Not all 100 tournament games happened for each opponent, but the pre-learned games filled in the small gaps so that Steamhammer ended up with exactly 100 game records per opponent in every case.

The “opening” table counts Steamhammer’s opening choices. The “plan” table shows the plan that Steamhammer first predicted that the opponent would play, then recognized that the opponent was playing. Both prediction and recognition can be wrong. The timing table is new this year, an attempt to get a little more information out of Steamhammer’s rich game records. For some events, it gives the count of games in which the event occurred, and the median time, earliest time, and latest time it occurred in those games when it did. The times are given under the assumption that 1 second of game time is exactly 24 frames, a simplification.

• gas steal attempt - When Steamhammer sent out the drone to steal gas (if it did).
• gas steal success - Whether the gas steal attempt succeeded in taking the opponent’s gas. Steamhammer doesn’t record the time it happens, so this is only a success count.
• enemy scout - When the enemy scout first reached Streamhammer’s base.
• enemy combat units - When the first enemy combat unit was seen.
• enemy air units - When the enemy is first known to have tech for flying units (except overlords).
• enemy cloaked units - When the enemy is first known to have tech for cloaked units.

#1 locutus

opening	games	wins
11Gas10PoolMuta	1	0%
12Hatch12Pool	1	0%
2.5HatchMuta	2	0%
2HatchHydra	1	0%
2HatchHydraBust	4	0%
2HatchLingAllInSpire	1	0%
3HatchHydraBust	5	0%
3HatchHydraExpo	1	0%
3HatchLateHydras+1	5	0%
3HatchLingBust2	3	0%
4HatchBeforeGas	1	0%
4HatchBeforeLair	5	0%
5HatchBeforeGas	2	0%
5PoolHard2Player	2	0%
5Scout	1	0%
7PoolSoft	1	0%
8-8HydraRush	1	0%
8Hatch7Pool	1	0%
8Pool	1	0%
9Pool	2	0%
9PoolHatch	2	0%
9PoolSpeedAllIn	1	0%
AntiFact_Overpool9Gas	1	0%
DefilerRush	2	0%
Over10Hatch2Sunk	1	0%
Over10HatchSlowLings	1	0%
Over10PoolMuta	1	0%
OverhatchExpoLing	2	0%
OverhatchExpoMuta	2	0%
Overpool2HatchLurker	1	0%
OverpoolHatch	1	0%
OverpoolHydra	12	0%
OverpoolSpeed	4	0%
OverpoolSunk	1	0%
Overpool_4HatchLing	2	0%
PurpleSwarmBuild	1	0%
Sparkle 1HatchMuta	1	0%
Sparkle 2HatchMuta	1	0%
ZvP_3BaseSpire+Den	1	0%
ZvP_3HatchPoolHydra	2	0%
ZvP_4HatchPoolHydra	14	21%
ZvT_12PoolMuta	2	0%
ZvZ_12PoolLing	1	0%
ZvZ_12PoolLingB	1	0%
ZvZ_Overpool9Gas	1	0%
45 openings	100	3%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush		-	-	4	4%	0%	0%	0%
Safe expand	56	56%	2%	44	44%	5%	43%	5%
Turtle	44	44%	5%	45	45%	2%	43%	9%
Unknown		-	-	7	7%	0%	0%	0%

timing	#	median	early	late
gas steal attempt	43	1:29	0:00	2:34
gas steal success	23	-	-	-
enemy scout	100	1:35	1:11	6:51
enemy combat units	100	5:23	4:10	7:58
enemy air units	7	12:05	7:39	21:34
enemy cloaked units	8	11:24	4:39	21:34

It looks like Locutus opened forge-expand every game. It worked. Steamhammer desperately tried everything, including ZvZ builds and island builds, and finally squeezed 3 wins with a risky extreme macro opening, 4 hatcheries before spawning pool, which was able to win one game in five. I should add 5 and 6 hatch before pool and see if they help.

Locutus rarely made corsairs or dark templar. I wonder what its criteria are? Maybe it won before it got that far. The scout was usually quite early, and the first combat unit was seen late, as expected for a cannon-first opener.

I played over the 3 wins. They were in rounds 65, 70, and 73; after that, I expect that Locutus found a way to win. In 2 games, Steamhammer pulled ahead in early economy with its greedy opening, then struggled to defend and fell into a losing position. But Locutus got most of its units stuck in its base, and Steamhammer was able to turn it around and win after a hard fight with critical defiler support. In the third win, Locutus chose a zealot-archon unit mix that Steamhammer knows how to cope with, and zerg powered through.

#2 purplewave

opening	games	wins
10Pool9Gas	1	0%
11HatchTurtleHydra	50	44%
11HatchTurtleLurker	1	0%
11HatchTurtleMuta	15	20%
12Hatch_4HatchLing	1	0%
2HatchLingAllInSpire	1	0%
3HatchHydraExpo	1	0%
3HatchLing	1	0%
3HatchLingExpo	1	0%
4HatchBeforeLair	1	0%
5PoolSoft	1	0%
7Pool12Hatch	1	0%
9PoolBurrow	1	0%
AntiZeal_12Hatch	1	0%
HiveRush	1	0%
Over10Hatch	2	0%
Over10Hatch1Sunk	3	0%
Over10Hatch2Sunk	1	0%
OverhatchLateGas	1	0%
Overpool+1	1	0%
OverpoolSpeed	1	0%
OverpoolTurtle	2	0%
ZvP_3HatchPoolHydra	1	0%
ZvT_7Pool	1	0%
ZvZ_Overpool9Gas	9	33%
25 openings	100	28%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Fast rush		-	-	2	2%	0%	0%	0%
Heavy rush	99	99%	28%	90	90%	24%	90%	4%
Safe expand		-	-	3	3%	100%	0%	0%
Turtle		-	-	1	1%	100%	0%	0%
Unknown	1	1%	0%	4	4%	50%	0%	0%

timing	#	median	early	late
gas steal attempt	41	1:13	1:09	2:36
gas steal success	37	-	-	-
enemy scout	98	2:13	1:19	15:06
enemy combat units	99	2:35	2:15	5:59
enemy air units	51	13:57	5:15	20:37
enemy cloaked units	48	14:01	6:02	17:11

PurpleWave in contrast went with mostly 2 gate openings against Steamhammer; that’s what “heavy rush” means for protoss. Steamhammer countered with early sunkens plus hydras or, less successfully, mutalisks (this version had a bug that weakened mutalisk play). There are also 3 wins with a ZvZ fast mutalisk opening. 2 gates should beat that, so protoss either played poorly or chose a different build in those games.

#3 bananabrain

opening	games	wins
10HatchHydra	1	0%
11Gas10PoolLurker	2	0%
11Gas10PoolMuta	10	10%
11HatchTurtleHydra	1	0%
12Hatch_4HatchLing	1	0%
2.5HatchMuta	1	0%
2HatchLingAllInSpire	1	0%
3HatchHydra	2	0%
3HatchHydraBust	1	0%
3HatchHydraExpo	1	0%
3HatchLateHydras	1	0%
3HatchLingExpo	9	11%
5PoolHard	1	0%
6Pool	1	0%
6PoolSpeed	1	0%
7-7HydraLingRush	1	0%
8Gas7PoolLurker B	1	0%
9HatchMain9Pool9Gas	1	0%
9PoolBurrow	1	0%
9PoolSpeed	1	0%
9PoolSpire	1	0%
AntiFact_2Hatch	15	40%
AntiFact_Overpool9Gas	1	0%
AntiZeal_12Hatch	10	0%
Over10Hatch1Sunk	1	0%
Over10HatchBust	28	25%
OverpoolSpeed	1	0%
OverpoolTurtle 0	1	0%
ZvP_Overpool3Hatch	2	0%
ZvT_3HatchMuta	1	0%
30 openings	100	15%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush	70	70%	14%	32	32%	3%	33%	39%
Naked expand	2	2%	0%	1	1%	100%	0%	50%
Proxy		-	-	4	4%	0%	0%	0%
Safe expand	23	23%	17%	16	16%	31%	17%	35%
Turtle	5	5%	20%	10	10%	20%	0%	20%
Unknown		-	-	37	37%	16%	0%	0%

timing	#	median	early	late
gas steal attempt	50	1:26	1:09	1:38
gas steal success	31	-	-	-
enemy scout	100	1:56	1:21	3:17
enemy combat units	100	2:56	2:19	8:19
enemy air units	83	5:19	2:51	11:41
enemy cloaked units	61	6:26	3:27	14:05

BananaBrain contrasts with both previous opponents in that it played a variety of builds. Steamhammer was unable to predict what was coming. It looks strange that the best reaction was an opening designed to counter terran factory-first builds that include a vulture runby, but in fact it is a mildly specialized 2 hatch mutalisk variant and not so surprising. BananaBrain made corsairs and dark templar in most games.

#4 daqin

opening	games	wins
10HatchHydra	1	0%
10Pool9Hatch	1	0%
11Gas10PoolLurker	11	9%
11Gas10PoolMuta	1	0%
11HatchTurtleLurker	1	0%
12Hatch12Pool	1	0%
12HatchTurtle	2	0%
12Hatch_4HatchLing	2	0%
2HatchHydraBust	1	0%
2HatchLurker	1	0%
3HatchHydra	1	0%
3HatchHydraBust	1	0%
3HatchHydraExpo	4	0%
3HatchLing	1	0%
3HatchLingBust2	10	20%
3HatchLingExpo	1	0%
4HatchBeforeGas	3	0%
4HatchBeforeLair	3	0%
4PoolSoft	1	0%
5HatchBeforeGas	2	0%
5Scout	1	0%
8-8HydraRush	1	0%
8Hatch7Pool	1	0%
8Hatch7PoolSpeed	19	16%
9GasLair	1	0%
9HatchExpo9Pool9Gas	2	0%
9PoolBurrow	1	0%
9PoolSpeedAllIn	1	0%
AntiFact_2Hatch	1	0%
AntiFactory	1	0%
OverhatchExpoLing	3	0%
OverhatchExpoMuta	1	0%
OverhatchLateGas	1	0%
Overpool+1	1	0%
OverpoolSunk	1	0%
ZvP_2HatchMuta	1	0%
ZvP_3BaseSpire+Den	11	0%
ZvZ_12Gas11Pool	1	0%
ZvZ_12HatchMain	1	0%
ZvZ_12Pool	1	0%
40 openings	100	6%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush		-	-	4	4%	0%	0%	0%
Proxy		-	-	6	6%	0%	0%	0%
Safe expand	11	11%	9%	23	23%	13%	18%	0%
Turtle	89	89%	6%	63	63%	5%	62%	4%
Unknown		-	-	4	4%	0%	0%	0%

timing	#	median	early	late
gas steal attempt	43	1:26	1:09	1:58
gas steal success	24	-	-	-
enemy scout	99	1:34	1:14	9:35
enemy combat units	100	5:26	4:07	6:58
enemy air units	31	9:47	8:54	16:19
enemy cloaked units	36	9:44	7:23	14:39

DaQin played forge-expand and has similar timings to Locutus, for the same reasons. The fast scout is to allow adjustment of the cannon count and timing, and the late combat units are due to getting a gateway later. Steamhammer couldn’t find any better reaction than to try to bust with zerglings, either early or late, and it was not particularly successful.

#6 zzzkbot

opening	games	wins
2.5HatchMuta	1	0%
3HatchLingExpo	1	0%
9HatchExpo9Pool9Gas	3	67%
9PoolLurker	9	33%
9PoolSpeedAllIn	1	0%
9PoolSunkHatch	12	58%
9PoolSunkSpeed	9	33%
OverpoolSunk	13	38%
ZvZ_Overgas9Pool	12	58%
ZvZ_Overpool9Gas	39	82%
10 openings	100	59%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Turtle	100	100%	59%	79	79%	71%	79%	21%
Unknown		-	-	21	21%	14%	0%	0%

timing	#	median	early	late
gas steal attempt	26	1:14	1:11	1:37
gas steal success	12	-	-	-
enemy scout	99	2:55	0:37	7:23
enemy combat units	100	4:06	2:25	4:43
enemy air units	66	5:29	5:03	10:11
enemy cloaked units	0	-	-	-

ZZZKBot mostly played a turtle-into-mutalisks strategy against Steamhammer, and was somewhat successful. You can read the idea straight out of the tables above. The 2:25 earliest timing but 4:06 median timing for combat units says that ZZZKBot sometimes rushed zerglings, but usually not.

#7 microwave

opening	games	wins
11Gas10PoolLurker	28	43%
11Gas10PoolMuta	15	20%
2HatchHydra	1	0%
3HatchLing	1	0%
3HatchLingBust2	1	0%
5PoolHard	1	0%
6Pool	1	0%
7-7HydraLingRush	1	0%
9GasLair	1	0%
9HatchMain9Pool9Gas	1	0%
9PoolLurker	1	0%
OverhatchLing	1	0%
OverhatchMuta	20	30%
OverpoolLurker	1	0%
PurpleSwarmBuild	1	0%
Sparkle 1HatchMuta	8	12%
ZvZ_12HatchExpo	1	0%
ZvZ_12HatchMain	2	0%
ZvZ_Overpool11Gas	1	0%
ZvZ_Overpool9Gas	1	0%
ZvZ_OverpoolTurtle	12	25%
21 openings	100	25%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush	7	7%	29%	2	2%	50%	0%	86%
Naked expand	89	89%	26%	23	23%	17%	25%	73%
Turtle	4	4%	0%	1	1%	0%	0%	75%
Unknown		-	-	74	74%	27%	0%	0%

timing	#	median	early	late
gas steal attempt	54	1:30	0:42	1:57
gas steal success	1	-	-	-
enemy scout	100	2:39	1:37	3:47
enemy combat units	100	2:27	2:14	3:58
enemy air units	39	6:25	5:21	9:19
enemy cloaked units	0	-	-	-

Microwave played a 9 pool speed build into expansion and then spire, which you cannot read out of the plan table because Steamhammer didn’t recognize it accurately. But in the timing table you can see that combat units (zerglings) were early and air units (mutalisks) were not late.

Steamhammer was not able to steal Microwave’s gas. It probably should have stopped trying.

#8 iron

opening	games	wins
2.5HatchMuta	1	0%
5HatchBeforeGas	1	0%
5Scout	1	0%
7-7HydraLingRush	43	84%
8Gas7PoolLurker B	1	0%
AntiFact_13Pool	11	55%
AntiFactory	39	64%
OverhatchExpoMuta	1	0%
OverhatchMuta	1	0%
Sparkle 2HatchMuta	1	0%
10 openings	100	67%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	100	100%	67%	91	91%	70%	91%	9%
Unknown		-	-	9	9%	33%	0%	0%

timing	#	median	early	late
gas steal attempt	30	1:26	1:25	2:03
gas steal success	0	-	-	-
enemy scout	87	3:27	0:34	15:09
enemy combat units	100	2:59	2:29	5:27
enemy air units	23	13:53	10:11	20:07
enemy cloaked units	69	6:39	5:35	11:47

Look at that huge range of scout timings! 0:34 means that the scout SCV was sent immediately at the start of the game and went directly to the zerg base. 15:09 probably means that no enemy unit got into the base until the end of the game when Steamhammer lost (Steamhammer is on BWAPI 4.1.2 and cannot detect scans). Steamhammer prevented the scout entirely in 13 out of the 100 games by its own count; 15:09 is probably the same. Steamhammer was not able to steal Iron’s gas, and did eventually give up trying.

#9 xiaoyi

opening	games	wins
12Hatch13Pool	1	0%
2HatchLingAllInSpire	16	19%
2HatchLurkerAllIn	1	0%
3HatchLurker	1	0%
3HatchPoolMuta	1	0%
5PoolSoft	1	0%
7-7HydraLingRush	36	69%
7PoolMid	24	75%
AntiFact_13Pool	9	33%
AntiFactory	1	0%
AntiFactoryHydra	8	12%
Over10Hatch	1	0%
12 openings	100	50%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	98	98%	51%	74	74%	41%	74%	21%
Naked expand		-	-	1	1%	100%	0%	0%
Safe expand		-	-	3	3%	33%	0%	0%
Unknown	2	2%	0%	22	22%	82%	0%	50%

timing	#	median	early	late
gas steal attempt	46	1:26	1:05	2:08
gas steal success	1	-	-	-
enemy scout	92	2:37	1:34	7:29
enemy combat units	100	2:39	2:25	3:22
enemy air units	55	12:23	8:55	16:30
enemy cloaked units	61	7:39	5:42	17:07

Steamhammer liked 7 pool against XiaoYi, just as Microwave did, but also liked its dawn hydra rush.

#10 mcrave

opening	games	wins
2HatchHydraBust	5	80%
3HatchHydraBust	6	67%
9PoolHatch	19	84%
Over10Hatch2Sunk	32	88%
Over10Hatch2SunkHard	26	92%
OverpoolTurtle	12	83%
6 openings	100	86%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Heavy rush	97	97%	87%	64	64%	80%	63%	19%
Safe expand	2	2%	100%	11	11%	100%	0%	0%
Turtle	1	1%	0%	7	7%	100%	0%	0%
Unknown		-	-	18	18%	94%	0%	0%

timing	#	median	early	late
gas steal attempt	53	1:28	1:11	1:33
gas steal success	33	-	-	-
enemy scout	92	2:21	1:14	9:55
enemy combat units	98	2:42	2:15	8:51
enemy air units	69	10:02	5:06	14:18
enemy cloaked units	30	10:29	5:01	16:39

#11 ualbertabot

opening	games	wins
5Scout	28	75%
Over10Hatch2Sunk	1	0%
OverpoolTurtle	71	97%
3 openings	100	90%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	5	5%	100%	8	8%	100%	0%	0%
Fast rush	4	4%	100%	11	11%	100%	0%	25%
Heavy rush	86	86%	88%	47	47%	87%	48%	24%
Naked expand	5	5%	100%	10	10%	100%	0%	40%
Unknown		-	-	24	24%	83%	0%	0%

timing	#	median	early	late
gas steal attempt	43	1:12	0:00	1:16
gas steal success	23	-	-	-
enemy scout	79	2:11	1:19	4:23
enemy combat units	61	2:43	1:46	4:33
enemy air units	10	14:20	12:02	16:57
enemy cloaked units	12	14:25	2:38	16:57

Thanks to pre-learning, I expected Steamhammer to play its overpool turtle build every game. I’m not sure why it didn’t. I also don’t know how it hit on the 5 scout build, which means send out a drone at 5 supply to scout very early, then leave all decisions to the strategy boss. It’s a logical try against a random opponent, especially one that has a single strategy for each race, and it was fairly successful. But it did not appear in the pre-learned data.

#12 aitp

opening	games	wins
7-7HydraLingRush	12	92%
9HatchExpo9Pool9Gas	30	100%
AntiFact_13Pool	22	100%
AntiFactory	21	95%
AntiFactoryHydra	14	93%
ZvT_3HatchMuta	1	0%
6 openings	100	96%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Factory	87	87%	97%	30	30%	97%	28%	62%
Fast rush	4	4%	100%	4	4%	100%	0%	50%
Heavy rush	4	4%	100%	2	2%	100%	25%	50%
Turtle	4	4%	100%	5	5%	60%	0%	25%
Unknown	1	1%	0%	59	59%	98%	0%	0%

timing	#	median	early	late
gas steal attempt	44	1:25	1:12	1:58
gas steal success	3	-	-	-
enemy scout	25	3:31	2:41	19:11
enemy combat units	100	3:23	1:57	8:09
enemy air units	31	11:27	7:46	19:34
enemy cloaked units	59	7:38	5:19	16:34

AITP scored zip against both mass zerglings (9HatchExpo9Pool9Gas) and against fast mutalisks (AntiFact_13Pool). And it successfully scouted Steamhammer’s base only 25% of the time. If you don’t scout reliably, it will be hard to withstand rushes.

#13 bunkerboxer

opening	games	wins
9PoolExpo	42	100%
9PoolSunkHatch	31	100%
9PoolSunkSpeed	27	100%
3 openings	100	100%

plan	predicted			recognized			accuracy
plan	count	games	wins	count	games	wins	good	?
Proxy	66	66%	100%	35	35%	100%	36%	36%
Unknown		-	-	42	42%	100%	0%	0%
Worker rush	34	34%	100%	23	23%	100%	15%	53%

timing	#	median	early	late
gas steal attempt	40	1:35	1:32	1:37
gas steal success	33	-	-	-
enemy scout	91	2:10	1:47	3:47
enemy combat units	84	2:43	2:09	3:27
enemy air units	0	-	-	-
enemy cloaked units	0	-	-	-

Steamhammer was not able to judge whether BunkerBoxeR was playing a proxy (with its proxy bunker) or a worker rush (since it sent SCVs in support). But it didn’t matter. The reactions are nearly the same. Since BunkerBoxeR never wants gas, stealing its gas was a waste.

overall

	total		ZvT		ZvP		ZvZ		ZvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
10HatchHydra	2	0%			2	0%
10Pool9Gas	1	0%			1	0%
10Pool9Hatch	1	0%			1	0%
11Gas10PoolLurker	41	32%			13	8%	28	43%
11Gas10PoolMuta	27	15%			12	8%	15	20%
11HatchTurtleHydra	51	43%			51	43%
11HatchTurtleLurker	2	0%			2	0%
11HatchTurtleMuta	15	20%			15	20%
12Hatch12Pool	2	0%			2	0%
12Hatch13Pool	1	0%	1	0%
12HatchTurtle	2	0%			2	0%
12Hatch_4HatchLing	4	0%			4	0%
2.5HatchMuta	5	0%	1	0%	3	0%	1	0%
2HatchHydra	2	0%			1	0%	1	0%
2HatchHydraBust	10	40%			10	40%
2HatchLingAllInSpire	19	16%	16	19%	3	0%
2HatchLurker	1	0%			1	0%
2HatchLurkerAllIn	1	0%	1	0%
3HatchHydra	3	0%			3	0%
3HatchHydraBust	13	31%			13	31%
3HatchHydraExpo	7	0%			7	0%
3HatchLateHydras	1	0%			1	0%
3HatchLateHydras+1	5	0%			5	0%
3HatchLing	3	0%			2	0%	1	0%
3HatchLingBust2	14	14%			13	15%	1	0%
3HatchLingExpo	12	8%			11	9%	1	0%
3HatchLurker	1	0%	1	0%
3HatchPoolMuta	1	0%	1	0%
4HatchBeforeGas	4	0%			4	0%
4HatchBeforeLair	9	0%			9	0%
4PoolSoft	1	0%			1	0%
5HatchBeforeGas	5	0%	1	0%	4	0%
5PoolHard	2	0%			1	0%	1	0%
5PoolHard2Player	2	0%			2	0%
5PoolSoft	2	0%	1	0%	1	0%
5Scout	31	68%	1	0%	2	0%			28	75%
6Pool	2	0%			1	0%	1	0%
6PoolSpeed	1	0%			1	0%
7-7HydraLingRush	93	77%	91	79%	1	0%	1	0%
7Pool12Hatch	1	0%			1	0%
7PoolMid	24	75%	24	75%
7PoolSoft	1	0%			1	0%
8-8HydraRush	2	0%			2	0%
8Gas7PoolLurker B	2	0%	1	0%	1	0%
8Hatch7Pool	2	0%			2	0%
8Hatch7PoolSpeed	19	16%			19	16%
8Pool	1	0%			1	0%
9GasLair	2	0%			1	0%	1	0%
9HatchExpo9Pool9Gas	35	91%	30	100%	2	0%	3	67%
9HatchMain9Pool9Gas	2	0%			1	0%	1	0%
9Pool	2	0%			2	0%
9PoolBurrow	3	0%			3	0%
9PoolExpo	42	100%	42	100%
9PoolHatch	21	76%			21	76%
9PoolLurker	10	30%					10	30%
9PoolSpeed	1	0%			1	0%
9PoolSpeedAllIn	3	0%			2	0%	1	0%
9PoolSpire	1	0%			1	0%
9PoolSunkHatch	43	88%	31	100%			12	58%
9PoolSunkSpeed	36	83%	27	100%			9	33%
AntiFact_13Pool	42	74%	42	74%
AntiFact_2Hatch	16	38%			16	38%
AntiFact_Overpool9Gas	2	0%			2	0%
AntiFactory	62	73%	61	74%	1	0%
AntiFactoryHydra	22	64%	22	64%
AntiZeal_12Hatch	11	0%			11	0%
DefilerRush	2	0%			2	0%
HiveRush	1	0%			1	0%
Over10Hatch	3	0%	1	0%	2	0%
Over10Hatch1Sunk	4	0%			4	0%
Over10Hatch2Sunk	35	80%			34	82%			1	0%
Over10Hatch2SunkHard	26	92%			26	92%
Over10HatchBust	28	25%			28	25%
Over10HatchSlowLings	1	0%			1	0%
Over10PoolMuta	1	0%			1	0%
OverhatchExpoLing	5	0%			5	0%
OverhatchExpoMuta	4	0%	1	0%	3	0%
OverhatchLateGas	2	0%			2	0%
OverhatchLing	1	0%					1	0%
OverhatchMuta	21	29%	1	0%			20	30%
Overpool+1	2	0%			2	0%
Overpool2HatchLurker	1	0%			1	0%
OverpoolHatch	1	0%			1	0%
OverpoolHydra	12	0%			12	0%
OverpoolLurker	1	0%					1	0%
OverpoolSpeed	6	0%			6	0%
OverpoolSunk	15	33%			2	0%	13	38%
OverpoolTurtle	85	93%			14	71%			71	97%
OverpoolTurtle 0	1	0%			1	0%
Overpool_4HatchLing	2	0%			2	0%
PurpleSwarmBuild	2	0%			1	0%	1	0%
Sparkle 1HatchMuta	9	11%			1	0%	8	12%
Sparkle 2HatchMuta	2	0%	1	0%	1	0%
ZvP_2HatchMuta	1	0%			1	0%
ZvP_3BaseSpire+Den	12	0%			12	0%
ZvP_3HatchPoolHydra	3	0%			3	0%
ZvP_4HatchPoolHydra	14	21%			14	21%
ZvP_Overpool3Hatch	2	0%			2	0%
ZvT_12PoolMuta	2	0%			2	0%
ZvT_3HatchMuta	2	0%	1	0%	1	0%
ZvT_7Pool	1	0%			1	0%
ZvZ_12Gas11Pool	1	0%			1	0%
ZvZ_12HatchExpo	1	0%					1	0%
ZvZ_12HatchMain	3	0%			1	0%	2	0%
ZvZ_12Pool	1	0%			1	0%
ZvZ_12PoolLing	1	0%			1	0%
ZvZ_12PoolLingB	1	0%			1	0%
ZvZ_Overgas9Pool	12	58%					12	58%
ZvZ_Overpool11Gas	1	0%					1	0%
ZvZ_Overpool9Gas	50	70%			10	30%	40	80%
ZvZ_OverpoolTurtle	12	25%					12	25%
total	1200	52%	400	78%	500	28%	200	42%	100	90%
openings played	111		24		93		29		3

Steamhammer knows 142 different openings. In the whole tournament, it was only able to try 111 of them! It tried the most openings versus protoss, since it was looking everywhere for an escape from the overwhelming top protoss bots. Most openings were tried only a few times and lost every game, which means that Steamhammer would have performed better without them. That’s expected and even intentional; my plan is to add smarts until it is able to make good guesses about what to try. The work is underway.

AIIDE 2019 - what Microwave learned

Microwave keeps its result files in the same format as UAlbertaBot: A file for each opponent, and in the file a list of strategies tried, each with win and loss counts. But Microwave independently restricts the win count and the loss count to not exceed 10. This amounts to intentionally forgetting older history when there has been a lot of it. The advantage is that Microwave adapts its strategies more quickly if the opponent shifts its play. The disadvantage, of course, is that information is thrown away. As a side effect, the numbers in the “total” and “overall” cells of my tables are not too informative.

This post looks at Microwave’s result_*.txt files for each opponent, since it’s what I’ve done before and I already had a script to parse them. This year Microwave also kept history_*.txt files with a record of each game. I could get a fuller picture of what Microwave did from the history files. I’m not sure whether Microwave uses the history files to make decisions. Still, if this is about “what Microwave learned,” then the result files are what Microwave learned, at least in large part.

Microwave has pre-learned data files for a number of opponents. Data from those files survived to be included in these tables. In other words, the tables here include not only tournament games, but in some cases preparation games played before the tournament.

Microwave’s author MicroDK commented that Microwave might have a bug in keeping its learning files, since the numbers did not always agree with official tournament results. As explained yesterday for UAlbertaBot, the player cannot always know what the tournament manager decides is the outcome of the game. Between that and the pre-learned data, I see reason to doubt that Microwave had a bug. But I didn’t look into details.

This year Microwave recorded a total of 32 strategies, compared to 19 last year. I tried to keep the tables tractable by breaking them down by opponent race, since not all strategies were tried against all races. Nevertheless, prepare to scroll right!

terran

It seems that most of Microwave’s openings worked about equally poorly against Iron, which is interesting and hard for me to explain. 2 hatch muta equals 3 hatch hydra bust equals 7 pool—among others? XiaoYi was vulnerable to rushes, and Microwave settled on 7 pool. After seeing the learning files of only 2 bots, I’m already getting a picture of XiaoYi as strong but apparently not robust against tricky strategies; UAlbertaBot chose DT rush against it. AITP and BunkerBoxeR were easy opponents and seem to have been vulnerable to the first thing Microwave tried, so that zerg never felt the need to vary.

The tournament had 100 rounds. Totals of more than 100 games versus an opponent, as versus Iron here, are a sign that pre-learned data was carried over. Microwave did not have time during the competition to try this many strategies so many times each.

#	bot	total	10Hatch9Pool9gas	2HatchHydra	2HatchLurker	2HatchLurkerAllIn	2HatchMuta	3HatchHydraBust	3HatchHydraExpo	3HatchLingBust	3HatchPoolHydra	4HatchBeforeGas	4PoolHard	4PoolSoft	5Pool	5PoolSpeed	6Pool	6PoolSpeed	7Pool	8Pool	9Pool	9PoolLurker	9PoolSpeed	9PoolSpeedLing	OverpoolTurtle	ZvZ_Overpool11Gas
8	iron	63-175 26%	1-5 17%	1-5 17%	1-5 17%	2-7 22%	5-10 33%	5-10 33%	1-5 17%	0-5 0%	1-5 17%	2-7 22%	5-10 33%	5-10 33%	1-5 17%	5-10 33%	1-5 17%	3-8 27%	5-10 33%	5-10 33%	3-8 27%	4-10 29%	2-7 22%	2-7 22%	3-8 27%	0-3 0%
9	xiaoyi	22-13 63%	0-2 0%	-	0-1 0%	-	0-2 0%	-	-	-	-	-	-	10-4 71%	1-1 50%	-	-	-	10-0 100%	-	1-1 50%	-	-	0-2 0%	-	-
12	aitp	10-0 100%	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	-	10-0 100%	-	-
13	bunkerboxer	10-0 100%	-	-	-	-	-	-	-	-	-	-	-	-	10-0 100%	-	-	-	-	-	-	-	-	-	-	-
overall		- 36%	1-7 12%	1-5 17%	1-6 14%	2-7 22%	5-12 29%	5-10 33%	1-5 17%	0-5 0%	1-5 17%	2-7 22%	5-10 33%	15-14 52%	12-6 67%	5-10 33%	1-5 17%	3-8 27%	15-10 60%	5-10 33%	4-9 31%	4-10 29%	2-7 22%	12-9 57%	3-8 27%	0-3 0%

protoss

Aha, Bananabrain had a weakness against Microwave’s 3 hatch zergling bust! I’ve seen the same on BASIL, e.g. Microwave-BananaBrain on Empire of the Sun, where BananaBrain was negligent in setting up the defense of its natural. In general, Bananabrain showed sensitivity to the opponent’s strategy; the other top protoss bots were more consistent.

#	bot	total	10Hatch9Pool9gas	2HatchHydra	2HatchLurker	2HatchLurkerAllIn	2HatchMuta	3HatchHydraBust	3HatchHydraExpo	3HatchLingBust	3HatchPoolHydra	4HatchBeforeGas	4PoolHard	4PoolSoft	5Pool	5PoolSpeed	6Pool	6PoolSpeed	7Pool	8Pool	9Pool	9PoolLurker	9PoolSpeed	9PoolSpeedLing	ZvP_10Hatch9Pool	ZvZ_Overgas11Pool	ZvZ_Overgas9Pool	ZvZ_Overpool11Gas
1	locutus	0-202 0%	0-10 0%	0-10 0%	0-10 0%	0-8 0%	0-10 0%	0-10 0%	-	0-8 0%	0-8 0%	0-8 0%	0-10 0%	0-10 0%	0-10 0%	0-8 0%	0-8 0%	0-8 0%	0-10 0%	0-8 0%	0-10 0%	0-7 0%	0-10 0%	0-7 0%	0-7 0%	-	-	0-7 0%
2	purplewave	26-154 14%	2-10 17%	2-9 18%	0-6 0%	0-5 0%	2-10 17%	2-10 17%	0-4 0%	3-10 23%	0-4 0%	0-4 0%	0-3 0%	3-10 23%	3-10 23%	0-3 0%	0-3 0%	0-3 0%	3-10 23%	2-9 18%	0-3 0%	0-5 0%	1-7 12%	3-10 23%	0-3 0%	-	-	0-3 0%
3	bananabrain	85-130 40%	1-3 25%	2-4 33%	2-5 29%	1-5 17%	2-4 33%	0-2 0%	1-4 20%	10-4 71%	1-4 20%	0-5 0%	6-7 46%	10-9 53%	10-9 53%	5-7 42%	3-5 38%	4-6 40%	1-3 25%	4-6 40%	4-6 40%	0-5 0%	2-4 33%	4-6 40%	0-2 0%	-	7-8 47%	5-7 42%
4	daqin	8-64 11%	2-9 18%	0-4 0%	0-4 0%	-	3-10 23%	0-3 0%	-	3-10 23%	0-3 0%	-	-	0-3 0%	0-3 0%	-	-	-	0-3 0%	-	0-3 0%	-	0-3 0%	0-3 0%	0-3 0%	-	-	-
10	mcrave	95-78 55%	2-2 50%	2-3 40%	5-3 62%	2-3 40%	10-4 71%	10-0 100%	0-4 0%	2-3 40%	1-2 33%	1-4 20%	3-2 60%	10-4 71%	1-4 20%	2-3 40%	5-4 56%	10-5 67%	4-5 44%	2-3 40%	7-4 64%	0-3 0%	3-3 50%	3-2 60%	2-2 50%	8-4 67%	-	0-2 0%
overall		- 25%	7-34 17%	6-30 17%	7-28 20%	3-21 12%	17-38 31%	12-25 32%	1-12 8%	18-35 34%	2-21 9%	1-21 5%	9-22 29%	23-36 39%	14-36 28%	7-21 25%	8-20 29%	14-22 39%	8-31 21%	8-26 24%	11-26 30%	0-20 0%	6-27 18%	10-28 26%	2-17 11%	8-4 67%	7-8 47%	5-19 21%

zerg

Steamhammer and ZZZKBot are opposite opponents, from Microwave’s point of view. Whatever worked against one did not work against the other. Most of the numbers in Steamhammer’s row, by the way, are from preparation games. I see the same numbers in the pre-learned data file. According to the history file, Microwave played its 9 pool speed opening in every game.

#	bot	total	12Pool	4PoolHard	4PoolSoft	5Pool	5PoolSpeed	6Pool	6PoolSpeed	7Pool	8Pool	9HatchMain8Pool8Gas	9PoolHatch	9PoolSpeed	9PoolSunken	OverpoolSpeed	OverpoolTurtle	ZvZ_Overgas11Pool	ZvZ_Overpool11Gas
5	steamhammer	26-16 62%	2-2 50%	-	-	1-2 33%	-	-	-	-	-	2-2 50%	1-2 33%	10-0 100%	1-2 33%	4-3 57%	-	-	5-3 62%
6	zzzkbot	67-70 49%	0-2 0%	1-5 17%	0-2 0%	10-7 59%	0-2 0%	0-2 0%	5-10 33%	1-5 17%	7-10 41%	10-7 59%	10-4 71%	0-2 0%	0-2 0%	3-4 43%	10-4 71%	10-0 100%	0-2 0%
overall		- 52%	2-4 33%	1-5 17%	0-2 0%	11-9 55%	0-2 0%	0-2 0%	5-10 33%	1-5 17%	7-10 41%	12-9 57%	11-6 65%	10-2 83%	1-4 20%	7-7 50%	10-4 71%	10-0 100%	5-5 50%

random

UAlbertaBot was the only random participant. It’s striking how similar openings can have different outcomes, though the numbers are noisy because the game counts intentionally limited and an opening that makes a bad first impression may not be repeated.

#	bot	total	4PoolHard	4PoolSoft	5Pool	5PoolSpeed	6Pool	6PoolSpeed	7Pool	8Pool	9PoolSpeedLing
11	ualbertabot	34-21 62%	5-4 56%	10-0 100%	4-4 50%	10-3 77%	4-3 57%	1-2 33%	0-2 0%	0-2 0%	0-1 0%
overall		- 62%	5-4 56%	10-0 100%	4-4 50%	10-3 77%	4-3 57%	1-2 33%	0-2 0%	0-2 0%	0-1 0%

AIIDE 2019 - what UAlbertaBot learned

#11 UAlbertaBot was one of the weaker participants, but no player shut it out. Even against #1 Locutus, UAlbertaBot scored 1 win and learned a little bit about its opponent. That also tells us something about each opponent.

The “total” column gives UAlbertaBot’s view of how many games it won and lost, which does not always line up with the tournament results. The results give UAlbertaBot 6 crashes, when it presumably could not record any information. Also if one side overstepped the frame time limit (UAlbertaBot never did), or if the game timed out and was decided on points (12 instances for UAlbertaBot), the player has no way to know what the tournament manager decided, and the two may disagree about who won. Something like that must explain why UAlbertaBot recorded 3 wins for itself against #2 PurpleWave when officially it won only 2 games. These issues cause difficulties for learning, but as long as most games finish normally it shouldn’t be serious.

#	bot	total	Terran	Terran	Terran	Terran	Protoss	Protoss	Protoss	Zerg	Zerg	Zerg	Zerg
#	bot	total	4RaxMarines	MarineRush	TankPush	VultureRush	DTRush	DragoonRush	ZealotRush	2HatchHydra	3HatchMuta	3HatchScourge	ZerglingRush
1	locutus	1-99 1%	0-9 0%	0-8 0%	0-8 0%	0-8 0%	0-11 0%	1-15 6%	0-10 0%	0-8 0%	0-8 0%	0-7 0%	0-7 0%
2	purplewave	3-93 3%	0-8 0%	0-8 0%	0-8 0%	0-8 0%	3-18 14%	0-8 0%	0-7 0%	0-7 0%	0-7 0%	0-7 0%	0-7 0%
3	bananabrain	16-82 16%	0-7 0%	1-10 9%	0-6 0%	0-6 0%	0-4 0%	0-4 0%	9-20 31%	0-4 0%	0-4 0%	0-4 0%	6-13 32%
4	daqin	21-77 21%	0-10 0%	0-9 0%	0-9 0%	0-9 0%	0-3 0%	3-8 27%	5-8 38%	0-2 0%	0-2 0%	0-2 0%	13-15 46%
5	steamhammer	9-89 9%	0-4 0%	8-9 47%	0-4 0%	0-4 0%	0-6 0%	1-9 10%	0-5 0%	0-12 0%	0-12 0%	0-12 0%	0-12 0%
6	zzzkbot	10-89 10%	0-8 0%	0-8 0%	0-8 0%	0-7 0%	0-4 0%	0-4 0%	8-16 33%	0-7 0%	0-7 0%	0-7 0%	2-13 13%
7	microwave	18-81 18%	0-7 0%	2-12 14%	0-7 0%	0-7 0%	0-3 0%	0-3 0%	13-11 54%	1-9 10%	0-5 0%	1-9 10%	1-8 11%
8	iron	9-90 9%	1-9 10%	1-8 11%	0-5 0%	0-5 0%	0-8 0%	0-8 0%	3-18 14%	0-6 0%	0-6 0%	0-6 0%	4-11 27%
9	xiaoyi	26-68 28%	0-4 0%	0-5 0%	4-13 24%	0-4 0%	21-5 81%	0-2 0%	0-5 0%	0-7 0%	0-7 0%	0-6 0%	1-10 9%
10	mcrave	56-44 56%	-	22-9 71%	-	-	0-4 0%	10-19 34%	0-5 0%	-	-	-	24-7 77%
12	aitp	71-23 76%	-	19-8 70%	-	-	-	-	24-5 83%	19-2 90%	7-2 78%	1-2 33%	1-4 20%
13	bunkerboxer	88-12 88%	-	34-4 89%	-	-	-	-	30-0 100%	-	-	-	24-8 75%
overall		- 28%	1-66 1%	87-98 47%	4-68 6%	0-58 0%	24-66 27%	15-80 16%	92-110 46%	20-64 24%	7-60 10%	2-62 3%	76-115 40%

UAlbertaBot was random. Its learning plan is to first play its best opening for each race (terran marine rush, protoss zealot rush, zerg zergling rush), and switch away only if it lost too often. If you are always losing, there is no harm in experimentation. Against strong opponents it tried everything, to little avail. Against weak opponents, the best opening might be reliable, so it did not try others.

UAlbertaBot’s configuration file has enemy-specific strategies defined for many historical opponents. In this tournament, 2 of them reappeared: Iron and ZZZKBot, and the declaration for ZZZKBot says “make the default choices.” I don’t see evidence in the table that UAlbertaBot paid attention to its Iron-specific strategies, so I watched replays to find out. It turned out as I expected, an enemy-specific strategy became the default strategy, the expected best opening, and if it failed severely enough (as it always did against Iron) then UAlbertaBot would try its other strategies.

The “overall” row across the bottom tells us that its best openings truly were the best. In most cases, it did no good to try alternatives. The notable exceptions are that the dark templar rush won against XiaoYi, while the 2 hatch hydra rush won against AITP (this suggests that AITP consistently followed a mech strategy). Of course, UAlbertaBot played random, which can confuse opponents that learn. It’s possible that a protoss bot that always rushed dark templar might do less well against XiaoYi, and so on.

Some openings were useless in the tournament, and UAlbertaBot would have done better without them. For example, the 3 hatchery scourge opening is designed to combat XIMP by Tomas Vajda, and scored miserably. The terran vulture rush made 58 losses and no wins at all, a weight pulling down the ranking.

There is more to learn from the table. Steamhammer had some trouble against the terran marine rush, but shut out the zealots and the zerglings. The other 2 zergs had more trouble against the hard zealot rush (which was historically difficult for zerg bots to cope with, at least zerg bots other than KillerBot by Marian Devecka). I think the difference ultimately reflects the skills of the different bots. Steamhammer has micro and defensive weaknesses against ranged units in general (the one loss against protoss was to the dragoon rush). Its opening learning is ingenious enough to cover the weakness, but only at the expense of losses against protoss and zerg. So instead Steamhammer’s learning converged on the idea of allowing the marines to win sometimes, and strictly controlling the other races. It’s counterintuitive but effective.

CIG 2018 - what Locutus learned

Locutus only recorded 8 games. It is configured to retain 200 game records, and I read the source code and verified that Locutus does not intentionally drop game records before the limit of 200. Recording exactly 8 games is the same problem that McRave suffered, and must be due to CIG problems. I don't know what the underlying problem was. My suspicion is that CIG organizers or tournament software may have accidentally or mistakenly cleared learning data for some bots. If that is what happened, and it happened once 8 games before the end of the tournament, it seems likely that it happened more than once. Who knows, though? The error might be somewhere else. Maybe they mistakenly shipped us data from after round 8 instead of round 125—in that case the tournament may have run normally, and only the data about it is wrong.

Locutus has prepared data for some opponents, stored in the AI directory. When Locutus finds it has no game records for a given opponent, it looks in AI to see if it has prepared data, and if so, it reads in those game records. At the end of the game, it writes out the prepared game records along with the record for the newly played game, and from then on the prepared records are treated like any others and retained unless and until the 200 record limit is passed.

How many other bots were affected by the 8 game problem?

Here is Locutus’s prepared data. Against some opponents, like McRave, Locutus picks out openings to avoid at first. If other openings don’t win either, I’m sure Locutus will come back and try these anyway. Against others, it picks out winners to try first. For some, it simply provides data. Most but not all of the prepared data is for opponents which were carried over from last year, for which pre-learning is sure to be helpful... if it is done on the same maps.

#3 mcrave

opening	games	wins
12Nexus5ZealotFECannons	1	0%
Turtle	1	0%
2 openings	2	0%

#6 iron

opening	games	wins
DTDrop	14	100%
1 openings	14	100%

#7 zzzkbot

opening	games	wins
ForgeExpand5GateGoon	2	100%
1 openings	2	100%

#11 ualbertabot

opening	games	wins
4GateGoon	1	0%
9-9GateDefensive	2	50%
ForgeExpand5GateGoon	15	93%
3 openings	18	83%

#14 aiur

opening	games	wins
4GateGoon	3	100%
9-9GateDefensive	1	100%
2 openings	4	100%

#16 ziabot

opening	games	wins
9-9GateDefensive	1	0%
ForgeExpand5GateGoon	1	100%
2 openings	2	50%

#19 terranuab

opening	games	wins
DTDrop	10	100%
1 openings	10	100%

#21 opprimobot

opening	games	wins
DTDrop	11	100%
1 openings	11	100%

#22 sling

opening	games	wins
ForgeExpand5GateGoon	2	100%
1 openings	2	100%

#23 srbotone

opening	games	wins
DTDrop	7	100%
PlasmaProxy2Gate	1	100%
2 openings	8	100%

#24 bonjwa

opening	games	wins
DTDrop	6	100%
PlasmaProxy2Gate	1	100%
2 openings	7	100%

overall

	total		PvT		PvP		PvZ		PvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
12Nexus5ZealotFECannons	1	0%			1	0%
4GateGoon	4	75%			3	100%			1	0%
9-9GateDefensive	4	50%			1	100%	1	0%	2	50%
DTDrop	48	100%	48	100%
ForgeExpand5GateGoon	20	95%					5	100%	15	93%
PlasmaProxy2Gate	2	100%	2	100%
Turtle	1	0%			1	0%
total	80	92%	50	100%	6	67%	6	83%	18	83%
openings played	7		2		4		2		3

Here is Locutus’s learned data. In every case, the number of games recorded is 8 plus the number of games in the prepared data. With only 8 games there is not much to go on, but the prepared data does seem to have helped Locutus choose successful openings.

#2 purplewave

opening	games	wins
12Nexus5ZealotFECannons	1	0%
4GateGoon	1	0%
9-9GateDefensive	5	80%
Proxy9-9Gate	1	0%
4 openings	8	50%

#3 mcrave

opening	games	wins
12Nexus5ZealotFECannons	1	0%
4GateGoon	3	67%
Proxy9-9Gate	5	100%
Turtle	1	0%
4 openings	10	70%

#4 tscmoo

opening	games	wins
4GateGoon	1	0%
9-9GateDefensive	1	0%
ForgeExpand5GateGoon	4	25%
Proxy9-9Gate	2	50%
4 openings	8	25%

#5 isamind

opening	games	wins
4GateGoon	6	83%
9-9GateDefensive	1	100%
Proxy9-9Gate	1	100%
3 openings	8	88%

#6 iron

opening	games	wins
DTDrop	22	95%
1 openings	22	95%

#7 zzzkbot

opening	games	wins
ForgeExpand5GateGoon	7	86%
ForgeExpandSpeedlots	2	50%
Proxy9-9Gate	1	0%
3 openings	10	70%

#8 microwave

opening	games	wins
ForgeExpand5GateGoon	8	100%
1 openings	8	100%

#9 letabot

opening	games	wins
DTDrop	8	88%
1 openings	8	88%

#10 megabot

opening	games	wins
4GateGoon	8	100%
1 openings	8	100%

#11 ualbertabot

opening	games	wins
4GateGoon	1	0%
9-9GateDefensive	2	50%
ForgeExpand5GateGoon	23	91%
3 openings	26	85%

#12 tyr

opening	games	wins
4GateGoon	8	100%
1 openings	8	100%

#13 ecgberht

opening	games	wins
DTDrop	8	88%
1 openings	8	88%

#14 aiur

opening	games	wins
12Nexus5ZealotFECannons	1	0%
2GateDTExpo	1	100%
4GateGoon	5	80%
9-9GateDefensive	1	100%
Proxy9-9Gate	4	75%
5 openings	12	75%

#15 titaniron

opening	games	wins
DTDrop	8	100%
1 openings	8	100%

#16 ziabot

opening	games	wins
9-9GateDefensive	1	0%
ForgeExpand5GateGoon	6	83%
ForgeExpandSpeedlots	2	50%
Proxy9-9Gate	1	100%
4 openings	10	70%

#17 steamhammer

opening	games	wins
ForgeExpand5GateGoon	8	100%
1 openings	8	100%

#18 overkill

opening	games	wins
ForgeExpand5GateGoon	8	100%
1 openings	8	100%

#19 terranuab

opening	games	wins
DTDrop	18	100%
1 openings	18	100%

#20 cunybot

opening	games	wins
ForgeExpand5GateGoon	8	100%
1 openings	8	100%

#21 opprimobot

opening	games	wins
DTDrop	19	100%
1 openings	19	100%

#22 sling

opening	games	wins
ForgeExpand5GateGoon	10	100%
1 openings	10	100%

#23 srbotone

opening	games	wins
DTDrop	15	100%
PlasmaProxy2Gate	1	100%
2 openings	16	100%

#24 bonjwa

opening	games	wins
DTDrop	14	100%
PlasmaProxy2Gate	1	100%
2 openings	15	100%

#25 stormbreaker

opening	games	wins
ForgeExpand5GateGoon	8	100%
1 openings	8	100%

#26 korean

opening	games	wins
ForgeExpand5GateGoon	8	100%
1 openings	8	100%

#27 salsa

opening	games	wins
ForgeExpand5GateGoon	8	100%
1 openings	8	100%

overall

	total		PvT		PvP		PvZ		PvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
12Nexus5ZealotFECannons	3	0%			3	0%
2GateDTExpo	1	100%			1	100%
4GateGoon	33	82%			31	87%			2	0%
9-9GateDefensive	11	64%			7	86%	1	0%	3	33%
DTDrop	112	97%	112	97%
ForgeExpand5GateGoon	106	93%					79	97%	27	81%
ForgeExpandSpeedlots	4	50%					4	50%
PlasmaProxy2Gate	2	100%	2	100%
Proxy9-9Gate	15	73%			11	82%	2	50%	2	50%
Turtle	1	0%			1	0%
total	288	90%	114	97%	54	80%	86	93%	34	71%
openings played	10		2		6		4		4

CIG 2018 - what Steamhammer learned

I wrote a new script to analyze Steamhammer’s learning data. A couple points: 1. Steamhammer crashed in nearly half of its games in CIG 2018. It can’t save learning data after a crash, so against some opponents Steamhammer had few opportunities to experiment. The number of crashes varied strongly depending on the opponent. 2. Steamhammer was set to remember the previous 100 games, since I figure there’s no play advantage to remembering more. The tournament was 125 rounds long. So in the tables below, “100 games” means that Steamhammer played at least 100 games without crashing, and up to 25 games may have been dropped, the early games. Against some weak opponents, Steamhammer learned, within 25 games, how to win 100% of the remaining games, and those tables give a 100% win rate for remembered games. Steamhammer did not score 100% against any opponent overall; it always had some losses in early games.

I should be able to run the same analysis for Steamhammer forks which retain Steamhammer’s opponent model file format.

#1 Locutus

opening	games	wins
2HatchHydraBust	1	0%
3HatchHydraExpo	2	0%
3HatchLingBust	1	0%
3HatchLingExpo	1	0%
4HatchBeforeGas	1	0%
OverpoolSpeed	9	56%
6 openings	15	33%

A mystery is solved. Why was Steamhammer’s crash rate higher than I expected? Because many opponents learned to make Steamhammer crash. A crash for the opponent is a win, and the bot doesn’t care how it wins, so if it can learn a plan that makes the opponent crash reliably, it will. The stronger opponents tend to be learning bots, so Steamhammer crashed more often on average against strong opponents. This also means that my glib conclusion that Steamhammer won 66% of non-crash games, so it seems to have kept up with general progress is not sound. The non-crash games were mostly against weak opponents.

Locutus was lucky that it could figure out how to break Steamhammer. As Bruce mentioned in a comment, this Locutus version had a bug when facing certain zergling timings, and Steamhammer quickly figured out how to exploit the bug. It’s possible that Steamhammer minus the crash would have upset Locutus.

#2 PurpleWave

opening	games	wins
11Gas10PoolMuta	1	0%
3HatchHydra	3	0%
3HatchLurker	1	0%
4PoolSoft	1	0%
7Pool12Hatch	1	0%
7PoolSoft	1	0%
9Hatch8Pool	1	0%
9HatchExpo9Pool9Gas	1	0%
9PoolSpeed	1	0%
AntiFactory	1	0%
Over10Hatch	6	0%
Over10Hatch1Sunk	7	0%
Over10Hatch2Sunk	18	0%
Over10HatchBust	1	0%
Over10HatchSlowLings	4	0%
OverhatchMuta	1	0%
OverpoolHatch	1	0%
OverpoolTurtle	3	0%
ZvP_3HatchPoolHydra	2	0%
ZvP_4HatchPoolHydra	1	0%
ZvT_12PoolMuta	1	0%
ZvZ_Overpool11Gas	1	0%
22 openings	58	0%

PurpleWave shut out Steamhammer. It didn’t learn to make Steamhammer crash because every game was a win for it anyway. Steamhammer desperately tried alternatives all over the map, including crazy all-ins and openings intended for ZvT and ZvZ, and nothing worked.

#3 McRave

opening	games	wins
11Gas10PoolLurker	1	0%
4HatchBeforeGas	1	0%
9HatchExpo9Pool9Gas	1	0%
9PoolSpeed	5	100%
ZvP_3HatchPoolHydra	2	0%
5 openings	10	50%

#4 tscmoo

opening	games	wins
9PoolExpo	1	0%
9PoolHatch	1	0%
9PoolSunkHatch	1	0%
AntiFact_2Hatch	1	0%
Over10Hatch2Sunk	1	0%
OverhatchExpoLing	13	15%
OverpoolSpeed	22	23%
7 openings	40	18%

#5 ISAMind

opening	games	wins
3HatchHydraExpo	1	0%
4HatchBeforeGas	1	0%
OverpoolSpeed	4	100%
ZvP_2HatchMuta	7	0%
ZvP_3HatchPoolHydra	6	0%
5 openings	19	21%

#6 Iron

opening	games	wins
2HatchHydra	1	0%
3HatchLingExpo	2	0%
4PoolHard	1	0%
6PoolSpeed	1	0%
9Hatch8Pool	1	0%
9HatchMain9Pool9Gas	1	0%
9PoolSunkSpeed	1	0%
AntiFact_13Pool	4	0%
AntiFact_2Hatch	83	12%
AntiFactory	1	0%
Over10Hatch	1	0%
PurpleSwarmBuild	1	0%
ZvP_2HatchMuta	1	0%
ZvT_12PoolMuta	1	0%
14 openings	100	10%

Iron is not a learning bot, so it did not learn to crash Steamhammer. Still, these results show a weakness in Steamhammer: Its best opening against Iron is AntiFactory, which it tried only once in these 100 games. Steamhammer did not explore enough. I tried to fix the weakness in Steamhammer 2.0.

#7 ZZZKBot

opening	games	wins
11Gas10PoolMuta	1	0%
8Pool	7	29%
9HatchMain9Pool9Gas	1	0%
9PoolSpeed	1	0%
OverhatchMuta	1	0%
Overpool+1	1	0%
OverpoolSpeed	1	0%
ZvZ_12HatchMain	2	0%
ZvZ_12Pool	1	0%
ZvZ_12PoolLing	48	58%
ZvZ_Overgas9Pool	2	0%
ZvZ_Overpool9Gas	2	0%
12 openings	68	44%

#8 Microwave

opening	games	wins
9PoolSunkHatch	5	80%
9PoolSunkSpeed	27	67%
OverpoolSunk	1	0%
OverpoolTurtle	3	33%
ZvZ_12PoolLing	1	0%
5 openings	37	62%

This looks like successful learning. Too bad Steamhammer only successfully played 37 of the 125 games.

#9 LetaBot

opening	games	wins
11Gas10PoolLurker	1	0%
2HatchLurkerAllIn	4	0%
3HatchHydraExpo	1	0%
3HatchLurker	13	38%
9HatchExpo9Pool9Gas	45	36%
OverpoolLurker	13	31%
ZvP_2HatchMuta	1	0%
ZvT_12PoolMuta	1	0%
ZvT_13Pool	1	0%
ZvT_3HatchMuta	1	0%
10 openings	81	31%

#10 MegaBot

opening	games	wins
11Gas10PoolLurker	1	0%
3HatchHydra	1	0%
3HatchHydraExpo	1	0%
3HatchLingExpo	21	43%
Over10Hatch	1	0%
OverhatchExpoLing	1	100%
ZvP_3HatchPoolHydra	2	0%
7 openings	28	36%

#11 UAlbertaBot

opening	games	wins
3HatchLingExpo	1	0%
5PoolHard2Player	1	0%
9PoolExpo	1	0%
9PoolSpeed	1	0%
9PoolSunkHatch	46	33%
9PoolSunkSpeed	29	48%
Over10Hatch1Sunk	2	0%
OverpoolSpeed	1	0%
ZvZ_Overpool9Gas	1	0%
9 openings	83	35%

#12 Tyr

opening	games	wins
9PoolHatch	5	100%
ZvP_3HatchPoolHydra	5	0%
2 openings	10	50%

#13 Ecgberht

opening	games	wins
11Gas10PoolLurker	10	50%
2HatchLurker	23	61%
2HatchLurkerAllIn	44	75%
Over10HatchBust	3	33%
OverpoolLurker	8	75%
OverpoolSpeed	3	33%
ZvT_13Pool	1	0%
7 openings	92	65%

#14 Aiur

opening	games	wins
11Gas10PoolLurker	1	100%
5PoolHard2Player	1	100%
9PoolSunkHatch	1	100%
9PoolSunkSpeed	2	100%
Over10Hatch	1	0%
Over10Hatch1Sunk	2	50%
Over10Hatch2Hard	1	100%
Over10HatchSlowLings	1	100%
OverpoolSpeed	2	100%
OverpoolTurtle	3	67%
10 openings	15	80%

#15 TitanIron

opening	games	wins
3HatchLingBust	1	0%
AntiFact_13Pool	6	50%
AntiFact_2Hatch	1	0%
AntiFactory	74	42%
Over10Hatch2Sunk	1	0%
OverhatchExpoMuta	1	0%
OverpoolLurker	1	0%
ZvZ_Overgas9Pool	14	21%
ZvZ_Overpool9Gas	1	0%
9 openings	100	37%

This selection of openings implies that TitanIron plays a factory-first build against zerg, like Iron, and is a non-learning bot, like Iron. Later I’ll look into the source and find out for sure.

#16 Ziabot

opening	games	wins
11Gas10PoolMuta	4	25%
2.5HatchMuta	1	0%
3HatchHydraBust	1	0%
6PoolSpeed	1	0%
8Pool	7	71%
9Hatch8Pool	1	0%
9PoolHatch	4	50%
ZvP_2HatchTurtle	1	0%
ZvZ_12Pool	1	0%
ZvZ_12PoolMain	16	25%
ZvZ_Overpool11Gas	10	50%
ZvZ_Overpool9Gas	53	74%
12 openings	100	56%

Low win rates against Zia and some other opponents suggest to me that Steamhammer had other new weaknesses besides crashing. I think Steamhammer should score over 80% against Zia.

#18 Overkill

opening	games	wins
11Gas10PoolMuta	10	90%
4PoolHard	23	96%
6PoolSpeed	28	100%
9Hatch8Pool	1	0%
OverhatchLing	2	50%
OverpoolSpeed	13	92%
ZvZ_12HatchExpo	2	50%
ZvZ_12PoolMain	1	0%
8 openings	80	91%

#19 TerranUAB

opening	games	wins
2HatchLurker	52	90%
AntiFact_13Pool	8	88%
AntiFact_2Hatch	9	78%
AntiFactory	31	90%
4 openings	100	89%

#20 CUNYbot

opening	games	wins
11Gas10PoolMuta	9	78%
OverhatchLing	34	97%
ZvZ_12PoolLing	27	96%
ZvZ_Overgas9Pool	1	0%
ZvZ_Overpool9Gas	19	89%
5 openings	90	92%

#21 OpprimoBot

opening	games	wins
11Gas10PoolLurker	3	67%
2HatchLurker	2	50%
2HatchLurkerAllIn	6	83%
6PoolSpeed	19	100%
OverpoolLurker	1	0%
OverpoolSpeed	5	80%
ZvT_12PoolMuta	20	95%
ZvT_3HatchMuta	20	100%
ZvT_3HatchMutaExpo	24	100%
9 openings	100	94%

#22 Sling

opening	games	wins
4PoolHard	4	75%
4PoolSoft	6	100%
5PoolHard2Player	3	100%
ZvZ_12HatchMain	1	0%
ZvZ_Overgas9Pool	1	0%
5 openings	15	80%

The selection of fast rush openings suggests that Sling played a macro strategy which was countered by fast rushes. But I don’t want to draw strong conclusions based on 15 non-crash games out of 125.

#23 SRbotOne

opening	games	wins
11Gas10PoolLurker	14	93%
2HatchLurker	10	90%
2HatchLurkerAllIn	10	90%
3HatchLurker	17	100%
4PoolSoft	17	100%
5PoolHard	7	100%
9HatchExpo9Pool9Gas	4	75%
9PoolLurker	3	100%
OverpoolLurker	5	100%
9 openings	87	95%

The wide range of lurker openings means that SRbotOne by Johan Kayser fought with mostly barracks units. Well, we already knew that.

#24 Bonjwa

opening	games	wins
9PoolExpo	6	100%
9PoolSunkHatch	5	100%
9PoolSunkSpeed	5	100%
AntiFact_2Hatch	3	100%
AntiFactory	5	100%
ZvT_2HatchMuta	1	100%
6 openings	25	100%

#25 Stormbreaker

opening	games	wins
11Gas10PoolMuta	1	100%
4PoolHard	1	100%
9PoolSunkHatch	8	100%
9PoolSunkSpeed	8	100%
OverhatchLing	1	100%
OverhatchMuta	7	100%
OverpoolSpeed	1	100%
OverpoolSunk	7	100%
ZvZ_12HatchExpo	2	100%
ZvZ_12HatchMain	3	100%
ZvZ_12PoolLing	1	100%
ZvZ_12PoolMain	3	100%
12 openings	43	100%

#26 Korean

opening	games	wins
4PoolHard	1	100%
4PoolSoft	3	100%
5PoolHard	5	100%
5PoolHard2Player	3	100%
5PoolSoft	1	100%
6PoolSpeed	6	100%
OverhatchLing	9	100%
OverhatchMuta	12	100%
ZvZ_12HatchExpo	13	100%
ZvZ_12HatchMain	16	100%
ZvZ_12PoolLing	14	100%
ZvZ_12PoolMain	17	100%
12 openings	100	100%

#27 Salsa

opening	games	wins
4PoolHard	2	100%
4PoolSoft	4	100%
5PoolHard	7	100%
5PoolHard2Player	1	100%
5PoolSoft	1	100%
6PoolSpeed	8	100%
OverhatchLing	11	100%
OverhatchMuta	8	100%
ZvZ_12HatchExpo	12	100%
ZvZ_12HatchMain	20	100%
ZvZ_12PoolLing	13	100%
ZvZ_12PoolMain	12	100%
ZvZ_Overgas9Pool	1	100%
13 openings	100	100%

overall

	total		ZvT		ZvP		ZvZ		ZvR
opening	games	wins	games	wins	games	wins	games	wins	games	wins
11Gas10PoolLurker	31	68%	28	71%	3	33%
11Gas10PoolMuta	26	69%			1	0%	25	72%
2.5HatchMuta	1	0%					1	0%
2HatchHydra	1	0%	1	0%
2HatchHydraBust	1	0%			1	0%
2HatchLurker	87	82%	87	82%
2HatchLurkerAllIn	64	73%	64	73%
3HatchHydra	4	0%			4	0%
3HatchHydraBust	1	0%					1	0%
3HatchHydraExpo	5	0%	1	0%	4	0%
3HatchLingBust	2	0%	1	0%	1	0%
3HatchLingExpo	25	36%	2	0%	22	41%			1	0%
3HatchLurker	31	71%	30	73%	1	0%
4HatchBeforeGas	3	0%			3	0%
4PoolHard	32	91%	1	0%			31	94%
4PoolSoft	31	97%	17	100%	1	0%	13	100%
5PoolHard	19	100%	7	100%			12	100%
5PoolHard2Player	9	89%			1	100%	7	100%	1	0%
5PoolSoft	2	100%					2	100%
6PoolSpeed	63	97%	20	95%			43	98%
7Pool12Hatch	1	0%			1	0%
7PoolSoft	1	0%			1	0%
8Pool	14	50%					14	50%
9Hatch8Pool	4	0%	1	0%	1	0%	2	0%
9HatchExpo9Pool9Gas	51	37%	49	39%	2	0%
9HatchMain9Pool9Gas	2	0%	1	0%			1	0%
9PoolExpo	8	75%	6	100%					2	0%
9PoolHatch	10	70%			5	100%	4	50%	1	0%
9PoolLurker	3	100%	3	100%
9PoolSpeed	8	62%			6	83%	1	0%	1	0%
9PoolSunkHatch	66	50%	5	100%	1	100%	13	92%	47	32%
9PoolSunkSpeed	72	65%	6	83%	2	100%	35	74%	29	48%
AntiFact_13Pool	18	56%	18	56%
AntiFact_2Hatch	97	21%	96	21%					1	0%
AntiFactory	112	57%	111	58%	1	0%
Over10Hatch	9	0%	1	0%	8	0%
Over10Hatch1Sunk	11	9%			9	11%			2	0%
Over10Hatch2Hard	1	100%			1	100%
Over10Hatch2Sunk	20	0%	1	0%	18	0%			1	0%
Over10HatchBust	4	25%	3	33%	1	0%
Over10HatchSlowLings	5	20%			5	20%
OverhatchExpoLing	14	21%			1	100%			13	15%
OverhatchExpoMuta	1	0%	1	0%
OverhatchLing	57	96%					57	96%
OverhatchMuta	29	93%			1	0%	28	96%
Overpool+1	1	0%					1	0%
OverpoolHatch	1	0%			1	0%
OverpoolLurker	28	54%	28	54%
OverpoolSpeed	61	56%	8	62%	15	73%	15	87%	23	22%
OverpoolSunk	8	88%					8	88%
OverpoolTurtle	9	33%			6	33%	3	33%
PurpleSwarmBuild	1	0%	1	0%
ZvP_2HatchMuta	9	0%	2	0%	7	0%
ZvP_2HatchTurtle	1	0%					1	0%
ZvP_3HatchPoolHydra	17	0%			17	0%
ZvP_4HatchPoolHydra	1	0%			1	0%
ZvT_12PoolMuta	23	83%	22	86%	1	0%
ZvT_13Pool	2	0%	2	0%
ZvT_2HatchMuta	1	100%	1	100%
ZvT_3HatchMuta	21	95%	21	95%
ZvT_3HatchMutaExpo	24	100%	24	100%
ZvZ_12HatchExpo	29	97%					29	97%
ZvZ_12HatchMain	42	93%					42	93%
ZvZ_12Pool	2	0%					2	0%
ZvZ_12PoolLing	104	79%					104	79%
ZvZ_12PoolMain	49	73%					49	73%
ZvZ_Overgas9Pool	19	21%	14	21%			5	20%
ZvZ_Overpool11Gas	11	45%			1	0%	10	50%
ZvZ_Overpool9Gas	76	74%	1	0%			74	76%	1	0%
total	1596	64%	685	62%	155	26%	633	82%	123	29%
openings played	69		37		36		31		13

This summary table took me hours to get right, so I hope it's useful.

Steamhammer played 69 openings in 1596 non-crash games, which is around 2/3rds of the openings it knows. No single matchup had more than 37 different openings. There were far more games against terran and zerg than against protoss and random, partly due to the crashing pattern. Against the random opponents (Tscmoo and UAlbertaBot), it settled on mostly general-purpose openings, as you might expect. Its best matchup was ZvZ, with a Jaedong-like 82% win rate (and lately, Jaedong crashes half the time too, so they’re just alike).

Openings that were both popular and successful include 2HatchLurker and 2HatchLurkerAllIn versus terran, 6PoolSpeed with a 97% win rate against mostly weak opponents, 9PoolSunkSpeed used across all matchups, and ZvZ specialties OverhatchLing, ZvZ_12PoolLing, and ZvZ_Overpool9Gas. None of the opening choices surprises me, though some of the win rates do.

CIG 2018 - what Overkill learned

After analyzing AIUR yesterday, I ran a similar (but much simpler) analysis for the classic zerg #18 Overkill. The version in CIG 2018 has not been updated since 2015 and is the same version that still plays on SSCAIT. In 2015 it was a sensation, placing 3rd in both CIG and AIIDE—its place of 18 in this tournament, with about 35% win rate, suggests huge progress over the past 3 years. But keep reading; Overkill appears to have been broken in this tournament. I did this analysis once before: See what Overkill learned in AIIDE 2015.

Classic Overkill knows 3 openings, a 9 pool opening which stays on one base for a good time, and 10- and 12-hatch openings to get mutalisks first. When it chooses 9 pool, that means that the opponent is either rushing (so the 9 pool is necessary to defend) or is being too greedy (which the 9 pool can exploit). Overkill counts some games twice in an attempt to learn faster, so sometimes its total game count is larger than the number of rounds in the tournament (125).

	NinePoolling		TenHatchMuta		TwelveHatchMuta		total
opponent	n	win	n	win	n	win	n	win
#1 Locutus	42	0%	42	0%	41	0%	125	0%
#2 PurpleWave	43	0%	43	0%	42	0%	128	0%
#3 McRave	44	0%	44	0%	43	0%	131	0%
#4 tscmoo	40	0%	40	0%	47	2%	127	1%
#5 ISAMind	42	0%	42	0%	41	0%	125	0%
#6 Iron	54	7%	32	0%	39	3%	125	4%
#7 ZZZKBot	47	2%	39	0%	47	2%	133	2%
#8 Microwave	54	6%	35	0%	42	2%	131	3%
#9 LetaBot	52	6%	33	0%	40	2%	125	3%
#10 MegaBot	60	12%	24	0%	41	7%	125	8%
#11 UAlbertaBot	41	0%	41	0%	48	2%	130	1%
#12 Tyr	40	0%	39	0%	47	2%	126	1%
#13 Ecgberht	57	16%	24	4%	42	12%	123	12%
#14 Aiur	94	34%	14	7%	17	12%	125	28%
#15 TitanIron	36	11%	20	0%	69	16%	125	12%
#16 Ziabot	16	0%	16	0%	93	23%	125	17%
#17 Steamhammer	107	48%	7	0%	10	10%	124	42%
#19 TerranUAB	24	67%	3	0%	98	83%	125	78%
#20 CUNYbot	18	44%	6	17%	101	66%	125	61%
#21 OpprimoBot	36	67%	3	0%	86	76%	125	71%
#22 Sling	67	46%	6	0%	52	42%	125	42%
#23 SRbotOne	23	74%	4	25%	95	89%	122	84%
#24 Bonjwa	75	92%	4	25%	46	87%	125	88%
#25 Stormbreaker	70	91%	2	0%	53	87%	125	88%
#26 Korean	77	99%	2	0%	46	93%	125	95%
#27 Salsa	46	100%	32	94%	46	100%	124	98%
total	1305	36%	597	6%	1372	40%	3274	32%

The 10 hatch opening was useless in this tournament—against every opponent, 10 hatch was the worst choice, at best tying for 0. In 2015, 10 hatch was about as successful as the other openings.

Signs are that something was wrong with Overkill in this tournament. In AIIDE 2015, then #3 Overkill scored 23% against then #4 UAlbertaBot, 68% against #5 AIUR, and 99% against #17 OpprimoBot. In CIG 2018, it was 1.6% against UAlbertaBot, 28% against AIUR, 71% against OpprimoBot. All versions appear to be the same in both tournaments—I didn’t look closely, but I did unpack the sources and check dates (in particular, Overkill has file change dates up to 8 October 2015 in both tournaments). Overkill had 14 crash games in CIG 2018, not enough to account for the difference. It’s hard to believe that the maps could have shifted results that much.

Tomorrow: What went wrong with Overkill?

CIG 2018 - what AIUR learned

Here is what the classic protoss bot AIUR learned about each opponent over the course of CIG 2018. AIUR has not been updated in many years and has fallen behind the state of the art, but its varied strategies and learning still make it a tricky opponent in a long tournament. Seeing AIUR's counters for each opponent tells us something about how the opponent played. For past editions, see AIIDE 2017 what AIUR learned and what AIUR learned (AIIDE 2015).

This is generated from data in AIUR's final write directory. There were 125 rounds and 5 maps, one 2-player and two each 3- and 4-player maps. For some opponents, all games were recorded, giving 25 games on the 2-player map and 50 games each on 3- and 4-player maps. For most opponents, fewer games were recorded. AIUR recorded 2932 games, and the results table lists 318 crashes for AIUR. 2932 + 318 = 3250, the correct total game count. Unrecorded games were lost due to crashes, and for no other reason.

First the overview, summing across all opponents.

overall	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	72	49%	127	65%	132	35%	331	49%
rush	29	41%	269	33%	261	55%	559	44%
aggressive	13	23%	225	68%	184	78%	422	71%
fast expo	33	24%	185	48%	207	48%	425	46%
macro	46	33%	180	52%	135	60%	361	53%
defensive	141	75%	314	73%	379	55%	834	65%
total	334	54%	1300	56%	1298	56%	2932	56%

2, 3, 4 - map size, the number of starting positions
n - games recorded
wins - winning percentage over those games
cheese - cannon rush
rush - dark templar rush
aggressive - fast 4 zealot drop
fast expo - nexus first
macro - aim for a strong middle game army
defensive - try to be safe against rushes

Looking across the bottom row, you can see that AIUR had a plus score on every size of map, and that it had to choose different strategies to do so well. It's a strong result for a bot which has essentially no micro skills and has not been updated since 2014. It does still have the best cannon rush of any bot, if you ask me.

#1 locutus	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	8	0%	25	12%	34	9%
rush	1	0%	10	0%	6	0%	17	0%
aggressive	1	0%	4	0%	5	0%	10	0%
fast expo	1	0%	14	0%	5	0%	20	0%
macro	1	0%	7	0%	4	0%	12	0%
defensive	1	0%	7	14%	5	0%	13	8%
total	6	0%	50	2%	50	6%	106	4%

Even against the toughest opponents, AIUR can scrape a small edge with learning. Against Locutus, it pulled barely above zero, but got a few extra wins because it discovered that its cannon rush occasionally scores on 4-player maps. Results against PurpleWave below are similar. I suspect that if AIUR had played the cannon rush every game, Locutus would have adapted and nullified the edge. Maybe it did, and that’s why the edge is so small.

#2 purplewave	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	8	0%	39	18%	48	15%
rush	1	0%	8	0%	2	0%	11	0%
aggressive	1	0%	10	0%	3	0%	14	0%
fast expo	4	0%	8	0%	2	0%	14	0%
macro	1	0%	10	0%	2	0%	13	0%
defensive	3	0%	6	0%	2	0%	11	0%
total	11	0%	50	0%	50	14%	111	6%

#3 mcrave	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	100%	1	0%	1	0%	3	33%
rush	1	0%	41	2%	1	0%	43	2%
aggressive	0	0%	2	0%	3	0%	5	0%
fast expo	1	0%	1	0%	42	17%	44	16%
macro	1	0%	3	0%	1	0%	5	0%
defensive	1	0%	2	0%	2	0%	5	0%
total	5	20%	50	2%	50	14%	105	9%

Against McRave, the choice is nexus first. McRave must have settled on a macro opening itself.

#4 tscmoo	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	11	27%	1	0%	1	0%	13	23%
rush	1	0%	1	0%	3	0%	5	0%
aggressive	1	0%	11	9%	1	0%	13	8%
fast expo	5	20%	33	15%	1	0%	39	15%
macro	1	0%	2	0%	22	14%	25	12%
defensive	1	0%	2	0%	22	18%	25	16%
total	20	20%	50	12%	50	14%	120	14%

Against the unpredictable Tscmoo, AIUR wavered before settling on an unpredictable set of answers. Notice that not all the strategies are well explored: If you win less than 1 game in 5, then playing an opening 3 times is not enough. If the tournament were much longer, AIUR would likely have scored higher because of its slow but effective learning.

#5 isamind	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	2	0%	4	0%	7	0%
rush	1	100%	37	19%	38	8%	76	14%
aggressive	0	0%	1	0%	3	0%	4	0%
fast expo	1	0%	5	0%	2	0%	8	0%
macro	1	0%	1	0%	2	0%	4	0%
defensive	1	0%	4	0%	1	0%	6	0%
total	5	20%	50	14%	50	6%	105	10%

ISAMind may be based on Locutus, but unlike Locutus it is vulnerable to AIUR’s dark templar rushes. It’s a sign that it is not as mature and well tested.

#6 iron	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	1	0%	5	0%	7	0%
rush	1	0%	26	19%	2	0%	29	17%
aggressive	0	0%	2	0%	2	0%	4	0%
fast expo	1	0%	1	0%	31	10%	33	9%
macro	1	0%	19	5%	4	0%	24	4%
defensive	1	0%	1	0%	6	0%	8	0%
total	5	0%	50	12%	50	6%	105	9%

#7 zzzkbot	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	4	0%	2	0%	2	0%	8	0%
rush	4	0%	4	0%	1	0%	9	0%
aggressive	3	0%	2	0%	1	0%	6	0%
fast expo	3	0%	3	0%	1	0%	7	0%
macro	7	0%	5	0%	4	0%	16	0%
defensive	4	0%	34	29%	41	12%	79	19%
total	25	0%	50	20%	50	10%	125	12%

4 pooler ZZZKBot is of course best countered by a defensive anti-rush strategy. Well, it helped, but the rush is too strong for AIUR to survive reliably. On the 2-player map, AIUR found no answer.

#8 microwave	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	2	0%	2	0%	1	0%	5	0%
rush	1	0%	27	7%	1	0%	29	7%
aggressive	1	0%	1	0%	1	0%	3	0%
fast expo	1	0%	2	0%	1	0%	4	0%
macro	1	0%	1	0%	9	22%	11	18%
defensive	18	22%	17	24%	36	25%	71	24%
total	24	17%	50	12%	49	22%	123	17%

Microwave apparently also played a rushy style versus AIUR. That’s interesting. I think that AIUR’s defensive strategy is good against pressure openings generally, so Microwave was likely playing low-econ but not necessarily fast rushes.

#9 letabot	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	1	0%	1	0%	3	0%
rush	1	0%	1	0%	3	33%	5	20%
aggressive	0	0%	3	33%	1	0%	4	25%
fast expo	1	0%	41	49%	43	49%	85	48%
macro	1	100%	3	33%	1	0%	5	40%
defensive	1	0%	1	0%	1	0%	3	0%
total	5	20%	50	44%	50	44%	105	43%

Fast expo makes sense against LetaBot’s “wait for it... wait for it... here it comes!” one big smash.

#10 megabot	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	2	0%	3	0%	6	0%
rush	2	50%	4	0%	38	11%	44	11%
aggressive	1	0%	3	0%	3	0%	7	0%
fast expo	1	0%	3	0%	2	0%	6	0%
macro	2	0%	36	28%	2	0%	40	25%
defensive	18	94%	2	0%	2	0%	22	77%
total	25	72%	50	20%	50	8%	125	26%

Why did MegaBot have so much more trouble on the 2-player map? According to the official per-map result table, MegaBot did fine overall on Destination (the one 2-player map), so its trouble came only against AIUR. Maybe I should watch replays and diagnose it.

#11 ualbertabot	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	1	0%	1	0%	3	0%
rush	2	0%	43	37%	2	0%	47	34%
aggressive	1	0%	2	0%	1	0%	4	0%
fast expo	1	0%	2	0%	1	0%	4	0%
macro	18	33%	1	0%	1	0%	20	30%
defensive	1	0%	1	0%	44	16%	46	15%
total	24	25%	50	32%	50	14%	124	23%

#12 tyr	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	1	0%	1	0%	3	0%
rush	1	100%	1	0%	32	81%	34	79%
aggressive	0	0%	37	46%	8	75%	45	51%
fast expo	1	100%	3	33%	3	67%	7	57%
macro	1	0%	6	33%	3	33%	10	30%
defensive	1	0%	2	0%	3	33%	6	17%
total	5	40%	50	40%	50	72%	105	55%

I suspect that Tyr suffered here because it is a jvm bot and could not write its learning file.

#13 ecgberht	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	100%	38	89%	2	50%	41	88%
rush	1	100%	1	0%	43	67%	45	67%
aggressive	0	0%	4	75%	1	0%	5	60%
fast expo	1	100%	1	0%	2	0%	4	25%
macro	1	0%	3	67%	1	0%	5	40%
defensive	1	0%	3	67%	1	0%	5	40%
total	5	60%	50	82%	50	60%	105	70%

#15 titaniron	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	1	0%	2	50%	4	25%
rush	1	0%	1	0%	3	33%	5	20%
aggressive	0	0%	42	79%	42	88%	84	83%
fast expo	1	0%	1	0%	1	0%	3	0%
macro	1	100%	2	50%	1	0%	4	50%
defensive	1	100%	3	0%	1	0%	5	20%
total	5	40%	50	68%	50	78%	105	71%

TitanIron appears to have been too predictable. Notice that the winning strategy on most maps was never tried (without crashing) on the 2-player map. It might have won there too.

#16 ziabot	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	16	50%	2	50%	1	0%	19	47%
rush	1	0%	2	0%	1	0%	4	0%
aggressive	1	0%	1	0%	3	33%	5	20%
fast expo	1	0%	2	50%	0	0%	3	33%
macro	1	0%	1	0%	1	0%	3	0%
defensive	3	33%	42	69%	44	57%	89	62%
total	23	39%	50	62%	50	52%	123	54%

#17 steamhammer	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	1	0%	1	0%	3	0%
rush	3	67%	4	75%	9	100%	16	88%
aggressive	3	100%	17	100%	15	100%	35	100%
fast expo	2	0%	2	0%	2	50%	6	17%
macro	1	100%	10	100%	1	0%	12	92%
defensive	14	100%	16	100%	22	100%	52	100%
total	24	83%	50	92%	50	94%	124	91%

#18 overkill	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	3	0%	2	50%	6	17%
rush	0	0%	2	50%	1	0%	3	33%
aggressive	0	0%	1	0%	10	60%	11	55%
fast expo	1	0%	3	67%	0	0%	4	50%
macro	0	0%	0	0%	0	0%	0	0%
defensive	16	88%	41	90%	37	78%	94	85%
total	18	78%	50	80%	50	72%	118	76%

#19 terranuab	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	100%	8	88%	1	0%	10	80%
rush	1	100%	11	100%	30	100%	42	100%
aggressive	0	0%	4	75%	2	50%	6	67%
fast expo	1	100%	16	100%	6	83%	23	96%
macro	1	100%	9	89%	10	90%	20	90%
defensive	1	100%	2	50%	1	0%	4	50%
total	5	100%	50	92%	50	90%	105	91%

#20 cunybot	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	2	50%	4	75%	7	57%
rush	1	100%	1	0%	2	0%	4	25%
aggressive	0	0%	4	75%	13	92%	17	88%
fast expo	1	0%	2	50%	2	50%	5	40%
macro	1	100%	9	89%	13	100%	23	96%
defensive	1	100%	32	100%	15	100%	48	100%
total	5	60%	50	90%	49	90%	104	88%

#21 opprimobot	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	100%	12	100%	6	83%	19	95%
rush	1	100%	5	100%	7	100%	13	100%
aggressive	0	0%	7	100%	4	100%	11	100%
fast expo	1	100%	11	100%	17	100%	29	100%
macro	1	100%	8	100%	7	100%	16	100%
defensive	1	100%	7	100%	9	100%	17	100%
total	5	100%	50	100%	50	98%	105	99%

#22 sling	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	1	0%	1	0%	3	0%
rush	1	100%	5	100%	2	50%	8	88%
aggressive	0	0%	13	100%	13	100%	26	100%
fast expo	1	100%	7	100%	10	100%	18	100%
macro	1	100%	8	100%	11	100%	20	100%
defensive	1	100%	16	100%	13	100%	30	100%
total	5	80%	50	98%	50	96%	105	96%

#23 srbotone	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	0%	2	50%	1	0%	4	25%
rush	1	100%	9	100%	3	67%	13	92%
aggressive	0	0%	13	100%	16	100%	29	100%
fast expo	1	100%	10	100%	8	100%	19	100%
macro	1	100%	7	86%	6	100%	14	93%
defensive	1	100%	9	100%	16	100%	26	100%
total	5	80%	50	96%	50	96%	105	95%

#24 bonjwa	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	1	100%	9	100%	4	75%	14	93%
rush	1	100%	13	100%	10	100%	24	100%
aggressive	0	0%	7	100%	10	100%	17	100%
fast expo	1	100%	6	100%	7	100%	14	100%
macro	1	100%	7	100%	8	100%	16	100%
defensive	1	100%	8	100%	11	100%	20	100%
total	5	100%	50	100%	50	98%	105	99%

#25 stormbreaker	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	4	75%	1	0%	4	75%	9	67%
rush	0	0%	5	80%	10	100%	15	93%
aggressive	0	0%	18	100%	7	100%	25	100%
fast expo	0	0%	0	0%	6	100%	6	100%
macro	0	0%	9	100%	8	100%	17	100%
defensive	20	100%	17	100%	15	100%	52	100%
total	24	96%	50	96%	50	98%	124	97%

#26 korean	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	7	100%	2	100%	10	100%	19	100%
rush	0	0%	7	100%	8	100%	15	100%
aggressive	0	0%	5	100%	8	100%	13	100%
fast expo	0	0%	8	100%	8	100%	16	100%
macro	0	0%	5	100%	6	100%	11	100%
defensive	14	100%	23	100%	10	100%	47	100%
total	21	100%	50	100%	50	100%	121	100%

Well, if you win every game, learning cannot help.

#27 salsa	2		3		4		total
	n	wins	n	wins	n	wins	n	wins
cheese	9	100%	15	100%	9	100%	33	100%
rush	0	0%	0	0%	3	100%	3	100%
aggressive	0	0%	11	100%	8	100%	19	100%
fast expo	0	0%	0	0%	4	100%	4	100%
macro	0	0%	8	100%	7	100%	15	100%
defensive	15	100%	16	100%	19	100%	50	100%
total	24	100%	50	100%	50	100%	124	100%

the critical vital absolutely essential filename question of the decade

Here is a question of virtually no importance that nevertheless has me puzzled: What should I call my learning files?

For now, Steamhammer has an opponent model file for each opponent, named om_[opponent].txt. OM for opponent model. It makes sense, or anyway it makes as much sense as an arbitrary filename can. I like names that make sense.

The next learning data I want to add is opening models, so that Steamhammer can know the timings. With knowledge of both opponent strategies and its own opening strategies, it will be able to directly compare and find counters: Your attack comes at this time, which opening is ready at that timing? It will also be able to recognize which openings are similar to others, so that when it can’t match a strategy, finding good openings by trial and error is quicker.

What should I name the opening model files? Opponent models and opening models are both empirical data, and should be updated as games are played. But the unfortunate words “opponent” and “opening” are too much alike to abbreviate nicely. Should I start with “enemy” and “build order”? “Bot” and “strategy”? Most abbreviations seem unintuitive. My best idea so far is to rename the om_* files to bot_* and use the prefix open_ for openings. Maybe OK?

What’s your idea? Because, according to the bikeshed principle, everyone should have an opinion on this....

Steamhammer’s learning results

When I uploaded Steamhammer 1.4.3 to SSCAIT on 11 June, I erased its learned data from the server. Its elo immediately plunged, partly because the voters wanted to put it through its paces against strong opponents, and partly because it needs its learning data to cope. Most rushbots, and many others, won their first or first few games against Steamhammer. That didn’t change when I uploaded 1.4.4 a week later—the improvements weren’t many.

Finally, only in the past several days, I’ve started to feel that Steamhammer has learned enough that it is closing in on its equilibrium elo. It has been wavering around the high 2100’s, not able to break above 2200 but not falling far either. It seems about right, at least for SSCAIT conditions.

The findings. As I’ve mentioned, clearing the learning data was a deliberate test to see how well the learning system works when learning from scratch. I’m fairly pleased. I see weaknesses, but only weaknesses I expected. Against XIMP, Steamhammer has settled on an opening that has won every game so far, but is not as strong as the 3 hatch before pool opening that I hand-chose for it in the old days. Steamhammer only sees that it wins. It can’t tell the difference between openings that win nearly all games because it sees only the winning rate; it needs an evaluation function that can tell it “this one wins more convincingly.” Against Proxy, it won one game with its unusual 6 pool opening. Then it played another game and recorded another win—because Proxy crashed. Steamhammer thought it had found a winner, and had to lose some games before it realized that the 6 pool was not a reliable counter. (It would be, if not for Proxy’s powerful worker defense.) Possibly 5 pool or 4 pool would succeed, but Steamhammer does not know that some openings are related to others. When I teach it that, it will be able to realize that if one opening shows promise but is not quite successful, it should try related openings.

In some cases, Steamhammer hit on surprising counters. The most striking example is against TyrProtoss, which had been winning every game with its cannon turtle into timing attack strategy. Steamhammer tried its 2 hatch lurker all-in attack, which did not make sense to me—Steamhammer’s lurkers suck when cannons are around, it has little idea how to break the cannons and no idea how to bypass them. But it won a game. We’ll see if it keeps winning!

I expected the weaknesses, and I expected the surprising counters. I feel as though I understood the learning system and its limitations fairly well. It gives me confidence that my planned improvements, when I finally get around to them, will be real improvements.

no more enemy-specific strategies for Steamhammer

Working on the opponent model today, I made one of the key changes for the next version:

    "UseEnemySpecificStrategy" : false,
    "EnemySpecificStrategy" :
    {
    },

No more openings hand-configured for known opponents. Steamhammer has to figure out everything on its own. I’ve been working toward this for a long time, and it’s good to finally take the step. I expect play to become more varied—Steamhammer is likely to discover surprising solutions for some opponents. Play should also become stronger, especially in tournaments where opponents like to prepare specially against select enemies. They’ll have to look for ways to exploit Steamhammer’s tactical and micro mistakes, because the game plans will be too adaptive.

I also wrote the terran vulture-first recognizer for the plan recognizer today. It recognizes a plan called Factory that can only be followed by terran, and Steamhammer zerg is configured to counter the plan with the AntiFactory opening. Testing against Iron, it worked perfectly: The first game, Iron won easily. The second game, Steamhammer countered and fought back hard (and happened to win). That’s how it’s supposed to work.

The recognizer was easy to write. Maybe I should write a few more recognizers and counters.

Iron should be a good test case, because Iron is strong enough to usually defeat the counter—AntiFactory puts up a tough battle, but still mostly loses. Opening learning success looks like this: Steamhammer realizes that AntiFactory is probably best, though not all that good, and explores other openings sometimes but not too often. I think I should be able to get that right.

Will playing better games against Iron entice voters on SSCAIT? I think it might happen. If so, I will quickly grow bored with similar Iron-Steamhammer games, but stream watchers may be pleased. Iron would likely lose a few elo points to Steamhammer on average, instead of gaining as it does now.

The upcoming version 1.4.2 has important improvements for all races, including some improvements I haven’t mentioned. Strategy, macro, and micro are better. Look forward to higher rankings for Steamhammer and Randomhammer.