CIG 2018 - what AIUR learned

Here is what the classic protoss bot AIUR learned about each opponent over the course of CIG 2018. AIUR has not been updated in many years and has fallen behind the state of the art, but its varied strategies and learning still make it a tricky opponent in a long tournament. Seeing AIUR's counters for each opponent tells us something about how the opponent played. For past editions, see AIIDE 2017 what AIUR learned and what AIUR learned (AIIDE 2015).

This is generated from data in AIUR's final write directory. There were 125 rounds and 5 maps: one 2-player map, two 3-player maps, and two 4-player maps. For some opponents, all games were recorded, giving 25 games on the 2-player map and 50 games each on the 3- and 4-player sizes. For most opponents, fewer games were recorded. AIUR recorded 2932 games, and the results table lists 318 crashes for AIUR. 2932 + 318 = 3250, the correct total game count, so the unrecorded games are exactly accounted for by crashes and nothing else.

First the overview, summing across all opponents.

overall
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese        72   49%   127   65%   132   35%   331   49%
rush          29   41%   269   33%   261   55%   559   44%
aggressive    13   23%   225   68%   184   78%   422   71%
fast expo     33   24%   185   48%   207   48%   425   46%
macro         46   33%   180   52%   135   60%   361   53%
defensive    141   75%   314   73%   379   55%   834   65%
total        334   54%  1300   56%  1298   56%  2932   56%
  • 2, 3, 4 - map size, the number of starting positions
  • n - games recorded
  • wins - winning percentage over those games
  • cheese - cannon rush
  • rush - dark templar rush
  • aggressive - fast 4 zealot drop
  • fast expo - nexus first
  • macro - aim for a strong middle game army
  • defensive - try to be safe against rushes

Looking across the bottom row, you can see that AIUR had a plus score on every size of map, and that it chose different strategies on different map sizes to do it. It's a strong result for a bot which has essentially no micro skills and has not been updated since 2014. It does still have the best cannon rush of any bot, if you ask me.

#1 locutus
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     8    0%    25   12%    34    9%
rush           1    0%    10    0%     6    0%    17    0%
aggressive     1    0%     4    0%     5    0%    10    0%
fast expo      1    0%    14    0%     5    0%    20    0%
macro          1    0%     7    0%     4    0%    12    0%
defensive      1    0%     7   14%     5    0%    13    8%
total          6    0%    50    2%    50    6%   106    4%

Even against the toughest opponents, AIUR can scrape a small edge with learning. Against Locutus, it pulled barely above zero, but got a few extra wins because it discovered that its cannon rush occasionally scores on 4-player maps. Results against PurpleWave below are similar. I suspect that if AIUR had played the cannon rush every game, Locutus would have adapted and nullified the edge. Maybe it did, and that’s why the edge is so small.

#2 purplewave
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     8    0%    39   18%    48   15%
rush           1    0%     8    0%     2    0%    11    0%
aggressive     1    0%    10    0%     3    0%    14    0%
fast expo      4    0%     8    0%     2    0%    14    0%
macro          1    0%    10    0%     2    0%    13    0%
defensive      3    0%     6    0%     2    0%    11    0%
total         11    0%    50    0%    50   14%   111    6%


#3 mcrave
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1  100%     1    0%     1    0%     3   33%
rush           1    0%    41    2%     1    0%    43    2%
aggressive     0    0%     2    0%     3    0%     5    0%
fast expo      1    0%     1    0%    42   17%    44   16%
macro          1    0%     3    0%     1    0%     5    0%
defensive      1    0%     2    0%     2    0%     5    0%
total          5   20%    50    2%    50   14%   105    9%

Against McRave, the choice is nexus first. McRave must have settled on a macro opening itself.

#4 tscmoo
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese        11   27%     1    0%     1    0%    13   23%
rush           1    0%     1    0%     3    0%     5    0%
aggressive     1    0%    11    9%     1    0%    13    8%
fast expo      5   20%    33   15%     1    0%    39   15%
macro          1    0%     2    0%    22   14%    25   12%
defensive      1    0%     2    0%    22   18%    25   16%
total         20   20%    50   12%    50   14%   120   14%

Against the unpredictable Tscmoo, AIUR wavered before settling on an unpredictable set of answers. Notice that not all the strategies are well explored: If you win less than 1 game in 5, then playing an opening 3 times is not enough. If the tournament were much longer, AIUR would likely have scored higher because of its slow but effective learning.

#5 isamind
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     2    0%     4    0%     7    0%
rush           1  100%    37   19%    38    8%    76   14%
aggressive     0    0%     1    0%     3    0%     4    0%
fast expo      1    0%     5    0%     2    0%     8    0%
macro          1    0%     1    0%     2    0%     4    0%
defensive      1    0%     4    0%     1    0%     6    0%
total          5   20%    50   14%    50    6%   105   10%

ISAMind may be based on Locutus, but unlike Locutus it is vulnerable to AIUR’s dark templar rushes. It’s a sign that it is not as mature and well tested.

#6 iron
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     5    0%     7    0%
rush           1    0%    26   19%     2    0%    29   17%
aggressive     0    0%     2    0%     2    0%     4    0%
fast expo      1    0%     1    0%    31   10%    33    9%
macro          1    0%    19    5%     4    0%    24    4%
defensive      1    0%     1    0%     6    0%     8    0%
total          5    0%    50   12%    50    6%   105    9%


#7 zzzkbot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         4    0%     2    0%     2    0%     8    0%
rush           4    0%     4    0%     1    0%     9    0%
aggressive     3    0%     2    0%     1    0%     6    0%
fast expo      3    0%     3    0%     1    0%     7    0%
macro          7    0%     5    0%     4    0%    16    0%
defensive      4    0%    34   29%    41   12%    79   19%
total         25    0%    50   20%    50   10%   125   12%

4 pooler ZZZKBot is of course best countered by a defensive anti-rush strategy. Well, it helped, but the rush is too strong for AIUR to survive reliably. On the 2-player map, AIUR found no answer.

#8 microwave
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         2    0%     2    0%     1    0%     5    0%
rush           1    0%    27    7%     1    0%    29    7%
aggressive     1    0%     1    0%     1    0%     3    0%
fast expo      1    0%     2    0%     1    0%     4    0%
macro          1    0%     1    0%     9   22%    11   18%
defensive     18   22%    17   24%    36   25%    71   24%
total         24   17%    50   12%    49   22%   123   17%

Microwave apparently also played a rushy style versus AIUR. That’s interesting. I think that AIUR’s defensive strategy is good against pressure openings generally, so Microwave was likely playing low-econ but not necessarily fast rushes.

#9 letabot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           1    0%     1    0%     3   33%     5   20%
aggressive     0    0%     3   33%     1    0%     4   25%
fast expo      1    0%    41   49%    43   49%    85   48%
macro          1  100%     3   33%     1    0%     5   40%
defensive      1    0%     1    0%     1    0%     3    0%
total          5   20%    50   44%    50   44%   105   43%

Fast expo makes sense against LetaBot’s “wait for it... wait for it... here it comes!” one big smash.

#10 megabot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     2    0%     3    0%     6    0%
rush           2   50%     4    0%    38   11%    44   11%
aggressive     1    0%     3    0%     3    0%     7    0%
fast expo      1    0%     3    0%     2    0%     6    0%
macro          2    0%    36   28%     2    0%    40   25%
defensive     18   94%     2    0%     2    0%    22   77%
total         25   72%    50   20%    50    8%   125   26%

Why did MegaBot have so much more trouble on the 2-player map? According to the official per-map result table, MegaBot did fine overall on Destination (the one 2-player map), so its trouble came only against AIUR. Maybe I should watch replays and diagnose it.

#11 ualbertabot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           2    0%    43   37%     2    0%    47   34%
aggressive     1    0%     2    0%     1    0%     4    0%
fast expo      1    0%     2    0%     1    0%     4    0%
macro         18   33%     1    0%     1    0%    20   30%
defensive      1    0%     1    0%    44   16%    46   15%
total         24   25%    50   32%    50   14%   124   23%


#12 tyr
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           1  100%     1    0%    32   81%    34   79%
aggressive     0    0%    37   46%     8   75%    45   51%
fast expo      1  100%     3   33%     3   67%     7   57%
macro          1    0%     6   33%     3   33%    10   30%
defensive      1    0%     2    0%     3   33%     6   17%
total          5   40%    50   40%    50   72%   105   55%

I suspect that Tyr suffered here because it is a JVM bot and could not write its learning file.

#13 ecgberht
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1  100%    38   89%     2   50%    41   88%
rush           1  100%     1    0%    43   67%    45   67%
aggressive     0    0%     4   75%     1    0%     5   60%
fast expo      1  100%     1    0%     2    0%     4   25%
macro          1    0%     3   67%     1    0%     5   40%
defensive      1    0%     3   67%     1    0%     5   40%
total          5   60%    50   82%    50   60%   105   70%


#15 titaniron
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     2   50%     4   25%
rush           1    0%     1    0%     3   33%     5   20%
aggressive     0    0%    42   79%    42   88%    84   83%
fast expo      1    0%     1    0%     1    0%     3    0%
macro          1  100%     2   50%     1    0%     4   50%
defensive      1  100%     3    0%     1    0%     5   20%
total          5   40%    50   68%    50   78%   105   71%

TitanIron appears to have been too predictable. Notice that the winning strategy on most maps was never tried (without crashing) on the 2-player map. It might have won there too.

#16 ziabot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese        16   50%     2   50%     1    0%    19   47%
rush           1    0%     2    0%     1    0%     4    0%
aggressive     1    0%     1    0%     3   33%     5   20%
fast expo      1    0%     2   50%     0    0%     3   33%
macro          1    0%     1    0%     1    0%     3    0%
defensive      3   33%    42   69%    44   57%    89   62%
total         23   39%    50   62%    50   52%   123   54%


#17 steamhammer
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           3   67%     4   75%     9  100%    16   88%
aggressive     3  100%    17  100%    15  100%    35  100%
fast expo      2    0%     2    0%     2   50%     6   17%
macro          1  100%    10  100%     1    0%    12   92%
defensive     14  100%    16  100%    22  100%    52  100%
total         24   83%    50   92%    50   94%   124   91%


#18 overkill
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     3    0%     2   50%     6   17%
rush           0    0%     2   50%     1    0%     3   33%
aggressive     0    0%     1    0%    10   60%    11   55%
fast expo      1    0%     3   67%     0    0%     4   50%
macro          0    0%     0    0%     0    0%     0    0%
defensive     16   88%    41   90%    37   78%    94   85%
total         18   78%    50   80%    50   72%   118   76%


#19 terranuab
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1  100%     8   88%     1    0%    10   80%
rush           1  100%    11  100%    30  100%    42  100%
aggressive     0    0%     4   75%     2   50%     6   67%
fast expo      1  100%    16  100%     6   83%    23   96%
macro          1  100%     9   89%    10   90%    20   90%
defensive      1  100%     2   50%     1    0%     4   50%
total          5  100%    50   92%    50   90%   105   91%


#20 cunybot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     2   50%     4   75%     7   57%
rush           1  100%     1    0%     2    0%     4   25%
aggressive     0    0%     4   75%    13   92%    17   88%
fast expo      1    0%     2   50%     2   50%     5   40%
macro          1  100%     9   89%    13  100%    23   96%
defensive      1  100%    32  100%    15  100%    48  100%
total          5   60%    50   90%    49   90%   104   88%


#21 opprimobot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1  100%    12  100%     6   83%    19   95%
rush           1  100%     5  100%     7  100%    13  100%
aggressive     0    0%     7  100%     4  100%    11  100%
fast expo      1  100%    11  100%    17  100%    29  100%
macro          1  100%     8  100%     7  100%    16  100%
defensive      1  100%     7  100%     9  100%    17  100%
total          5  100%    50  100%    50   98%   105   99%


#22 sling
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           1  100%     5  100%     2   50%     8   88%
aggressive     0    0%    13  100%    13  100%    26  100%
fast expo      1  100%     7  100%    10  100%    18  100%
macro          1  100%     8  100%    11  100%    20  100%
defensive      1  100%    16  100%    13  100%    30  100%
total          5   80%    50   98%    50   96%   105   96%


#23 srbotone
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     2   50%     1    0%     4   25%
rush           1  100%     9  100%     3   67%    13   92%
aggressive     0    0%    13  100%    16  100%    29  100%
fast expo      1  100%    10  100%     8  100%    19  100%
macro          1  100%     7   86%     6  100%    14   93%
defensive      1  100%     9  100%    16  100%    26  100%
total          5   80%    50   96%    50   96%   105   95%


#24 bonjwa
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1  100%     9  100%     4   75%    14   93%
rush           1  100%    13  100%    10  100%    24  100%
aggressive     0    0%     7  100%    10  100%    17  100%
fast expo      1  100%     6  100%     7  100%    14  100%
macro          1  100%     7  100%     8  100%    16  100%
defensive      1  100%     8  100%    11  100%    20  100%
total          5  100%    50  100%    50   98%   105   99%


#25 stormbreaker
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         4   75%     1    0%     4   75%     9   67%
rush           0    0%     5   80%    10  100%    15   93%
aggressive     0    0%    18  100%     7  100%    25  100%
fast expo      0    0%     0    0%     6  100%     6  100%
macro          0    0%     9  100%     8  100%    17  100%
defensive     20  100%    17  100%    15  100%    52  100%
total         24   96%    50   96%    50   98%   124   97%


#26 korean
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         7  100%     2  100%    10  100%    19  100%
rush           0    0%     7  100%     8  100%    15  100%
aggressive     0    0%     5  100%     8  100%    13  100%
fast expo      0    0%     8  100%     8  100%    16  100%
macro          0    0%     5  100%     6  100%    11  100%
defensive     14  100%    23  100%    10  100%    47  100%
total         21  100%    50  100%    50  100%   121  100%

Well, if you win every game, learning cannot help.

#27 salsa
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         9  100%    15  100%     9  100%    33  100%
rush           0    0%     0    0%     3  100%     3  100%
aggressive     0    0%    11  100%     8  100%    19  100%
fast expo      0    0%     0    0%     4  100%     4  100%
macro          0    0%     8  100%     7  100%    15  100%
defensive     15  100%    16  100%    19  100%    50  100%
total         24  100%    50  100%    50  100%   124  100%

AIIDE 2017 what AIUR learned

Here is what AIUR learned about each opponent over the course of the tournament. I did this mostly because it’s easy; I already had the script from last year. But it’s also informative—AIUR’s reactions tell us how each bot played, and may tell bot authors what they need to work on.

The data is generated from files in AIUR's final read directory. AIUR recorded 111 games against some opponents even though the tournament officially ran for 110 rounds; presumably the tournament ran longer and was cut back to a multiple of 10 rounds for fairness (since there are 10 maps). On the other hand, AIUR's total game count according to itself is 2938 and according to the tournament results is 2965, so it may have been unable to record some games (it is listed with 53 crashes, so that's not a surprise).

First an overall view, totalling the data for all opponents. We can see that all 6 of AIUR's strategies ("moods" it calls them) were widely valuable: Every strategy has a win rate over 50% on some map size. AIUR's overall win rate in the tournament was 50.46%.

overall
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese       159   55%    59   37%   161   44%   379   47%
rush         134   66%    87   55%   185   50%   406   56%
aggressive   107   56%   108   43%   155   30%   370   41%
fast expo     69   45%    84   33%   197   51%   350   46%
macro         46   28%    69   52%   211   37%   326   39%
defensive    352   60%   185   58%   570   55%  1107   57%
total        867   57%   592   49%  1479   48%  2938   50%
  • 2, 3, 4 - map size, the number of starting positions
  • n - games recorded
  • wins - winning percentage over those games
  • cheese - cannon rush
  • rush - dark templar rush
  • aggressive - fast 4 zealot drop
  • fast expo - nexus first
  • macro - aim for a strong middle game army
  • defensive - be safe against rushes (not entirely successful)
#1 zzzkbot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese        16   12%     1    0%     4    0%    21   10%
rush           5    0%     1    0%     1    0%     7    0%
aggressive     3    0%     1    0%     5    0%     9    0%
fast expo      4    0%     1    0%     5    0%    10    0%
macro          3    0%     2    0%     3    0%     8    0%
defensive      3    0%    16   31%    37   24%    56   25%
total         34    6%    22   23%    55   16%   111   14%

AIUR struggled against the tournament leader but was not entirely helpless. Its cannon rush had a chance on 2 player maps and its anti-rush strategy on the others. We see how AIUR gains by taking the map size into account.

#2 purplewave
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     2    0%     4    0%
rush          28   79%     3   33%    40   55%    71   63%
aggressive     1    0%     3   33%     1    0%     5   20%
fast expo      1    0%    11   36%    10   60%    22   45%
macro          1    0%     2    0%     1    0%     4    0%
defensive      1    0%     1    0%     1    0%     3    0%
total         33   67%    21   29%    55   51%   109   51%

AIUR upset #2 PurpleWave, a surprising outcome. The DT rush and the fast expand were both somewhat successful—rather unrelated strategies.

#3 iron
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         5    0%     1    0%     7    0%    13    0%
rush           5    0%     2    0%     7    0%    14    0%
aggressive     3    0%     2    0%    12    0%    17    0%
fast expo      8    0%    14    7%     9    0%    31    3%
macro          6    0%     1    0%    10    0%    17    0%
defensive      5    0%     2    0%    10    0%    17    0%
total         32    0%    22    5%    55    0%   109    1%

Learning can’t help if nothing you try wins....

#4 cpac
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           4    0%     0    0%     2    0%     6    0%
aggressive     2    0%     1    0%     1    0%     4    0%
fast expo      1    0%     1    0%     1    0%     3    0%
macro          2    0%     3   33%     2    0%     7   14%
defensive     24   38%    16   69%    48   50%    88   50%
total         34   26%    22   55%    55   44%   111   41%

Cpac was configured to play 5 pool against AIUR. It worked, but AIUR was able to compensate to an extent by playing its anti-rush build.

#5 microwave
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         2    0%     2    0%     4    0%     8    0%
rush           1    0%     1    0%     4    0%     6    0%
aggressive    20   20%    15   13%    11    0%    46   13%
fast expo      1    0%     2    0%     6    0%     9    0%
macro          1    0%     1    0%     4    0%     6    0%
defensive      1    0%     1    0%    26   12%    28   11%
total         26   15%    22    9%    55    5%   103    9%

Microwave was successful but showed a little vulnerability to surprise zealots dropped in its main. I suspect it’s a tactical reaction issue.

#6 cherrypi
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           1    0%     1    0%     1    0%     3    0%
aggressive     2    0%     2    0%     1    0%     5    0%
fast expo      2    0%     1    0%     1    0%     4    0%
macro          2    0%     1    0%     9   11%    12    8%
defensive     26    4%    16   12%    42   12%    84   10%
total         34    3%    22    9%    55   11%   111    8%


#7 mcrave
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese        26  100%     5   60%    45   62%    76   75%
rush           3   67%     9   67%     4   50%    16   62%
aggressive     1    0%     4   50%     1    0%     6   33%
fast expo      1    0%     2   50%     2   50%     5   40%
macro          1    0%     1    0%     1    0%     3    0%
defensive      1    0%     1    0%     2    0%     4    0%
total         33   85%    22   55%    55   56%   110   65%

AIUR upset McRave with its cannon rush, and the dark templar rush did well too. AIUR executes the best cannon rush of any bot, in my opinion. It is a sign that McRave’s play was not robust enough against tricks.

#8 arrakhammer
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         2    0%     2    0%     3    0%     7    0%
rush           1    0%     1    0%     4    0%     6    0%
aggressive     1    0%     5   60%     3    0%     9   33%
fast expo      1    0%     1    0%     2    0%     4    0%
macro          0    0%    12   50%    38   37%    50   40%
defensive     29   66%     1    0%     4   25%    34   59%
total         34   56%    22   41%    54   28%   110   39%


#9 tyr
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         6   67%     1    0%     1    0%     8   50%
rush          20  100%     1    0%     2    0%    23   87%
aggressive     3   33%    10   20%     1    0%    14   21%
fast expo      1    0%     7   29%    49   35%    57   33%
macro          1    0%     1    0%     1    0%     3    0%
defensive      2   50%     2    0%     1    0%     5   20%
total         33   79%    22   18%    55   31%   110   43%

The DT rush won 100% of the time on 2 player maps and was tried only a few times on larger maps, losing. Was it only unlucky on the 3 and 4 player maps, or is there a real difference? With only 3 games on the larger maps, we can't tell from the numbers. It is a weakness of AIUR's learning: it's slow because there is so much to learn. The flip side of the slowness is that, over a long tournament, it learns a lot.

#10 steamhammer
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         2    0%     1    0%     1    0%     4    0%
rush           2   50%     1    0%     2    0%     5   20%
aggressive     1    0%     1    0%     1    0%     3    0%
fast expo      1    0%     1    0%     1    0%     3    0%
macro          0    0%     1    0%     1    0%     2    0%
defensive     27   81%    17   88%    49   67%    93   75%
total         33   70%    22   68%    55   60%   110   65%

I was surprised to see Steamhammer upset by AIUR. I had thought that AIUR was a solved problem. On SSCAIT too, Steamhammer started to show losses against AIUR in September for the first time in months. I may have introduced a weakness in some recent version and AIUR’s learning took that long to find it on SSCAIT. In AIIDE, the tournament was easily long enough.

#11 ailien
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           3    0%     1    0%     2    0%     6    0%
aggressive     1    0%     2    0%     1    0%     4    0%
fast expo      1    0%     2   50%     0    0%     3   33%
macro          4   50%     8   75%     1    0%    13   62%
defensive     24   58%     8   88%    49   37%    81   48%
total         34   47%    22   64%    54   33%   110   44%


#12 letabot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         7   43%     1    0%     2    0%    10   30%
rush           3   33%    13   54%    43   40%    59   42%
aggressive     5   40%     1    0%     1    0%     7   29%
fast expo     13   46%     3   33%     1    0%    17   41%
macro          1    0%     1    0%     6   33%     8   25%
defensive      1    0%     3   33%     1    0%     5   20%
total         30   40%    22   41%    54   35%   106   38%

I suspect that fast expo was the best strategy on 4 player maps, but how was AIUR to know? A weakness of AIUR’s epsilon-greedy learning, compared to UCB, is that it doesn’t realize that a less-explored option is more likely to be misevaluated.
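
To make the comparison concrete, here is a minimal Python sketch of both selection rules over one (opponent, map size) cell. It is my illustration, not AIUR's code; the tallies are copied from LetaBot's 4 player column above.

    import math
    import random

    # strategy -> (wins, games) for one opponent on one map size
    records = {
        "cheese":     (0, 2),
        "rush":       (17, 43),
        "aggressive": (0, 1),
        "fast expo":  (0, 1),
        "macro":      (2, 6),
        "defensive":  (0, 1),
    }

    def epsilon_greedy(records, epsilon=0.06):
        # AIUR's rule, as described in the AIIDE 2015 post below:
        # pick at random 6% of the time, else the best observed win rate.
        if random.random() < epsilon:
            return random.choice(list(records))
        return max(records, key=lambda s: records[s][0] / records[s][1])

    def ucb1(records, c=math.sqrt(2)):
        # UCB1 adds an exploration bonus that shrinks as a strategy
        # accumulates trials, so barely-tried options are revisited.
        total = sum(g for _, g in records.values())
        def score(s):
            w, g = records[s]
            return w / g + c * math.sqrt(math.log(total) / g)
        return max(records, key=score)

    print(ucb1(records))  # a singly-tried strategy: its bonus term dominates

Epsilon-greedy almost always replays the rush here; UCB1 keeps probing the strategies with one game before trusting their 0% records.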

#13 ximp
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese        34   35%     0    0%     1    0%    35   34%
rush           0    0%     0    0%     1    0%     1    0%
aggressive     0    0%    13    8%    52    2%    65    3%
fast expo      0    0%     9    0%     0    0%     9    0%
macro          0    0%     0    0%     1    0%     1    0%
defensive      0    0%     0    0%     0    0%     0    0%
total         34   35%    22    5%    55    2%   111   13%


#14 ualbertabot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         0    0%     0    0%     1  100%     1  100%
rush           0    0%     0    0%     0    0%     0    0%
aggressive     0    0%     0    0%     1  100%     1  100%
fast expo      0    0%     0    0%     0    0%     0    0%
macro          0    0%     0    0%     0    0%     0    0%
defensive     34   32%    21    5%    52   27%   107   24%
total         34   32%    21    5%    54   30%   109   26%

What’s up with all those zeroes? AIUR is coded to try each strategy once before it starts making decisions, and that did not happen here. It turns out that AIUR has pre-learned data for Skynet, XIMP, and UAlbertaBot, so its learning in those cases looks different.

#16 icebot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     2    0%     1    0%     4    0%
rush           1    0%     2   50%     3   33%     6   33%
aggressive     3  100%     3   67%     4   50%    10   70%
fast expo     14  100%     3   67%    44   93%    61   93%
macro          4   75%     2   50%     1    0%     7   57%
defensive      9   89%    10   80%     2   50%    21   81%
total         32   88%    22   64%    55   82%   109   80%


#17 skynet
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese        13   92%     0    0%     0    0%    13   92%
rush          21   95%    21   90%    51   88%    93   90%
aggressive     0    0%     0    0%     0    0%     0    0%
fast expo      0    0%     1  100%     0    0%     1  100%
macro          0    0%     0    0%     0    0%     0    0%
defensive      0    0%     0    0%     4   50%     4   50%
total         34   94%    22   91%    55   85%   111   89%


#18 killall
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     3    0%     1    0%     5    0%
rush           1    0%     2    0%     1    0%     4    0%
aggressive     1    0%     2    0%     1    0%     4    0%
fast expo      1    0%     3    0%     1    0%     5    0%
macro          0    0%     2    0%     2   50%     4   25%
defensive     30   80%    10   70%    49   76%    89   76%
total         34   71%    22   32%    55   69%   111   62%


#19 megabot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         3   67%     1    0%     2    0%     6   33%
rush           2    0%    14   36%     5    0%    21   24%
aggressive     6   67%     4   25%     4    0%    14   36%
fast expo      2   50%     1    0%     4    0%     7   14%
macro          1    0%     1    0%    36   25%    38   24%
defensive     17   76%     1    0%     2    0%    20   65%
total         31   65%    22   27%    53   17%   106   33%


#20 xelnaga
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         9  100%     6   83%     1    0%    16   88%
rush          19  100%     4   75%     1    0%    24   92%
aggressive     1    0%     3   33%     1    0%     5   20%
fast expo      1    0%     4   75%     1    0%     6   50%
macro          2    0%     2   50%    50   36%    54   35%
defensive      2   50%     3   67%     1    0%     6   50%
total         34   85%    22   68%    55   33%   111   56%

Against Xelnaga, AIUR found solutions on 2 and 3 player maps but not on 4 player maps. Is it another case of underexploration?

#21 overkill
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     3   67%     5   40%
rush           2   50%     0    0%     0    0%     2   50%
aggressive     8  100%     4  100%     7   86%    19   95%
fast expo      3   67%     3  100%     7  100%    13   92%
macro          4   75%     3   67%    12   92%    19   84%
defensive     14   93%    11  100%    26   96%    51   96%
total         32   84%    22   91%    55   93%   109   90%


#22 juno
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         5    0%    14   36%    33   15%    52   19%
rush           3    0%     1    0%     1    0%     5    0%
aggressive     2    0%     1    0%     2    0%     5    0%
fast expo      2    0%     1    0%    16   12%    19   11%
macro          1    0%     1    0%     1    0%     3    0%
defensive     19   21%     4   25%     2    0%    25   20%
total         32   12%    22   27%    55   13%   109   16%

Juno’s cannon contain upset AIUR. Learning didn’t help much, because the problem wasn’t in any of the strategies; it was in AIUR’s poor reactions to cannons appearing in front of its base. It is amusing to watch 2 bots cannon rush each other when both happen to get their cannons up.

#23 garmbot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           2   50%     1    0%     0    0%     3   33%
aggressive    17   94%    17  100%     3   67%    37   95%
fast expo      0    0%     1    0%    23   83%    24   79%
macro          0    0%     1    0%     1    0%     2    0%
defensive      5   80%     1    0%    27   81%    33   79%
total         25   84%    22   77%    55   78%   102   79%


#24 myscbot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     2   50%     4   25%
rush           2    0%     3   67%     2   50%     7   43%
aggressive     3   33%     2  100%     9   78%    14   71%
fast expo      1    0%     2   50%     1    0%     4   25%
macro          4   50%     4  100%     3   67%    11   73%
defensive     23   61%    10  100%    38   79%    71   76%
total         34   50%    22   86%    55   75%   111   69%


#25 hannesbredberg
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         5   80%     3  100%     3   67%    11   82%
rush           2   50%     3  100%     2   50%     7   71%
aggressive     2   50%     2   50%     2    0%     6   33%
fast expo      8  100%     3  100%     9   89%    20   95%
macro          2   50%     4  100%    11   91%    17   88%
defensive     15  100%     7  100%    28  100%    50  100%
total         34   88%    22   95%    55   89%   111   90%


#26 sling
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         2   50%     1    0%     3   33%     6   33%
rush           2   50%     0    0%     1    0%     3   33%
aggressive    12  100%     0    0%    23   96%    35   97%
fast expo      1    0%     5  100%     1    0%     7   71%
macro          3   67%     5   80%    12   75%    20   75%
defensive      5   80%    11  100%    15   80%    31   87%
total         25   80%    22   91%    55   80%   102   82%

Here is another possible case of insufficient exploration. The 4 zealot drop won 100% of the time on 2 player maps and 96% of the time on 4 player maps, but was never tried on 3 player maps (I guess due to a crash, since AIUR tries to play each strategy once). It’s not a severe problem, though, because 3 player maps did have 2 strategies that scored 100%.

#27 forcebot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese         1    0%     1    0%     1    0%     3    0%
rush           0    0%     1    0%     1    0%     2    0%
aggressive     3   67%     2    0%     1    0%     6   33%
fast expo      0    0%     1    0%     1    0%     2    0%
macro          0    0%     9   78%     3   67%    12   75%
defensive     29  100%     8   75%    48   94%    85   94%
total         33   94%    22   59%    55   85%   110   83%


#28 ziabot
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese        12  100%     7   86%    36   86%    55   89%
rush           1    0%     1  100%     4   75%     6   67%
aggressive     6  100%     8   88%     6   83%    20   90%
fast expo      1    0%     1    0%     2    0%     4    0%
macro          3    0%     1    0%     1    0%     5    0%
defensive      6   67%     4   75%     6   83%    16   75%
total         29   76%    22   77%    55   80%   106   78%

Next: AILien’s learning.

what AIUR learned

After Overkill yesterday, I wrote a not-quite-as-little Perl script to read AIUR’s learning files. AIUR learns more data: Overkill learns a table (opponent, strategy), while AIUR learns a table (opponent, strategy, map size) where map size is the number of starting positions, which is 2, 3 or 4 in AIIDE 2015.
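
To make the difference concrete, here is a tiny Python sketch of the two table shapes, one table per opponent in both cases. The key layout is mine for illustration; it is not either bot's file format.

    from collections import defaultdict

    overkill_table = defaultdict(lambda: [0, 0])  # strategy -> [wins, games]
    aiur_table = defaultdict(lambda: [0, 0])      # (strategy, map size) -> [wins, games]

    def record(table, key, won):
        # tally one game under the given key
        table[key][0] += 1 if won else 0
        table[key][1] += 1

    # The same game updates a coarser cell for Overkill, a finer one for AIUR.
    record(overkill_table, "cheese", True)
    record(aiur_table, ("cheese", 2), True)

The finer table can discover that a strategy works on one map size only, at the cost of having three times as many cells to fill in.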

Unlike Overkill, AIUR recorded every game exactly once, missing none and adding none, so its data should be easier to interpret.

Here’s a sample table for one opponent. Compare it against AIUR’s row in Overkill’s table from yesterday. See the full AIUR learning results.

overkill
               2           3           4       total
               n  wins     n  wins     n  wins     n  wins
cheese        18   67%     3   33%     1    0%    22   59%
rush           1    0%     1    0%     1    0%     3    0%
aggressive     1    0%     1    0%     1    0%     3    0%
fast expo      1    0%     1    0%     2    0%     4    0%
macro          1    0%     3   33%    25   12%    29   14%
defensive      5   40%     9   33%    15   40%    29   38%
total         27   52%    18   28%    45   20%    90   31%

For reference, here are AIUR’s “moods,” aka strategies.

  • cheese - cannon rush
  • rush - dark templar rush
  • aggressive - fast 4-zealot drop
  • fast expo - nexus first
  • macro - aim for a strong middle game army
  • defensive - be safe against rushes

We see that against Overkill, the cannon rush was relatively successful on 2-player maps, 3-player maps were a struggle, and on 4-player maps AIUR discovered a little late that the defensive mood was better than the macro mood. We also see that AIUR barely explored further when it found a reasonably successful try. If the best strategy was one that happened to lose its first game and didn’t get tried again, it would never know. With so many table cells to fill in, the tremendously long tournament was not long enough for AIUR to explore every possibility thoroughly.

AIUR selected strategies with an initial phase of try-everything-approximately-once followed by an epsilon-greedy algorithm, with epsilon set at 6%. Epsilon-greedy means that 6% of the time it chose a strategy at random, and otherwise it made the greedy choice, the strategy with the best record so far. With 90 games against each opponent to fill in 18 table cells, most cells never came up in the 6% random sample.
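
Here is the whole selection procedure as I understand it, in a short Python sketch. It is a reconstruction from the description above, not AIUR's code; assume history holds this opponent's games on the current map size.

    import random

    STRATEGIES = ["cheese", "rush", "aggressive", "fast expo", "macro", "defensive"]

    def choose_strategy(history, epsilon=0.06):
        # history: list of (strategy, won) pairs for one opponent + map size
        tried = {s for s, _ in history}
        untried = [s for s in STRATEGIES if s not in tried]
        if untried:                    # initial phase: try everything once
            return random.choice(untried)
        if random.random() < epsilon:  # explore 6% of the time
            return random.choice(STRATEGIES)
        def win_rate(s):               # otherwise exploit the best record
            results = [won for st, won in history if st == s]
            return sum(results) / len(results)
        return max(STRATEGIES, key=win_rate)

The arithmetic shows the problem: the initial pass eats 6 games in each of the 3 map-size cells, and 6% of the remaining 72 or so games is only 4 or 5 random picks per opponent, spread across 18 cells.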

It should be clear why AIUR was still improving steadily at the end of the tournament! I offered a theory that AIUR learned so much because of its extreme strategies. If you read through the full set of tables, you’ll see that a strategy which works on one map size only sometimes works on the other sizes too. Learning over the combination of opponent and map size paid off in ways that learning over either alone could not.

Overkill and AIUR fought a learning duel during the tournament. Both are running learning algorithms which assume that the opponent does not change (or at least settles down in the long run), and both bots violated the assumption. AIUR violated it more strongly. Was that an advantage? Could there be a connection with AIUR’s late discovery of the defensive strategy on 4-player maps?

I updated the zip archive of the Perl scripts and related files to add AIUR’s script alongside Overkill’s. By the way, I haven’t tested it on Windows, so it might need a small tweak or two to run there.

AIUR learns more

The protoss bot AIUR by Florian Richoux has a set of hand-coded strategies and learns over time which strategies win against which opponents. That’s a popular religion; other bots like Overkill (see my post on it) and Tscmoo worship at the same altar. But a funny thing happened on the way through the tournament. In the AIIDE 2015 competition report, look at the graph of winning rate over time for the different bots. Let me steal the image showing the top half of participants:

[image: win rates by round in AIIDE 2015]

AIUR’s line is the one in the middle that keeps rising and rising. Look carefully and you can see it leveling off, but it hasn’t reached its asymptote at the end of the very long tournament. AIUR’s learning seems to learn more, and to keep on learning, even though AIUR’s learning method is about the same as other bots. Howzat happen?

Of course AIUR doesn’t do exactly the same thing as other bots. After all, it calls its strategies “moods,” which sounds entirely different. It doesn’t learn an opponent -> strategy mapping, it learns opponent + map size -> strategy, where map size means the number of starting bases, usually 2, 3, or 4. It can figure out that its cannon rush works better on 2-player maps, for example. I imagine that that’s part of the answer, but could it be the whole story?

I have a theory. My theory is that AIUR’s extreme strategies make good probes for weakness. AIUR’s strategies range from absolutely reckless cannon rush, dark templar rush, and 4-zealot drop cheese to defensive and macro-oriented game plans. AIUR’s strategies stake out corners of strategy space. Compare Overkill’s middle-of-the-road zergling, mutalisk, and hydralisk strats, with no fast rushes or slow macro plays, nothing highly aggressive and nothing highly cautious. My theory is that if an enemy makes systematic mistakes, then one of AIUR’s extreme strategies is likely to exploit the mistakes, and AIUR will eventually learn so.

If true, that could explain why AIUR learns more effectively in the long run. Presumably the reason that it takes so long to reach its asymptote is that it has to learn the effect of the map size. The tournament had 27 games per opponent on 2-player maps, 18 on 3-player, and 45 on 4-player, not enough to test each of its 6 strategies repeatedly. It could learn faster by doing a touch of generalization—I’ll post on that some other day.

AIUR also claims to implement its strategies with a further dose of randomness. Intentional unpredictability could confuse the learning algorithms of its enemies. I approve.