Starcraft AI blog | Entries from November 2019

Steamhammer’s documentation is updated

I have updated Steamhammer’s documentation for the first time since version 1.4.2 in May last year. There’s not much to it: One page lists important classes in the code to help people get started, and the other explains the configuration file in great detail.

The documentation is so little used that many people do not seem to realize that it exists. Or maybe it’s the other way around. But now there is no longer an excuse.

Steamhammer 2.4 source released

The source for Steamhammer 2.4 is available at Steamhammer’s web page.

Steamhammer 2.4 change list

Steamhammer 2.4 is uploaded. Since I never got around to releasing the source of the previous version, I’ll release source shortly.

This version has 2 points of focus: First, fixing critical bugs that are responsible for many of the losses against weaker opponents, and second, coping with the deadly Styx opening. I wrote 18 new openings, 10 of which are connected to the Styx opening.

configuration

• Added the Config::Strategy::HumanOpponent flag for games against humans. It’s false by default, since almost all games are versus bots.

critical bugs

• Cancelling mutalisks just before making them is fixed. It happened intermittently in some openings. A spire takes 1800 frames to build, and Steamhammer is configured to conclude that production is jammed if there are 1440 frames without production. In some cases, the opening build might pause production for longer than the limit while saving up for mutalisks; a jam clears the queue and drops out of the opening, so the mutalisks were never made. I fixed it with a special case test, since only this one case was failing.

• Gas deadlock is an emergency that happens when Steamhammer needs gas for the next item, and has the extractors for it, but finds itself unable to collect the gas for some reason (for example, the base is under attack so the drones ran away). It clears the queue to try again. The condition for recognizing gas deadlock was too loose, and sometimes declared deadlock falsely. I tightened it up.

• Tracking of geysers and refinery buildings seemed to be wrong in some cases. I was not able to pin down any bad behavior it caused, but the code looked confused and I’ll be surprised if nothing is corrected.

• Combat sim is more fixed. In the previous version I fixed bugs in setting up the combat sim, related to short-circuit judgments when facing cloaked units. There was still a bug, and I traced it to... checking whether a unit’s air weapon was unequal to BWAPI::UnitTypes::None rather than BWAPI::WeaponTypes::None. Thank you C++ for your “helpful” type checking. This fix will make mutalisks more aggressive, as they should be, in certain situations.
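To see how the wrong comparison in the combat sim item can compile at all, here is a minimal sketch with stand-in types. FakeUnitType, FakeWeaponType, the ID values, and the function names are all hypothetical; they only imitate ID-wrapper types that convert implicitly to int, which is an assumption about how such a bug gets past the compiler, not a copy of BWAPI or Steamhammer code.

```cpp
// Stand-in ID wrapper types; hypothetical, not BWAPI's real classes.
struct FakeUnitType {
    int id;
    constexpr operator int() const { return id; }  // implicit conversion to int
};
struct FakeWeaponType {
    int id;
    constexpr operator int() const { return id; }  // implicit conversion to int
};

// Illustrative sentinel IDs. The two "None" values differ, which is the trap.
constexpr FakeUnitType   kUnitNone{228};
constexpr FakeWeaponType kWeaponNone{130};

// Buggy: compares a weapon against the unit sentinel. Both sides convert
// to int, so the compiler accepts it, and the result is true even when
// the weapon is "none".
constexpr bool hasAirWeaponBuggy(FakeWeaponType w) { return w != kUnitNone; }

// Fixed: compares against the sentinel of the matching type.
constexpr bool hasAirWeaponFixed(FakeWeaponType w) { return w != kWeaponNone; }
```

In this sketch, marking the conversion operators explicit would turn the mixed comparison into a compile error instead of a silent wrong answer.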

macro

• Restored the setting of 2 workers per mineral patch for terran and protoss. It had been accidentally set to the zerg value of 1.6.

operations

• Against certain stronger enemy units, send more defenders. If an archon is in your base wreaking havoc, 2 marines will not stop it.

• The natural expansion cannot be one of the map’s starting bases. This fixes a problem that came up on the map Baekmagoji.

• StrategyBossZerg::enemySeemsToBeDead() works more nearly as intended. There was a minor bug. The enemy is considered to be dead and ready to be buried when all these conditions are met:

1. The enemy starting base has been found (so it can’t happen immediately when the game starts).

2. The enemy has no known surviving bases.

3. The known enemy ground army is not strong enough to threaten to win.

4. The enemy has no known anti-air units or anti-air static defense.

When the enemy looks dead, Steamhammer techs to mutas and switches to a unit mix of drone + mutalisk. The mutalisks can efficiently hunt down any hidden or floating buildings.
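The four conditions for a dead-looking enemy combine as a plain conjunction. A minimal sketch, assuming a hypothetical EnemyKnowledge struct whose field names are invented for illustration and are not Steamhammer's real data model:

```cpp
// Hypothetical summary of what is known about the enemy.
struct EnemyKnowledge {
    bool startingBaseFound;    // condition 1: enemy start location was found
    int  knownSurvivingBases;  // condition 2: must be zero
    bool groundArmyThreatens;  // condition 3: known ground army could still win
    bool hasAntiAir;           // condition 4: anti-air units or static defense
};

// True only when all four conditions hold at once.
bool enemySeemsToBeDead(const EnemyKnowledge& e) {
    return e.startingBaseFound
        && e.knownSurvivingBases == 0
        && !e.groundArmyThreatens
        && !e.hasAntiAir;
}
```

Condition 4 is the one that makes the mutalisk switch safe: with no known anti-air, the flock can hunt buildings without losses.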

unit control

• The micro system drops positional orders for which the position is “close enough” to the previous order. If code orders a move to (x, y) and then on a later frame a move to (x, y+1), the second order is dropped. The tolerance is a few pixels for short-range moves, larger for long moves. This reduces command spamming and smooths unit flow. In the long run, I ideally want to reduce Steamhammer’s APM to human levels.

• Targeting priority for enemy buildings is factored out and more nearly standardized across unit controllers. Steamhammer should destroy enemy bases a little more purposefully, with fewer cases of zerglings tearing down the spire while mutalisks tackle the spawning pool.

• Enemy observers are a higher priority target for ranged units and scourge, depending on the situation.
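The order-dropping rule in the first item of this list can be sketched as below. The constants (4 pixels, 5% of the move length) and all names are assumptions for illustration; the text only says the tolerance is a few pixels for short moves and larger for long ones.

```cpp
#include <algorithm>
#include <cmath>

struct Position { int x; int y; };

// Euclidean distance in pixels.
double pixelDistance(Position a, Position b) {
    return std::hypot(double(a.x - b.x), double(a.y - b.y));
}

// Tolerance grows with the length of the move. Both constants are
// assumed values, not Steamhammer's.
double orderTolerance(double moveLength) {
    const double shortRange = 4.0;   // "a few pixels" for short moves
    const double fraction   = 0.05;  // share of the move length for long moves
    return std::max(shortRange, fraction * moveLength);
}

// True if the new target is close enough to the previous one that the
// new order can be dropped instead of issued.
bool shouldDropOrder(Position unitPos, Position previousTarget, Position newTarget) {
    double moveLength = pixelDistance(unitPos, newTarget);
    return pixelDistance(previousTarget, newTarget) <= orderTolerance(moveLength);
}
```

With these assumed constants, a unit at (0, 0) that was ordered to (100, 100) and is then ordered to (100, 101) keeps its old order, while a new target of (100, 150) goes through.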

zerg

• When in book, if we have no zerglings, do not change a planned zergling into a drone. Steamhammer likes to skip zerglings when it foresees no danger. A similar change was already made for the case when it was out of the opening.

• Queen bug: The queen believed it could infest an unfinished command center. “It has few enough HP, let me fly across the map and take it.” When the building SCV was killed, the queen might sit over the unfinished command center forever. “This infestation sure is taking a long time. Oh well, stick with it!”

• Queen bug: When using broodling, the previous version was intended to be willing to make 2 queens, not the usual 1. After testing the feature, I introduced a typo that prevented it from working. Fixed.

• Prevent production freezes related to queen upgrades. Rare but painful.

• Allow air upgrades in any order, using 1 or 2 spires. Formerly, air carapace had to be first (or else production freezes were possible) and Steamhammer could upgrade in only 1 spire. This any-order-is-OK improvement was made earlier for ground upgrades.

openings

• ZvZ_Overgas8Pool fills a small gap in the repertoire. A very small gap.

• 973HydraBust is a precise modern build versus protoss forge expand. The name is because you are supposed to end up with 9 drones in your main, 7 in your natural, and 3 at a third base. Steamhammer does not have the skills to distribute the drones efficiently as intended, but it does get the right total count.

• 2HatchMutaForever covers a weakness in the zerg strategy boss: There are times when growing the mutalisk flock is best, but the strategy boss doesn’t understand the value of stacking air units and prefers to switch into a different unit mix. The opening doesn’t really make mutalisks indefinitely, only for a long time. It also gets air upgrades.

• 2HatchFakeHydra and 2HatchFakeMuta are deceptive openings that make both hydralisk den and spire. The fake hydra opening makes a hydralisk den when the enemy scout should see it, chases the scout away, and adds the spire later. The enemy may predict hydra play or lurkers. The fake muta opening is similar but in reverse.

• 5HatchPool makes 5 hatcheries before spawning pool, an extremely greedy opening for use against opponents that move out late. (See my mention of Locutus in what Steamhammer learned. I’ve seen 6 hatch before pool work against a bot.) ZvP_5HatchPoolHydra is a specialized version that follows up with mass hydralisks. 5HatchPoolLing gets an evolution chamber before the spawning pool, so that when the first zerglings hatch, melee attack +1 is already half finished. The flood of lings is late but massive.

• The powerful Styx build. The openings 9PoolHatchSpeed7Drone, 9PoolHatchSpeed7DroneB, 9PoolHatchSpeed, 9PoolHatchSpeedAllIn, 9PoolHatchSpeedAllInB are variations on the Styx build of 9 pool, 3 pairs of zerglings, hatchery, extractor (leaving yourself with 7 drones), zergling speed, and deluge the enemy with zerglings (7 drones are enough). That exact build is 9PoolHatchSpeed7DroneB. The versions without a B have the extractor before the second hatchery, which gets faster zergling speed and a slightly slower hatchery. The versions without 7Drone use the extractor trick to end up with an 8th drone without losing time. The 8th drone does not make the zergling attack stronger, because 7 drones are enough to keep the 2 hatcheries completely busy producing, but it does establish leeway to lose a drone to accidents, make an emergency sunken, or transition into another plan. The plain version 9PoolHatchSpeed makes fewer zerglings and leaves the followup to the strategy boss, which is not too helpful because the strategy boss doesn’t understand the idea.

• Anti-Styx openings. If you know your opponent’s exact build order, you can always choose your own build to gain an advantage. (Don’t be predictable!) If the opponent is too greedy, you can clobber them with a rush or a bust before they’re ready. The Styx opening is not greedy; 9 pool is what you play to be safe against cheese. When the opponent is not greedy, you gain an advantage by being greedier, to a calculated extent. AntiStyx_9Pool is the simplest try. It uses the extractor trick for the 8th drone, and turns the drone into a sunken. The zergling wars are about equal, but the sunken at home is safer and allows more freedom of action because there is less need to hang back in defense. At Steamhammer’s level of play, the advantage is slight. However, this anti-Styx build is valuable as a general-purpose anti-cheese build; I configured it for that too. AntiStyx_3Hatch and AntiStyx_Muta are more successful at defending against the Styx opening. They open with overpool rather than 9 pool, gaining a lead in drones. Both make 1 sunken and use the safety it provides to scrape out more drones. The 3 hatch followup adds a third hatchery to eventually win the zergling wars, while the muta followup techs instead. The safe path is narrow; I’m curious to see how well the openings perform in practice.

Turtle openings can also cross the river Styx. Some turtles swim well, and they do not have the same weakness as Achilles. See wins over StyxZ by ZurZurZur, by Simplicity, and by Microwave with its turtle build.

• 9PoolHatchSpeedSpire and 9PoolHatchSpeedLurker are my try to pre-empt bot authors who may add a recognizer for the dangerous Styx opening so they can counter it. These builds start out with the Styx opening stem but transition into tech.

• I made tweaks to AntiZeal_12Hatch and 12HatchTurtle. They should be a trifle stronger.

Steamhammer 2.4 tomorrow

Steamhammer 2.4 is almost ready. I’ll upload it tomorrow. It has accumulated enough critical bug fixes and changes; it is time to put it on the road and see how it drives.

New are 4 variants of the “Styx” opening (popularized, not invented, by StyxZ—as Ogden Nash said, “Don’t be a discoverer, be a promoter.”). A later post will analyze and compare the variants. And I wrote 3 counter-Styx openings designed to defend and come out ahead.

Steamhammer’s SSCAIT prospects

In 2016, when Steamhammer was brand new, it squeaked into the SSCAIT final playoffs through a tiebreaker match. In 2017 and 2018, it was clear to me ahead of time that Steamhammer would make it into the finals. This year, my life and schedule have been disrupted and I made less progress. Some of the work I put in, like queen skills, was more about having fun than about playing better, because I need some fun. The bottom line is, Steamhammer is failing to keep up with advances. Its rank is sliding.

I think that Steamhammer is still likely to make it into the SSCAIT round of 16, but it may be a close call. There’s a real risk that it may fall short. The fixes in version 2.4 may claw back a rank or two, but not enough to eliminate the risk. It’s possible that I can change Steamhammer’s prospects before then by implementing a killer skill (I have one in mind), but that’s risky in itself, and my past attempts to code killer skills before a deadline have fallen short. But we’ll see.

the HumanOpponent flag

I watched the SCHNAIL video and I’m pleased. Today I added a HumanOpponent flag to Steamhammer’s configuration file, to make the bot more fun for humans to play against. If you set it to true, the flag has 2 effects:

1. It tells the opponent model that the opponent is unpredictable, which the opponent model takes to mean “I’d better be unpredictable too so I can’t be exploited.” It chooses openings more randomly. This is just turning on a standard Steamhammer behavior that already existed.

2. When losing, Steamhammer ggs out much earlier. Steamhammer in the past assumed that its opponent is a bot and may mess up the win, so it surrenders barely one step before it is provably unable to win. (Even so, I’ve seen a couple games over the years when it might have won if it hadn’t given up: it had no drones and no combat units and no money, but had units in production that could outfight the opponent, which was also near death. It’s extremely rare.) Versus a human, that gg timing is way way too late, unacceptably late. When HumanOpponent is turned on, Steamhammer follows a two-part rule and ggs only when both parts hold: A. The enemy is much stronger than me: my supply is less than half the enemy’s known supply. B. I have been hurt: my supply has fallen below half of its high water mark. The B part ensures that Steamhammer doesn’t give up without a fight merely because it has been grossly outmacroed.
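The two-part gg rule can be sketched as a single predicate. This assumes both parts must hold at once, which is how the remark about the B part reads; the function and parameter names are hypothetical.

```cpp
// Sketch of the two-part gg rule. All supplies must be in the same unit.
bool shouldGG(int mySupply, int enemyKnownSupply, int myPeakSupply) {
    // Part A: my supply is less than half the enemy's known supply.
    bool enemyMuchStronger = 2 * mySupply < enemyKnownSupply;
    // Part B: my supply has fallen below half of its high water mark.
    bool haveBeenHurt = 2 * mySupply < myPeakSupply;
    return enemyMuchStronger && haveBeenHurt;
}
```

For example, a bot at 40 supply facing 100 known enemy supply does not gg if 40 is also its peak: it has been outmacroed, but it has not yet lost a fight.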

I’m curious to find out how well the gg rule works in real human games. In test games, I thought the gg still came later than a strong human would prefer. But perhaps it is a good fit for human players who deem Steamhammer an interesting opponent. Also, the enemy’s known supply is generally less than the enemy’s true supply, and Steamhammer is weak at scouting, so often it is much less. But I will need to improve scouting as part of my strategy adaptation project, and gg timing may improve when I do.

About SCHNAIL: Obviously we can’t expect SCHNAIL users to edit Steamhammer’s config file and set the HumanOpponent flag. The file won’t be exposed to them at all; they don’t need to know it exists. I asked Sonko if there will be a way for a bot to tell that it is running under SCHNAIL. Whatever the final arrangement ends up being, I will help Steamhammer fit into it so the bot does sensible things in human games.

an amusing Steamhammer bug

I was testing a macro build on the map Baekmagoji, a 2 player map where macro builds are fitting because each main base has 2 geysers and 18 mineral patches—double the usual. (I routinely test on all kinds of maps.) Early in the game, Steamhammer suddenly panicked and canceled a hatchery to get its spawning pool immediately. What was going on?

The bug, or I should say bugs, turned out to be this: Because the enemy main was so rich in resources, Steamhammer decided “well, it’s a bit far away, but still it’s the best choice for my natural base.” When the scout found enemy buildings (“hey, that’s in my natural”), Steamhammer concluded that it was getting proxied. Panic! That was easy to fix; the chosen natural is not allowed to be a starting base. But wait, why was the first expansion hatchery already started somewhere else, so that it could be canceled? Because the choice of the natural base and the actual first expansion taken are not quite coordinated correctly; in some cases they can be different for no good reason. Ack!
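The easy half of the fix, never choosing a starting base as the natural, amounts to a filter in front of whatever scoring picks the best base. A minimal sketch, where the Base struct and its score field are hypothetical stand-ins for the real evaluation:

```cpp
#include <cstddef>
#include <vector>

struct Base {
    int    id;
    bool   isStartingBase;  // one of the map's start locations
    double score;           // hypothetical desirability (resources, distance)
};

// Returns the index of the best natural candidate, or -1 if none qualifies.
int chooseNatural(const std::vector<Base>& bases) {
    int best = -1;
    for (std::size_t i = 0; i < bases.size(); ++i) {
        if (bases[i].isStartingBase) continue;  // the fix: skip starting bases
        if (best < 0 || bases[i].score > bases[best].score) {
            best = static_cast<int>(i);
        }
    }
    return best;
}
```

On a map like Baekmagoji, the rich enemy main may well have the highest score, but the filter removes it before the comparison.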

Baekmagoji is a difficult map for bots. Another issue is that some of the mineral patches in the main are not reachable until other patches are mined out. It’s a clever design, but Steamhammer doesn’t understand it and tries to assign drones to minerals that they can’t reach. Well, that’s not important to fix yet. I can’t think of another competitive map that would trigger this bug (though more than a few Blizzard maps do). Also notice the neutral sunken colonies spreading creep, and the blocking temples and blocking minerals. Lots of tricks lying in wait.

AIIDE 2019 - looking into AITP

AITP follows an “aggressive defense” game plan, similar to SAIDA, where it builds up a strong ground army, then sets up tank lines in forward positions to constrict the enemy like a boa (preferably a snake, sometimes a feather boa). Its overall skill is far less, though; it finished second to last in AIIDE 2019. Overall, after reading some of the key code, I am not impressed (maybe you shouldn’t expect me to have been). The plans are ambitious but the work looks hasty, as if the authors underestimated it and had to rush to make the tournament. Still, the plans are ambitious, and that makes it somewhat interesting.

tactics

One of AITP’s first steps is to calculate map positions for a wall (supplyPoint barracksPoint bunkerPoint), defensive locations (chokePoint1 through chokePoint4) and offensive tank lines (frontLine1 through frontLine3). How does it calculate them? They are hardcoded for every starting base on every map in InformationManager::initializePosition(). That must be part of why AITP did not want to participate in the unknown map tournament. These positions seem to be the foundation on which all the tactical decisions are laid.

Spider mines are placed in neat diagonal double lines at locations offset from the defense line calculated by CombatCommander::updateDefenseLine(), which looks mostly at the supply and the previous defense line and sets the current defense line to chokePoint2 (the natural choke) or to one of the precalculated frontLine values. It’s simple and primitive, with not much examination of the game situation: a good starting try, but nothing you would expect to be strong. It’s silly sometimes; the defense line may be in front of an empty base that is away from the fight. CombatCommander::updateSpiderSquads() sets up spiderSquad1 and spiderSquad2, and also lays mines itself, calculating the offset steps and checking every vulture itself rather than leaving it to the squads. The intended separation of functions is not observed. Micro::LaySpiderMine() is unused.

StrategyManager::shouldBuildTurret() decides how many turrets should go where. Some turrets go to occupied choke points 1 and 2, some next to command centers, and some at the front lines computed above depending on certain “squad positions.” These so-called squad positions come from CombatCommander::getSquadPosition(std::string squadID), which takes a string that is called squad ID but is actually a 2-digit code that identifies a location rather than a squad. The location is usually an offset from the current defense line, but there are exceptions. The ID seems to choose one of a hardcoded set of offsets for the final position.

Units, of course, also go to the current defense line: That is the tank line, soon furnished with mines and turrets, that is meant to restrict the enemy. At some point CombatCommander::updateAttackEnemyBase() becomes true and sets _aimEnemyBase to the location of an enemy base to destroy. When this happens depends on what the current defense line is, but it’s another set of simple calculations using the closest friendly unit to various enemy bases.

what to build

I outlined the build order and unit mix decisions in the post how AITP played. There are game stages A, B and C and “modules” A1, A2 and so on for each game stage.

A1 is the antirush module that starts a barracks at 5, whose play was described in that post. It switches to the next module when there is a bunker and 4 marines and the engineering bay is started. One of AITP’s strategies was A1-B1-B2-C2. B1 makes SCVs up to 20, barracks up to 5, academy and medics, moves out at 10 marines, and switches to the next module after 10 minutes game time. B2 makes a bunker under certain conditions—but only if the previous module was A4. The modules are not in fact modular but know about each other. Then it makes SCVs and marines up to a smaller limit than B1, makes a factory and gets the upgrades and expansion. It switches when there are 2 command centers or the supply reaches 100 (really 100, not 50: int supplyUsed = BWAPI::Broodwar->self()->supplyUsed() / 2;). Module C2 is the middlegame: It makes SCVs to the limit, adds factories, throws bunkers into the middle of the map, gets armory upgrades, and never switches.
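The "really 100, not 50" remark rests on BWAPI reporting supply in doubled units, so that half-supply units like zerglings count as whole numbers. A small sketch of the B2 exit condition as described above; the function names are hypothetical and this is not AITP's code:

```cpp
// BWAPI's supplyUsed() is double the number shown in the game,
// so dividing by 2 gives the displayed supply.
int displayedSupply(int rawSupplyUsed) {
    return rawSupplyUsed / 2;
}

// Leave module B2 when there are 2 command centers
// or the displayed supply reaches 100.
bool shouldLeaveB2(int rawSupplyUsed, int commandCenters) {
    return commandCenters >= 2 || displayedSupply(rawSupplyUsed) >= 100;
}
```

A raw value of 100 is only 50 supply on screen and does not trigger the switch; a raw value of 200 does.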

Here’s an extract showing the strange over-specificity of AITP’s code, from StrategyManager::doSwitchModule() which switches from the current module to the next one. The code does not simply parse out the strategy module names from the StrategyName string, it handles each as a special case, sometimes taking extra actions that I would say should be factored out.

	if (_currentModule == "A1")
	{
		if (Config::Strategy::StrategyName == "A1-B1-B2-C2")
		{
			_currentModule = "B1";
			CombatCommander::Instance().clearSquad("bunkerSquad2");
		}
		else if (Config::Strategy::StrategyName == "A1-B3-C2")
		{
			_currentModule = "B3";
			CombatCommander::Instance().clearSquad("bunkerSquad2");
		}
	}
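For contrast, here is a minimal sketch of the parsing approach: split the strategy name on '-' and step to the module that follows the current one. The helper names parseModules and nextModule are hypothetical, and the sketch leaves out side effects such as clearSquad(), which would need their own per-module hook.

```cpp
#include <cstddef>
#include <sstream>
#include <string>
#include <vector>

// Split a strategy name like "A1-B1-B2-C2" into its module names.
std::vector<std::string> parseModules(const std::string& strategyName) {
    std::vector<std::string> modules;
    std::istringstream in(strategyName);
    std::string name;
    while (std::getline(in, name, '-')) {
        if (!name.empty()) modules.push_back(name);
    }
    return modules;
}

// Returns the module after `current`, or `current` itself if it is
// the last module or does not appear in the strategy.
std::string nextModule(const std::string& strategyName, const std::string& current) {
    std::vector<std::string> modules = parseModules(strategyName);
    for (std::size_t i = 0; i + 1 < modules.size(); ++i) {
        if (modules[i] == current) return modules[i + 1];
    }
    return current;
}
```

With this, nextModule("A1-B1-B2-C2", "A1") gives "B1" and nextModule("A1-B3-C2", "A1") gives "B3" without a special case for either strategy, and a final module like C2 simply never switches.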

AITP’s modules are not easy to read. I think Steamhammer’s explicit build orders plus zerg strategy rules are more perspicuous and no less expressive—but then I would say that, wouldn’t I?

new bot Crona

New bot Crona is, as it says in its description, BananaBrain playing zerg. It was uploaded today and has started out well. The name “Crona” is after an anime villain.

So far, I have only seen Crona play with zerglings and mutalisks, no other units. Here are the names of Crona’s openings, extracted from the binary. “Main Muta/Hydra/Ling” looks like an unrelated string that sneaked into the list, but maybe it’s an opening too.

ZvZ_2hatchling
ZvZ_5pool
ZvT_2hatchling
ZvT_4pool
ZvT_2hatchmuta_12pool
ZvT_2hatchmuta_12hatch
ZvP_2hatchling
ZvP_5pool
ZvP_4pool
ZvP_2hatchmuta
Main Muta/Hydra/Ling
ZvU_2hatchling

The build ZvZ_2hatchling is the “Styx build” of 9 pool, 3 pairs of zerglings, second hatchery, extractor, research zergling speed and produce zerglings for a long, long time. (Note 1: One of the rules of naming is that the origin of a thing cannot be the name; you have to pick something later. Note 2: I’ve tested both variants, and I’m pretty sure that the PurpleSwarm variant with extractor before the second hatchery is better.) Likely the other 2hatchling openings are too.

When playing 4 pool, and presumably other builds, Crona sticks with 1 hatchery and 3 drones for a while, then transitions to 2 hatcheries and 7 drones, then later to 3 hatcheries. I assume the sequence continues. My impression is that the expansions are on a timer: I’ve done this long enough, time for another hatchery. It’s a simple way to slowly increase pressure on the opponent.

Crona’s zerg play is good—see its results—but still looks a little rough to me. It doesn’t scout with its overlord. It has a glitch where, at a certain point in the opening, all the drones move away from the minerals for a second before returning. Crona seems a little confused about drone transfers in general. These things should not be hard to fix, though.

I see terran openings in the binary too. Can anybody guess what the terran BananaBrain will be called?

immediate plans

My time and energy are still limited. Well, my plans always outrun my execution anyway. Here are things I hope to get to soon:

• Write up a confused Steamhammer-McRave game.
• Look into AITP’s code.
• I forgot to release source for Steamhammer 2.3.6. I should do that.
• Look more closely at ZZZKBot’s wide swing in AIIDE results.
• Analyze CoG and the AIIDE unknown maps competitions to some degree.

For Steamhammer, I’m constantly torn between the need for immediate repairs of glaring weaknesses and the need to make long-term progress on strategy adaptation and infrastructure rework. The long run is what counts, but there are always lost games and upcoming tournaments to urge me to make quick fixes.

I will do both. For the upcoming SSCAIT, it looks like I will make another point release, Steamhammer 2.4, in which the visible changes will be more on the quick fix side. It will have work (it already has work) on strategy adaptation, but little or none that will be active yet. I don’t have the energy to put my head down and bring off a substantial subproject.

old bot Styx

Old zerg bot StyxZ has been getting updates recently, and I have been watching it steadily climb the rankings. Today Styx became the top zerg on BASIL, at least for the moment, so clearly it is time to write about it!

To give an impression of how far Styx has climbed, its lifetime win rate on SSCAIT is about 34%. In the last 24 hours on BASIL, I count 5 losses and 35 wins, an 88% win rate. On the BASIL crosstable its row is almost all red and stands out sharply in its new green environment. The degree of improvement is astounding.

Styx is distributed as a JAR file, so I unpacked it and took a look. I found that Styx is written in Kotlin, a recent language. That makes me wonder: Is this Styx version a complete rewrite, new code that replaces the older version? That would fit in with the quantum leap in results. But I also see both of the map analysis libraries BWTA and BWEM, which gives the impression that Styx is in the process of transitioning to BWEM, though perhaps it is using both to take advantage of different features. Also in there is the jts geometry library, which seems potentially useful for map-related calculations. The jts version is 1.16.1, officially released this past February (I love that it is the same as our Starcraft version number).

Also in the object code I see the ASS combat simulator, a project started only last year. I see the jsoniter JSON parser, which suggests that Styx is reading data files of some kind, whether static initialization and configuration or dynamic learning files.

The feeling I get is that work on Styx was restarted sometime this year, and a substantial amount of new work has been done. It’s hard to say exactly what, though, since I didn’t save an old version to compare.

What specifically makes Styx stronger? The most obvious improvement is a new build, much sharper than what Styx used to play: It is 9 pool, second hatchery, then zergling speed, and don’t waste time making extra drones but keep up zergling production with a 7 drone economy, which is enough for constant production from the 2 hatcheries. The mass lings overrun opponents of every race, including solid defenders like McRave. The purple zerg PurpleSwarm has been playing a similar build for a while, also with success.

The new build is great, but it does make me wonder how much the other changes to Styx contribute.

new bot adias

Adias is a new terran bot, and starting off strongly. The first thing I noticed about it is that its name is SAIDA spelled backward. Does that mean it is like SAIDA, or the opposite of SAIDA? The second thing I noticed, after unpacking the binary from SSCAIT, is a machine learning folder with files caffe2.dll, torch.dll, and a couple of learned models with the .pt suffix. It is the signature of a project using PyTorch for deep learning. It’s not TorchCraft, so not directly related to CherryPi. It also does not seem to be based on SAIDA_RL; strings I checked from SAIDA_RL do not appear in adias.

The third thing I noticed, peering into the .exe, is that it looks to be derived from SAIDA. It’s clear that the two share a lot of code, at least. That’s surprising; based on SAIDA and using RL but not based on SAIDA_RL? Close similarities to SAIDA are visible in the game play. For example, adias shares SAIDA’s way of grouping supply depots and tech buildings together, and SAIDA’s predilection for walls composed of barracks plus engineering bay. Both habits appear in the game adias-Tomas Cere on Benzene. There are a lot more similarities; the building placement and opening build orders look identical to me, and adias shares the skill to set up tank lines where the opponent will be compelled to engage at a disadvantage.

The combination of traits reminds me of something else: Stormbreaker, the bot which was disqualified from AIIDE 2019 for being SAIDA with cosmetic changes and a neural network whose result was not used. Is adias only Stormbreaker under a different name?

Well, I don’t know, but I’m guessing not. Though SAIDA and adias look more similar than not, I went back and watched old SAIDA games and I think I see differences too. The difference most salient to me is that adias likes to send an SCV to scout around inside and near its base early in the game, seeming to check for proxies. The only other bot I’ve seen with that style of anti-proxy scouting is tscmoo. Did tscmoo have a hand in adias?

I expect we’ll learn more as we get to see more games. Krasi0 was eventually updated to defeat SAIDA. Will Krasi0 quickly learn to beat adias too, or will Krasi0 have to be updated again to keep up?

AIIDE 2019 - how AITP played

I think AITP is the only AIIDE 2019 bot whose game play deserves its own post. Most others can be watched at SSCAIT and BASIL, and BunkerBoxeR is severely buggy. AITP finished second to last, but it is complex and interesting. In this post, the game links are links to replays; you’ll have to drop them into the OpenBW player yourself.

AITP is derived from Steamhammer and shares some of the same habits. For example, it places supply depots and other buildings of the same size preferentially at the edge of the map, like Steamhammer terran. As explained in what AITP learned, the bot has an interesting abstract strategy system that is different from any other Steamhammer derivative.

AITP does not have strategy work only. It knows how to make a wall at the ramp, how to lift the barracks to open the wall, and how to land the barracks nearby to leave the wall open. It also knows how to lift the barracks back and restore the wall in case of danger. It researches spider mines and lays mines in neat diagonal lines in places like the approach to its natural. Later in the game, when it has a dangerous tank ball, it moves out and sieges to cover movement routes, like SAIDA but with less understanding of where the important places are, and builds turrets there. It has an excessive desire to build bunkers in the middle of the map, which it then doesn’t use properly.

It has also lost important skills that it inherited. It seems to have lost the emergency reaction of taking SCVs off gas when mineral mining is more important because of a large gas excess. Marines may stand away from a bunker that they should jump into, a new and critical misplay. AITP often moves its army away from the action toward a distant empty base, as in this game versus XiaoYi on Fortress (possibly a bug in deciding on the squad target).

Overall, I judge AITP as promising but not mature. I imagine that the tournament came up too early in its development. Some of its skills are impressive, others look incomplete or broken. It needs work to fix bugs (223 crashes during the tournament), polish skills, and add strategy modules to cover more cases. If it gets that work, I think it could become very strong.

Here are my observations of how AITP played when following each of its 5 declared openings. I correlated its learning files with the detailed tournament results to find games with each strategy. Later I’ll read the code and decipher the strategy modules more fully. All strategies involve initial buildings placed at the base entrance. On the map Aztec, where the main base is on low ground below a ramp, it smartly builds at the entrance to the natural instead of the entrance to the main. Few bots know to do that; even Locutus got in some trouble by not doing that.

Every one of these strategies is defensive, as you might expect with buildings at the entrance. I think AITP has good vulture micro and would benefit from having the option of more aggressive vulture play. Also none of the openings is honed to a fine edge; they are all a little rough.

A1-B1-B2-C2 Make a very fast barracks at 5 supply, placed at the base entrance, then finish the wall with a bunker and supply depot. This is the build labeled AntiRush. Later add more barracks and get medics (though no marine upgrades) and move out. I initially guessed that AITP played factory unit mixes every game, but this is a barracks unit mix. This plan scored wins only against random UAlbertaBot, which of course favors rushes. Here is a win on Heartbreak Ridge versus protoss zealots.

A1-B3-C2 Make a very fast barracks at 5 supply, placed at the base entrance, then finish the wall with a bunker and supply depot (this part must be A1). Follow up with factory and ebay and turret up the base. Get vultures with speed first, then expand. Eventually switch into a tank-goliath unit mix (I’m guessing this is C2, which if true means that I didn’t see a game with the anti-rush A1-B1-B2-C2 strategy that got as far as stage C). This was AITP’s most successful strategy by far; in fact, I would call it the only successful strategy. It was a top choice against ZZZKBot, Microwave, McRave, UAlbertaBot, and BunkerBoxeR. Here is one of AITP’s 7 wins against Microwave on Python where Microwave played a 9 pool and followed up unambitiously.

A3-B5-C1 Narrow the base entrance with supply depot then barracks, and start a factory before the first marine. Get vultures with mines first and remain defensive. Start a command center in the base, and keep making vultures. When the command center is done, float it into the natural, keep defending with the vultures, and finally add tanks before moving out. This strategy scored zero against every opponent except BunkerBoxeR, which played broken builds. In the first game versus Iron on Benzene, AITP put up a creditable fight.

A3-B7-C1 Narrow the base entrance with supply depot then barracks, and start a factory before the first marine. Get vultures with mines first and remain defensive (up to here must be A3). Then get tanks and start a command center in the base. Not a suitable opening against aggressive play. When there are 2 tanks (no siege mode), lift the finished command center toward the natural. This plan did not show good success against any opponent, but worked better than A3-B5-C1 above. Here is a game versus McRave on Destination.

A4-B2-C1 Wall the base entrance with barracks, depot, bunker, in that order. Make marines and vultures and a fast ebay for turrets. When there are 5 marines, push out, bunker the natural, and expand. The strategy does not impress me; it is not greedy enough to gain an advantage against a cautious opponent (DaQin) and, as implemented, not safe enough to survive an aggressive opponent (Microwave). Watch how effortlessly PurpleWave wins with 2 dragoons and straightforward play when the 5 marines move out; AITP is missing the defensive skills to make it work.

CoG 2019 downloads fixed

For those like me who have not been paying attention, the CoG 2019 results page has had its broken downloads fixed. The SOURCE_CODE link now gives you source, and REPLAY_04 is a valid zip file full of replays.

AIIDE 2019 - what Microwave did

Here’s data from Microwave’s history files, using the same script as for BananaBrain with a little customization. Unlike Microwave’s learning files, which deliberately omit data and include information from pre-learning, the history files tell what Microwave actually did during the games. Microwave didn’t record information about the opponent’s strategy, so that table is left out. That made it look a little sparse, so I added columns giving the first and last games when the opening was tried, where the first game in the history file is game 0. We can see things like when a winning opening was found, and whether it kept winning. If there are fewer than 100 games recorded for an opponent because Microwave crashed, then the game numbers generally do not align with the tournament round numbers.
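Here is a minimal sketch of how a table with these columns can be computed. It assumes each game has already been reduced to an (opening, won) pair, listed in the order played. That layout and the function names are mine, chosen for illustration; Microwave's actual history file format is not shown here, and this is not the script I used.

```python
def summarize_openings(games):
    """Summarize (opening, won) pairs listed in the order played.

    Game numbers start at 0, matching the 'first' and 'last' columns.
    """
    stats = {}
    for number, (opening, won) in enumerate(games):
        entry = stats.setdefault(
            opening, {"games": 0, "wins": 0, "first": number, "last": number}
        )
        entry["games"] += 1
        entry["wins"] += int(won)
        entry["last"] = number  # overwritten each time, so it ends at the final use
    return stats


def format_table(stats):
    """Render tab-separated rows plus a totals line."""
    lines = ["opening\tgames\twins\tfirst\tlast"]
    for opening in sorted(stats):
        e = stats[opening]
        rate = round(100 * e["wins"] / e["games"])
        lines.append(f"{opening}\t{e['games']}\t{rate}%\t{e['first']}\t{e['last']}")
    total_games = sum(e["games"] for e in stats.values())
    total_wins = sum(e["wins"] for e in stats.values())
    total_rate = round(100 * total_wins / total_games) if total_games else 0
    lines.append(f"{len(stats)} openings\t{total_games}\t{total_rate}%")
    return "\n".join(lines)


# Made-up example data, not taken from the tournament.
example = [("OpeningA", False), ("OpeningB", True), ("OpeningB", True),
           ("OpeningA", True), ("OpeningB", False)]
print(format_table(summarize_openings(example)))
```

Because the game number is simply the record's position in the file, a crashed game that was never recorded shifts every later number, which is why the numbers can drift away from the tournament round numbers.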

Against difficult opponents, Microwave experimented widely. Against some opponents that Microwave pre-trained against, it played whatever came out of pre-training. So I don’t have much to say about opponents in the top half of the post, but toward the bottom I’ve made some comments. See especially the note on AITP.

#1 locutus

opening	games	wins	first	last
10Hatch9Pool9gas	8	12%	1	52
2HatchHydra	7	0%	0	53
2HatchLurker	7	29%	83	89
2HatchLurkerAllIn	2	0%	63	90
2HatchMuta	12	25%	3	56
3HatchHydraBust	3	0%	10	57
3HatchLingBust	3	0%	38	91
3HatchPoolHydra	5	0%	16	92
4HatchBeforeGas	4	0%	27	93
4PoolHard	3	0%	15	58
4PoolSoft	4	0%	21	59
5Pool	2	0%	36	60
5PoolSpeed	3	0%	41	94
6Pool	3	0%	42	95
6PoolSpeed	3	0%	43	96
7Pool	2	0%	37	61
8Pool	3	0%	44	97
9Pool	9	22%	45	78
9PoolLurker	2	0%	46	79
9PoolSpeed	3	0%	11	62
9PoolSpeedLing	2	0%	47	80
ZvP_10Hatch9Pool	4	0%	17	81
ZvZ_Overpool11Gas	4	0%	18	82
23 openings	98	8%

#2 purplewave

opening	games	wins	first	last
10Hatch9Pool9gas	11	9%	20	93
2HatchHydra	6	0%	14	87
2HatchMuta	5	0%	35	94
3HatchHydraBust	9	0%	3	95
3HatchLingBust	14	7%	0	74
4PoolHard	1	0%	80	80
4PoolSoft	7	14%	30	75
5Pool	8	12%	15	90
5PoolSpeed	1	0%	81	81
6Pool	1	0%	82	82
6PoolSpeed	1	0%	83	83
7Pool	8	0%	17	76
8Pool	4	0%	42	91
9Pool	1	0%	84	84
9PoolSpeed	3	0%	52	92
9PoolSpeedLing	14	21%	4	77
ZvP_10Hatch9Pool	1	0%	85	85
ZvZ_Overpool11Gas	1	0%	86	86
18 openings	96	7%

#3 bananabrain

opening	games	wins	first	last
10Hatch9Pool9gas	1	0%	54	54
2HatchHydra	1	0%	51	51
2HatchMuta	1	0%	52	52
3HatchLingBust	37	49%	0	92
4PoolHard	3	0%	29	63
4PoolSoft	4	0%	28	67
5Pool	11	45%	22	76
5PoolSpeed	7	29%	19	78
6Pool	1	0%	62	62
6PoolSpeed	5	20%	20	68
7Pool	1	0%	55	55
8Pool	3	0%	24	69
9Pool	7	43%	56	70
9PoolSpeed	1	0%	53	53
9PoolSpeedLing	3	0%	25	71
ZvZ_Overgas9Pool	4	0%	26	77
ZvZ_Overpool11Gas	3	0%	35	79
17 openings	93	31%

#4 daqin

opening	games	wins	first	last
10Hatch9Pool9gas	11	18%	2	77
2HatchHydra	4	0%	18	78
2HatchLurker	4	0%	23	79
2HatchMuta	13	23%	17	89
3HatchHydraBust	3	0%	20	51
3HatchLingBust	31	39%	16	76
3HatchPoolHydra	3	0%	25	52
4PoolSoft	3	0%	6	53
5Pool	3	0%	7	54
7Pool	3	0%	11	55
9Pool	3	0%	1	56
9PoolSpeed	3	0%	10	57
9PoolSpeedLing	3	0%	0	58
ZvP_10Hatch9Pool	3	0%	5	59
14 openings	90	19%

#5 steamhammer

opening	games	wins	first	last
9PoolSpeed	100	75%	0	99
1 openings	100	75%

#6 zzzkbot

opening	games	wins	first	last
9PoolHatch	1	0%	0	0
ZvZ_Overgas11Pool	70	80%	1	70
2 openings	71	79%

Why are only 71 games recorded? According to the official results, Microwave crashed in 56 games throughout the tournament, and 29 of those crashes happened against ZZZKBot. Microwave recorded every game in which it did not crash. It’s a debugging opportunity. :-/

#8 iron

opening	games	wins	first	last
10Hatch9Pool9gas	2	0%	53	82
2HatchHydra	1	0%	83	83
2HatchLurkerAllIn	2	0%	63	88
2HatchMuta	11	9%	0	72
3HatchHydraBust	15	33%	5	77
3HatchHydraExpo	1	0%	84	84
3HatchPoolHydra	1	0%	85	85
4HatchBeforeGas	4	0%	18	89
4PoolHard	6	0%	13	78
4PoolSoft	7	14%	11	71
5Pool	1	0%	86	86
5PoolSpeed	6	0%	14	79
6Pool	2	0%	54	87
6PoolSpeed	5	20%	35	92
7Pool	10	30%	19	68
8Pool	7	14%	17	80
9Pool	8	12%	1	95
9PoolSpeedLing	4	0%	21	96
OverpoolTurtle	4	0%	22	81
19 openings	97	13%

#9 xiaoyi

opening	games	wins	first	last
10Hatch9Pool9gas	2	0%	42	47
2HatchLurker	1	0%	48	48
2HatchMuta	2	0%	45	46
4PoolSoft	38	63%	1	38
5Pool	2	50%	0	39
7Pool	51	76%	49	99
9Pool	2	50%	40	41
9PoolSpeedLing	2	0%	43	44
8 openings	100	65%

As soon as Microwave found that 7 pool worked, it played 7 pool exclusively.

#10 mcrave

opening	games	wins	first	last
2HatchMuta	40	62%	0	79
3HatchHydraBust	13	92%	86	98
4PoolHard	1	0%	80	80
4PoolSoft	40	62%	1	40
9Pool	1	0%	85	85
ZvZ_Overgas11Pool	4	50%	81	84
6 openings	99	65%

Microwave was late to discover the success of the hydra bust opening. That’s why it was played so little. The example shows the importance of finding good ideas as early as possible. I am adding smarts to Steamhammer to make it better at finding the good tries fast.

It’s interesting that 2HatchMuta and 4PoolSoft have the same numbers, but were given up on at different times.

#11 ualbertabot

opening	games	wins	first	last
4PoolSoft	100	82%	0	99
1 openings	100	82%

The choice against UAlbertaBot was determined by pre-training. From scratch, I expect Microwave would have tried a wider variety.

#12 aitp

opening	games	wins	first	last
9PoolSpeedLing	100	93%	0	99
1 openings	100	93%

If the first try wins, keep it up. What if Microwave had an opening that would have won more than 93%? The theory is that, above some winning rate, the risk of losing by trying alternatives is higher than the risk of losing by sticking with a known good opening. But what winning rate is high enough to stick with? It depends on how much you respect your opponents. If you expect to win nearly every game, like Locutus, maybe you should switch to an alternative as soon as you lose a single game. If you expect to finish near the bottom, maybe you should stick with a strategy that wins 50%.

But more: How much do you respect each opponent? Maybe bots should have a “contempt factor” like chess programs may use to decide whether to aim for a draw: Accept a low winning rate strategy against Locutus, but demand 95% wins against the unknown who you’ve decided is a weak newbie. I would rather call it a respect factor! In a UCB algorithm, a level of respect is implicitly encoded in the exploration rate constant. Does any bot already have a respect factor for specific opponents?
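To make the UCB point concrete, here is a small sketch of the textbook UCB1 rule with an adjustable exploration constant. It is not taken from any bot's code. The function name, the parameter name `exploration`, and the example records are assumptions for illustration. Under my reading, a larger constant acts like lower respect for the opponent: the bot is less willing to settle for a decent opening and keeps trying alternatives.

```python
import math


def ucb_choice(records, exploration):
    """Pick an opening by UCB1.

    records: {opening: (games, wins)}
    exploration: constant multiplying the uncertainty bonus (hypothetical
    stand-in for a per-opponent respect setting).
    """
    total = sum(games for games, _ in records.values())
    best, best_score = None, -math.inf
    for opening, (games, wins) in sorted(records.items()):
        if games == 0:
            return opening  # standard UCB1 tries every untried option first
        bonus = exploration * math.sqrt(math.log(total) / games)
        score = wins / games + bonus
        if score > best_score:
            best, best_score = opening, score
    return best


# Made-up records: one opening with a long winning history, two barely tried.
records = {"9PoolSpeedLing": (30, 28), "5Pool": (2, 1), "7Pool": (2, 1)}
print(ucb_choice(records, exploration=0.1))  # small constant: stay with the proven opening
print(ucb_choice(records, exploration=1.0))  # large constant: go back to a little-tried one
```

Note that textbook UCB1 tries every option once before settling, so by itself it would not reproduce the behavior of sticking with the first opening that wins.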

#13 bunkerboxer

opening	games	wins	first	last
5Pool	100	99%	0	99
1 openings	100	99%

Apparently the initial choice against an unknown is random.

AIIDE 2019 - what BananaBrain learned

I wrote a script to analyze BananaBrain’s game history files, which record its experience with each opponent. For now, I had the script summarize the strategies played and the enemy strategies recognized. The history files also record the map and a value that represents the game duration. History files are rich with information, and there are many ways to summarize it. It would be interesting to see how strategy usage and win rate vary by map, among other possibilities.

The same script should work with minor changes to summarize Microwave’s history files.

BananaBrain had prepared history files for the opponents #1 Locutus, #2 PurpleWave, #5 Steamhammer, #6 ZZZKBot, #7 Microwave, and #8 Iron. Data from the prepared history files was not copied into the write directory. That is different from how Steamhammer and Locutus keep their game records, and it has the nice effect that the tables show exactly what happened in the tournament, from BananaBrain’s point of view.

For each opponent, the left table is BananaBrain’s choice. The right table is BananaBrain’s idea of what the opponent did. All the win rates are from BananaBrain’s point of view, so that, for example, when Locutus played P_1gatecore, BananaBrain won 5% of the time. Of course, the opponent’s view of its own strategy is likely to be more fine-grained than BananaBrain’s. To take the extreme case, Steamhammer played 30 different openings against BananaBrain, and BananaBrain recognized them in 8 categories.
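The two tables are two groupings of the same game records, which is why their totals always agree. A minimal sketch, assuming each record has been reduced to BananaBrain's opening, the recognized enemy opening, and the result; the field names and example data are hypothetical, not BananaBrain's file format:

```python
from collections import defaultdict


def win_table(records, key):
    """Group records by record[key]; return {name: (games, wins)} in sorted order."""
    counts = defaultdict(lambda: [0, 0])  # name -> [games, wins]
    for record in records:
        entry = counts[record[key]]
        entry[0] += 1
        entry[1] += int(record["won"])  # wins are always from our own point of view
    return {name: tuple(counts[name]) for name in sorted(counts)}


# Made-up records for illustration only.
records = [
    {"opening": "OpeningA", "enemy": "EnemyX", "won": True},
    {"opening": "OpeningA", "enemy": "EnemyY", "won": False},
    {"opening": "OpeningB", "enemy": "EnemyX", "won": True},
    {"opening": "OpeningB", "enemy": "EnemyX", "won": False},
]
own = win_table(records, "opening")   # left table: what we played
enemy = win_table(records, "enemy")   # right table: what we think the enemy played
print(own)
print(enemy)
```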

#1 locutus

opening	games	wins
PvP_10/12gate	6	17%
PvP_12nexus	11	36%
PvP_2gatedt	10	0%
PvP_2gatedtexpo	9	0%
PvP_3gaterobo	5	0%
PvP_3gatespeedzeal	8	25%
PvP_4gategoon	6	0%
PvP_9/9gate	12	8%
PvP_9/9proxygate	9	0%
PvP_nzcore	8	12%
PvP_zcore	4	0%
PvP_zcorez	6	0%
PvP_zzcore	6	17%
13 openings	100	10%

enemy	games	wins
P_1gatecore	20	5%
P_cannonrush	29	7%
P_fastexpand	1	0%
P_ffe	19	21%
P_unknown	31	10%
5 openings	100	10%

As you might expect against Locutus, the best choice was a fast expansion.

Is the single game of enemy P_fastexpand a misrecognition? I suspect that Locutus played otherwise, and BananaBrain didn’t see everything and wasn’t able to draw the right conclusion. Or maybe it’s a bug somewhere. PurpleWave and McRave also show a single P_fastexpand game.

#2 purplewave

opening	games	wins
PvP_10/12gate	23	70%
PvP_12nexus	2	0%
PvP_2gatedt	6	17%
PvP_2gatedtexpo	3	33%
PvP_3gaterobo	2	0%
PvP_3gatespeedzeal	1	0%
PvP_4gategoon	8	38%
PvP_9/9gate	26	88%
PvP_9/9proxygate	13	62%
PvP_nzcore	3	0%
PvP_zcore	4	25%
PvP_zcorez	5	40%
PvP_zzcore	4	25%
13 openings	100	56%

enemy	games	wins
P_1gatecore	54	56%
P_2gate	25	60%
P_2gatefast	6	33%
P_fastexpand	1	0%
P_ffe	2	50%
P_unknown	12	67%
6 openings	100	56%

Against PurpleWave, different zealot rushes worked best. Maybe it is because zealot rushes depend for their success more on execution than on the enemy’s strategic reaction. PurpleWave is particularly good at reacting to the enemy strategy, and BananaBrain is good at execution.

#4 daqin

opening	games	wins
PvP_10/12gate	8	62%
PvP_12nexus	6	33%
PvP_2gatedt	6	17%
PvP_2gatedtexpo	12	83%
PvP_3gaterobo	7	14%
PvP_3gatespeedzeal	6	33%
PvP_4gategoon	5	0%
PvP_9/9gate	14	93%
PvP_9/9proxygate	9	67%
PvP_nzcore	7	43%
PvP_zcore	6	33%
PvP_zcorez	7	43%
PvP_zzcore	7	43%
13 openings	100	51%

enemy	games	wins
P_1gatecore	82	50%
P_unknown	18	56%
2 openings	100	51%

BananaBrain made quite a variety of tries, and was most successful with... zealot rush and dark templars, which are kind of different. BananaBrain’s varied opening choice is a strength.

#5 steamhammer

opening	games	wins
PvZ_10/12gate	15	100%
PvZ_1basespeedzeal	8	88%
PvZ_2basespeedzeal	11	82%
PvZ_4gate2archon	7	57%
PvZ_5gategoon	7	86%
PvZ_9/9gate	12	92%
PvZ_9/9proxygate	15	100%
PvZ_bisu	4	75%
PvZ_neobisu	2	50%
PvZ_sairdt	7	100%
PvZ_sairgoon	2	0%
PvZ_stove	10	70%
12 openings	100	85%

enemy	games	wins
Z_10hatch	38	76%
Z_12hatch	31	84%
Z_12pool	11	91%
Z_4/5pool	3	100%
Z_9pool	1	100%
Z_9poolspeed	4	100%
Z_overpool	2	100%
Z_unknown	10	100%
8 openings	100	85%

2 gate zealot openings work well against Steamhammer—but only when played by PurpleWave or BananaBrain. Steamhammer can usually defend versus a lesser protoss.

#6 zzzkbot

opening	games	wins
PvZ_10/12gate	17	100%
PvZ_1basespeedzeal	11	91%
PvZ_2basespeedzeal	4	25%
PvZ_4gate2archon	4	50%
PvZ_5gategoon	6	67%
PvZ_9/9gate	15	100%
PvZ_9/9proxygate	3	67%
PvZ_bisu	5	60%
PvZ_neobisu	4	25%
PvZ_sairdt	12	100%
PvZ_sairgoon	6	50%
PvZ_stove	13	100%
12 openings	100	83%

enemy	games	wins
Z_4/5pool	33	85%
Z_9pool	17	100%
Z_9poolspeed	2	100%
Z_overpool	23	65%
Z_unknown	25	84%
5 openings	100	83%

I like that BananaBrain varies its opening choice even when several openings win 100%. (Steamhammer does too; if more than one opening has scored 100% so far, Steamhammer chooses randomly among them.) Playing a strong opening gives the opponent one problem to solve (“how do I survive this?”). Unpredictably playing one of several strong openings sets the opponent two problems (“what is this fiend doing, and then how do I live through it?”). Both must be solved, which is more than twice as difficult.
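The rule of choosing randomly among openings that have never lost can be sketched in a few lines. This is an illustration in Python, not Steamhammer's code (which is C++); the function name and record layout are assumptions, and what happens when no opening has a perfect record is deliberately left out.

```python
import random


def choose_among_perfect(records, rng=random):
    """records: {opening: (games, wins)}.

    Return a random opening among those that have been played and have
    won every game so far, or None if there is no such opening.
    """
    perfect = sorted(
        name for name, (games, wins) in records.items()
        if games > 0 and wins == games
    )
    if not perfect:
        return None  # the fallback choice is outside the scope of this sketch
    return rng.choice(perfect)


# Made-up records: two openings are undefeated, one is not.
records = {"OpeningA": (5, 5), "OpeningB": (3, 3), "OpeningC": (4, 2)}
print(choose_among_perfect(records))
```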

#7 microwave

opening	games	wins
PvZ_10/12gate	20	90%
PvZ_1basespeedzeal	11	73%
PvZ_2basespeedzeal	3	33%
PvZ_4gate2archon	6	50%
PvZ_5gategoon	8	75%
PvZ_9/9gate	17	88%
PvZ_9/9proxygate	8	75%
PvZ_bisu	10	60%
PvZ_neobisu	3	33%
PvZ_sairdt	4	50%
PvZ_sairgoon	2	0%
PvZ_stove	8	62%
12 openings	100	71%

enemy	games	wins
Z_10hatch	8	88%
Z_12hatch	38	55%
Z_12pool	2	100%
Z_4/5pool	28	71%
Z_9pool	9	67%
Z_9poolspeed	7	100%
Z_overpool	3	100%
Z_unknown	5	100%
8 openings	100	71%

#8 iron

opening	games	wins
PvT_10/12gate	6	67%
PvT_10/15gate	3	0%
PvT_12nexus	4	25%
PvT_1gatedtexpo	25	84%
PvT_2gatedt	10	60%
PvT_9/9gate	10	60%
PvT_9/9proxygate	4	75%
PvT_bulldog	1	0%
PvT_dtdrop	14	64%
PvT_nzcore	5	40%
PvT_proxydt	2	0%
PvT_stove	4	25%
PvT_zcore	5	40%
PvT_zzcore	7	43%
14 openings	100	58%

enemy	games	wins
T_1fac	30	63%
T_2fac	1	0%
T_fastexpand	29	48%
T_unknown	40	62%
4 openings	100	58%

Bulldog! That involves protoss dropping zealots, typically on cliff tanks, with a simultaneous attack by ground. When successful, a bulldog can abruptly break a terran defense that is sound against any purely ground attack. I don’t think I’ve seen BananaBrain play that; I should watch more games versus terran. Can anybody point out an example?

#9 xiaoyi

opening	games	wins
PvT_10/12gate	10	90%
PvT_10/15gate	7	43%
PvT_12nexus	5	20%
PvT_1gatedtexpo	11	100%
PvT_2gatedt	7	57%
PvT_9/9gate	6	33%
PvT_9/9proxygate	6	17%
PvT_bulldog	5	0%
PvT_dtdrop	9	89%
PvT_nzcore	6	17%
PvT_proxydt	7	71%
PvT_stove	8	75%
PvT_zcore	6	33%
PvT_zzcore	7	57%
14 openings	100	57%

enemy	games	wins
T_1fac	37	57%
T_fastexpand	20	65%
T_unknown	43	53%
3 openings	100	57%

The Stove worked against XiaoYi? Again, XiaoYi shows weakness against tricks. The Stove involves making scouts to harass while teching to dark templar. It should not be hard for a good terran to defend against; notice that Iron dealt with it well enough.

#10 mcrave

opening	games	wins
PvP_10/12gate	7	71%
PvP_12nexus	6	50%
PvP_2gatedt	6	67%
PvP_2gatedtexpo	8	50%
PvP_3gaterobo	9	78%
PvP_3gatespeedzeal	8	62%
PvP_4gategoon	7	57%
PvP_9/9gate	8	75%
PvP_9/9proxygate	6	33%
PvP_nzcore	10	90%
PvP_zcore	7	57%
PvP_zcorez	10	90%
PvP_zzcore	8	88%
13 openings	100	69%

enemy	games	wins
P_1gatecore	34	74%
P_2gate	26	65%
P_2gatefast	29	69%
P_fastexpand	1	0%
P_proxygate	4	100%
P_unknown	6	50%
6 openings	100	69%

It looks like most openings performed similarly against McRave, and BananaBrain struggled to identify what worked. I imagine a fierce learning battle, both trying to keep one step ahead.

#11 ualbertabot

opening	games	wins
PvU_10/12gate	17	94%
PvU_9/9gate	17	100%
PvU_9/9proxygate	13	85%
PvU_flex	12	67%
PvU_nzcore	11	64%
PvU_zcore	16	88%
PvU_zzcore	13	77%
7 openings	99	84%

enemy	games	wins
P_1gatecore	8	100%
P_2gate	6	83%
P_2gatefast	21	71%
P_unknown	3	33%
T_1fac	5	100%
T_2fac	7	100%
T_2rax	10	90%
T_fastexpand	3	100%
T_unknown	5	100%
Z_10hatch	2	100%
Z_12hatch	8	100%
Z_4/5pool	17	71%
Z_unknown	4	75%
13 openings	99	84%

#12 aitp

opening	games	wins
PvT_10/12gate	7	100%
PvT_10/15gate	8	100%
PvT_12nexus	6	100%
PvT_1gatedtexpo	8	100%
PvT_2gatedt	7	100%
PvT_9/9gate	6	100%
PvT_9/9proxygate	7	100%
PvT_bulldog	9	100%
PvT_dtdrop	7	100%
PvT_nzcore	7	100%
PvT_proxydt	7	100%
PvT_stove	9	100%
PvT_zcore	6	100%
PvT_zzcore	6	100%
14 openings	100	100%

enemy	games	wins
T_1fac	4	100%
T_2fac	12	100%
T_fastexpand	24	100%
T_unknown	60	100%
4 openings	100	100%

#13 bunkerboxer

opening	games	wins
PvT_10/12gate	7	100%
PvT_10/15gate	7	100%
PvT_12nexus	7	100%
PvT_1gatedtexpo	7	100%
PvT_2gatedt	7	100%
PvT_9/9gate	6	100%
PvT_9/9proxygate	7	100%
PvT_bulldog	8	100%
PvT_dtdrop	7	100%
PvT_nzcore	6	100%
PvT_proxydt	8	100%
PvT_stove	8	100%
PvT_zcore	7	100%
PvT_zzcore	8	100%
14 openings	100	100%

enemy	games	wins
T_unknown	100	100%
1 openings	100	100%

BananaBrain apparently does not have a bunker rush recognizer.