archive by month
Skip to content

AIIDE 2016 - upsets by map

I counted the number of upsets in the crosstables for each map. An upset is when a lower-ranked player defeats one who ends up at a higher rank. On any given map you should expect more upsets than in the tournament overall, which sums over the maps and smooths out differences.

mapupsets
(2)Benzene32
(2)Destination16
(2)HeartbreakRidge36
(3)Aztec29
(3)TauCross33
(4)Andromeda32
(4)CircuitBreaker37
(4)EmpireoftheSun33
(4)Fortress36
(4)Python30

I expected that more standard maps would have fewer upsets, but it didn’t turn out that way. Less standard Heartbreak Ridge is the only map to tease me by acting as if I understood. I’m a little surprised by the high upset counts on standard Circuit Breaker and Fortress and the lower counts on less standard Aztec. I am very surprised that Destination has half the upsets of other maps! I can’t explain it.

Another theory you might try is: Many bots are tuned on SSCAIT, so SSCAIT maps might show more solid play and fewer upsets. The non-SSCAIT maps here are only Aztec and Fortress, which don’t support the hypothesis (though there’s little evidence either way).

I thought that per-map upset rate might be a way to measure the strategic fragility of bots on maps that they weren’t tuned for, but it may also measure code fragility. Look into yesterday’s crosstables to see where today’s totals came from. Heartbreak Ridge has many upsets not so much because bots weren’t ready for it, but because NUSBot wasn’t ready for it. LetaBot could not defend Fortress. Only Circuit Breaker’s high rate of upsets was not helped along by any single bot—and Circuit Breaker is as standard as they come. So we’re not learning as much about map characteristics as about bot characteristics.

Can anybody explain why Destination was more stable and less upset-prone than other maps? That’s the big mystery here.

Trackbacks

No Trackbacks

Comments

Jay Scott on :

On 10 October I posted about how bots like maps, and found that Destination has the highest level of “controversy”, meaning that some bots do well on it and others poorly. And yet here Destination comes out with a much lower level of upsets. On the face of it the findings are contradictory. What’s going on?

krasi0 on :

It might be due the fact that Destination is just one of the maps that most people already have in their map pack (an old version of it comes along with Broodwar itself, so no extra downloads are necessary to start coding), it's a "simple" 2 player map and also on top of the list when you open the game's built-in map browser. So it's likely to be the map where most test games have been run locally before submitting to a tournament? Additionally, the mineral block doesn't seem to negatively affect lots of bots (from what I have observed so far) despite being a major PITA in the case of the built-in AI and that should confirm the above hypothesis.

Add Comment

E-Mail addresses will not be displayed and will only be used for E-Mail notifications.

To prevent automated Bots from commentspamming, please enter the string you see in the image below in the appropriate input box. Your comment will only be submitted if the strings match. Please ensure that your browser supports and accepts cookies, or your comment cannot be verified correctly.
CAPTCHA

Form options

Submitted comments will be subject to moderation before being displayed.