BWAPI bots ladder

The annual AIIDE, CIG, and SSCAIT tournaments give us snapshots of bot strength at particular moments in time. The SSCAIT ladder used to let us gauge strength at other times as well, but then voting was added, and voting distorts the ratings so that we only get a general idea. To add voting made sense, because it helps keep people engaged, and that (as I see it) is SSCAIT’s main goal. Still, we lost a useful ability.

To me, following Steamhammer’s results, the elo distortion caused by voting is unmissable. Since November or earlier, and except during the tournament when voting wasn’t allowed, every time Steamhammer’s rating rose to around 2200, the voters soon took interest and fed it a sequence of opponents that it would lose to, hammering the rating back down. Then voters lost interest and its rating gradually rose again. Watching this process helped me make my correct prediction that Steamhammer would finish in the SSCAIT tournament between places #4 to #8, even though the bot could not maintain such a high rank outside the tournament.

BWAPI bots ladder fills the gap. (It doesn’t seem to have an official name.) There is no voting or streaming; it only plays games. Since it doesn’t slow down games to make them watchable, it also plays more games. The larger amount of data should also improve the ratings (though it depends on the K factor in the elo calculation).

The UI is sparse. Presumably the project is in an early stage of development, just finished enough to make public. The web page doesn’t provide information about plans. I’ve sent a list of questions to the contact address.

A few points:

• The ladder seems to have been pre-populated with SSCAIT bots. It is a decision that can be questioned: The bot authors did not give permission for this public use. An author who doesn’t like it can write to the contact address to have their bot removed.

• The maps are the SSCAIT maps.

• Minimal public information. The bot names and game results and replays are made public; nothing else is revealed. It is an intentional choice. I don’t even see a way to find out the upload time of a bot, so that you can try to distinguish versions.

• Random bots are allowed, but the tournament manager chooses the race so that the opponent knows it when the game starts. This is different from playing random. It reflects the opinion that playing random gives an unfair advantage.

I personally disagree with the treatment of random players. I think that decisions about fair balance should be made on the basis of data, not argument, and that we don’t have the data. One issue is that a random bot is more difficult to create (no matter whether it is taught its knowledge by hand or by machine learning), which you could take as counterbalancing any advantage it might gain by playing random. Another issue is that a bot which wants to play against random opponents is not able to on this ladder. Of course, in the end the organizers are doing the work, and they get to make the decisions.

Overall, the appearance of a new ladder is a good sign of the health of the community. It fills a gap: It provides a better continuous measure of how well different bots are doing than we have had before. Those who like the design decisions may prefer it over other competitions, and those who don’t also gain by living in a richer world.

Trackbacks

No Trackbacks

Comments

krasi0 on Thursday, March 22. 2018:

Thank you for your interest and the informative blog post on the topic! :)

Your questions have been answered in the *new* FAQ section on the ladder website.

Marian on Thursday, March 22. 2018:

Good to know that krasi0's ladder is alive and well.
Although random is in general more difficult to play the advantage is pretty big.
Let's say you are a protoss playing vs random.
What opening do you use?
Forge nexus is really bad vs protoss and not very good vs terran.
2 gate is pretty bad vs terran who knows how to build his base and micro, it's also not very good on large distance maps.
Nexus first is in general risky and not good vs protoss in general.
So you are left with 1 gate tech, but there are also questions - do you go for quick zealot or core?
Random is actually forbidden in most tournaments and pro play, unless it's 2v2 where double zerg is not allowed.
However I agree that the level of bot play is not high enough to effectively abuse this advantage.

krasi0 on Thursday, March 22. 2018:

Yeah, there have been numerous discussions in the bot community (on Twitch, Discord and even Facebook) on why Random race should be forbidden in bot ladders / tournaments. So I decided not to discuss that topic once again :)
BTW, BW *is* already at that level. Look at tscmooR for example. Even though its openings are far from very good, many top bots (mine included) still lose when playing against it due to the initial minutes wasted not knowing how to proceed adequately.

PS you should make the community happy! People have been waiting for a KillerBot update for years now :P

Antiga / Iruian on Thursday, March 22. 2018:

I'm generally of the opinion also that random isn't healthy and that long run to be competitive every top bot would be random unless limited. Given the option, every pro player would play random over their best race which should be enough to understand how powerful it really is. 2v2 has been for years been random + Zerg as a duo hoping for double zerg and imo has been worse off for it. Likely if 2v2 is ever played again at the professional level random will be gone forever.

MicroDK on Friday, March 23. 2018:

I totally agree that playing random is an advantage and I really like the changed rules on the new ladder.

skar1ath on Friday, March 23. 2018:

Are there any plans to allow bots to select their race intelligently? For example, a bot that can play multiple races might prefer its TvP but its ZvT, or something like that. Then there is more benefit (and strategy) in making a multi-race bot, even if it can't play random and start as an unknown race to the opponent. The big issue I can see is that there would have to be some interface for race selection, and it would get ugly if bots could see and react to updates to the other bot's race. You could have race selection reveal the opponent's name but not their selected race for this game, and then the common case of single-race bots would be discoverable by learning. Of course, the idea is that you would be able to see the opponent's race selection as soon as the game actually started.

Antiga / Iruian on Friday, March 23. 2018:

What you are talking about is called "race picking" in the professional / amateur scene. It's been debated at length, but in reality it essentially comes down to a forced advantage and at this point is highly frowned on. Very few modern tournaments allow it and those that do are amateur only and without fail stir drama and conflict due to it.

MicroDK on Monday, April 2. 2018:

Only problem with the new ladder is that it deletes all spaces in enemy names, and when it download bots from SSCAIT where spaces in bot names are allowed, bots with hardcoded strategies vs. specific bots will not work anymore... bots need to updated to use bot names without spaces. Eg. Microwave is using two strategies vs Tomas Vajda and these strategies are not part of its normal strategy selection, so Microwave will loose a lot of games to it and this can be seen in the cross tables atm.

Add Comment

Name*

Homepage

Comment*

In reply to

E-Mail addresses will not be displayed and will only be used for E-Mail notifications.

To prevent automated Bots from commentspamming, please enter the string you see in the image below in the appropriate input box. Your comment will only be submitted if the strings match. Please ensure that your browser supports and accepts cookies, or your comment cannot be verified correctly.
CAPTCHA