FIBS Board backgammon forum

Backgammon => Using backgammon software => Topic started by: blitzxz on July 06, 2008, 02:58:37 AM

Title: Bots are perfect
Post by: blitzxz on July 06, 2008, 02:58:37 AM
There was some discussion in other topic that bots can't play massive backgames. Someone was asking proof so here it is. The most incredible mistake I have ever seen gnubg to make.

Money session I'm on roll (black) and have a triple shot. This looks like a redouble... but gnu disagrees. It beavers me! And the cube is skyrocketing...

Could someone try the position to snowie too?

2-ply evaluation

Cube analysis
2-ply cubeless equity  +0,098
  0,613 0,106 0,002 - 0,387 0,214 0,022
Cubeful equities:
1. No double            +0,318
2. Double, pass         +1,000  ( +0,682)
3. Double, take         -0,067  ( -0,385)
Proper cube action: No redouble, beaver (36,1%)

2-ply mini rollout

Cube analysis
Rollout cubeless equity  +0,431

Cubeful equities:
1. Double, take         +0,875
2. Double, pass         +1,000  ( +0,125)
3. No double            +0,853  ( -0,022)
Proper cube action: Redouble, take
Rollout details:
Player You owns 2-cube:
  0,711 0,159 0,009 - 0,289 0,144 0,015 CL  +0,431 CF  +0,853
[0,011 0,008 0,005 - 0,011 0,009 0,003 CL   0,030 CF   0,051]
Player gnubg owns 4-cube:
  0,747 0,177 0,021 - 0,253 0,125 0,015 CL  +1,103 CF  +0,875
[0,011 0,011 0,006 - 0,011 0,008 0,004 CL   0,060 CF   0,090]
Full cubeful rollout with var.redn.
100 games, Mersenne Twister dice gen. with seed 828094824 and quasi-random dice
Play:  2-ply cubeful prune [world class]
keep the first 0 0-ply moves and up to 8 more moves within equity 0,12
Skip pruning for 1-ply moves.
Cube: 2-ply cubeful prune [world class]
Title: Re: Bots are perfect
Post by: inim on July 06, 2008, 11:48:12 PM
Some remarks in no particular order.

Title: Re: Bots are perfect
Post by: FrankBerger on July 08, 2008, 06:07:23 PM
Quote from: blitzxz on July 06, 2008, 02:58:37 AM
There was some discussion in other topic that bots can't play massive backgames. Someone was asking proof so here it is. The most incredible mistake I have ever seen gnubg to make.

Well, some people really regards bot as perfect, especially two of them but this is obviously not the case.

Your position shows that GnuBG has problems with extreme Backgammon. This is well known fact (at least to some people). If you dig very deep you can find a backgame on r.g.b. were GnuBG plays close to random (it is a little work to find it, IIRC it was from "bucko" posted and around 2004) and I guess the most incredible mistake is probably in that game.

What I dislike about your posting is the connotation "bots = GnuBG" and the term "proof". You've shown that in one particular position one bot misplayed it. Although I agree, that this is not an accident due to a broken installation, but a systematic error, one should remind that
At least by my standards, this is not a proof.

ciao
Frank
Title: Re: Bots are perfect
Post by: blitzxz on July 09, 2008, 05:17:00 AM
You're right. I shouldn't have said bots but gnubg.  I tried this position and several others with snowie4 and it was playing clearly better then gnubg. How would bgblitz perform in massive backgame positions? That would be also very interesting test. But this position is certainly a proof. It's a proof that gnubg is not perfect. You only need one error to prove that. But everybody knew that already, right? Maybe, but sometimes I have a feeling that everybody don't remember it. And every time I have said that gnubg has problems with these kind of positions there is at least dozen of people saying that this not the case. Although I agree that it is a well known fact among the developers at least (so I have no reason to send it gnubg mailing list).
Title: Re: Bots are perfect
Post by: FrankBerger on July 09, 2008, 10:45:49 AM
I havn't tested it systematically (lack of time), but BGBlitz seems to behave pretty robust in unusual positions. In this position BGB evaluates as D/T. The rollout deviates, but to a much lesser extent.

And I absolutely agree with you, that bots evaluation are often taken as granted, ignoring that we don't have yet the perfect bot. Each bot has it's weaknesses and sometimes fails in astonishingly simple position (e.g. recently a simply bearoff at r.g.b where S4 (and BGB too   :cry: ) fails.) There is still way to go and anyone really serious should cross check.

Unfortunately the developer scene is much less vibrant than in other games. I have not really an idea why this is, I personally find it a real challenge, I only regret that I have only my spare time for BGB. But compare it with other brain games.... how much AI's do we have? Compare this with Chess, Shogi, Go ...... It might be that people loving BG are more money oriented than in other games, and from the material point of view it's depressing ;) Or it might be that, different as in chess, there is no established standard program to host a bar bone AI (like Winboard, Fritz(?) and other). BGBlitz is able to host different AI's, but obviously no one had taken the opportunity. Developing an AI is much, much, much less work than coming up with a full blown BG program. The latter is a huge amount of work (at least 5-10 man years)  so that might explain, why no new contender come up, but programming an barbone AIs is a matter of month....

ciao
Frank