News:

VIP Donor members can now edit photo galleries and utilise an extended Custom Profile with buddy lists. Join them - for the price of a beer!

Main Menu

How to play a massive backgame?

Started by blitzxz, June 29, 2008, 12:27:47 PM

Previous topic - Next topic

blitzxz

I played very interesting backgame against fellow fibster a while ago. I tried to analyze it but got nowhere. What I found out was that gnu 2-ply is propably playing almost half to moves wrong after the massive 3-point backgame is achieved. This was according to very quick 0-ply rollouts. I tried to make 2-ply rollouts but just one rollout will take several days and I wouldn't still trust it.

Here is 5 positions from the game. What is correct way to handle these positions?

Score is also little bit tricky: 4-2 crawford game black leading. So no cube and gammons doesn't matter. But backgammons does.

I also attach the whole game in here.

blitzxz

Couple options here. Gnu wants to hit here which I wouldn't do.

Pipcount: White 147 Black 217

blitzxz

#2
This is very hard choice at least for me. Should black hit or not? And if hit what with the 4?

Pipcount: White 140 Black 217

blitzxz

Again gnu is hitting here.

Pip count: White 156 Black 208

blitzxz

This is a little bit awkward roll.

Pip count: White 148 Black 225

blitzxz

Hit, hit, hit, says the bot.

Pip count: White 148 Black 213

PersianLord

That's true, dear friend, gnubg is often helpless in evaluating massive backgames. A while ago, I played a historical 1-pt with yeti the great and when analyzing it I found that gnubg is totally unable to evaluate it. here is the link:

http://groups.google.com/group/rec.games.backgammon/browse_thread/thread/be1aab7e9bff99da/9ed23c7a02e2ed35#9ed23c7a02e2ed35
The leftist's feelings of inferiority run so deep that he cannot tolerate any classification of some things as successful or superior and other things as failed or inferior. This also underlies the rejection by many leftists of the concept of mental illness and of the utility of IQ tests.  - T.K

inim

#7
Quote from: PersianLord on June 29, 2008, 02:57:17 PM
gnubg is often helpless in evaluating massive backgames

Why do you think that, can you give statistically relevant data? How often exactly is "often"? Can you give concrete examples of positions where gnubg goes wrong? Can you give examples for where it is right, as "often" doesn't mean "always" ...

Quote from: PersianLord on June 29, 2008, 02:57:17 PM
I found that gnubg is totally unable to evaluate it

How did you find that? What was your method or experiment? If gnu was "wrong", what reference was used as "right"? Is there such a thing as a "right" move at all?

BTW, the sum of the equities of both players is never zero (in the r.b.g. article you assume that), as equities are always >= 0. Dependeing of the type of equity used (normalized or unnormalized) both actually may not even have any upper limit, assuming you play for an unlimited pot in a money game.
This space is available for rent by advertisers. Call 0900-INIMITE today, and see your sales skyrocketing in no time! New customers receive free Vl@9rå and a penis enlargement set as a bonus! We support banners, flash banners, and scrollers. Discrete handling by our HQ on the Dutch Antilles.

playBunny

Quote from: inim on June 29, 2008, 10:29:15 PM
BTW, the sum of the equities of both players is never zero (in the r.b.g. article you assume that), as equities are always >= 0.

1-point match, Opening roll 5-2

    1. Cubeful 3-ply    24/22 13/8                   Eq.:  -0.009
        49.6%  13.3%   0.4% -  50.4%  12.9%   0.4%

Equities are always zero or positive except, of course, when they are negative. ;)

What PersianLord is referring to is that generally, barring big swings, the equity of one player is the opposite sign of the opponent's from one move to the next. One's good fortune is the other's downfall. When analysing certain backgames, however, GnuBg can be seen to report that whoever is on roll is a huge favourite. Both players have positive equity and the highest wins percentage. It seems that all it takes is owning the dice. Or the opposite, that both players are losing very badly from one roll to the next, with minor changes in the position. That suggests more that GnuBg doesn't have a clue than that it knows something that we don't. The example in the newsgroup shows that, +.9 for each player.

What's happening is that Gnubg is trying to map the given position, which it's never seen (or perhaps only a couple of times, very early on in its training when it didn't know how to play), to the nearest reference position - and failing miserably. The more extreme the backgame the more likely it is that there's no reference position in sight.

PersianLord

Quote from: playBunny on June 30, 2008, 06:31:56 AM
1-point match, Opening roll 5-2

    1. Cubeful 3-ply    24/22 13/8                   Eq.:  -0.009
        49.6%  13.3%   0.4% -  50.4%  12.9%   0.4%

Equities are always zero or positive except, of course, when they are negative. ;)

What PersianLord is referring to is that generally, barring big swings, the equity of one player is the opposite sign of the opponent's from one move to the next. One's good fortune is the other's downfall. When analysing certain backgames, however, GnuBg can be seen to report that whoever is on roll is a huge favourite. Both players have positive equity and the highest wins percentage. It seems that all it takes is owning the dice. Or the opposite, that both players are losing very badly from one roll to the next, with minor changes in the position. That suggests more that GnuBg doesn't have a clue than that it knows something that we don't. The example in the newsgroup shows that, +.9 for each player.

What's happening is that Gnubg is trying to map the given position, which it's never seen (or perhaps only a couple of times, very early on in its training when it didn't know how to play), to the nearest reference position - and failing miserably. The more extreme the backgame the more likely it is that there's no reference position in sight.

Thank you very much dear bunny for educating and enlightening inim.  :thumbsup: :thumbsup:
The leftist's feelings of inferiority run so deep that he cannot tolerate any classification of some things as successful or superior and other things as failed or inferior. This also underlies the rejection by many leftists of the concept of mental illness and of the utility of IQ tests.  - T.K

dorbel

PersianLord has still not replied to, or even addressed the valid questions raised by inim. let me repeat these questions. Why does PersianLord think that "gnubg is often helpless in evaluating massive backgames"? How did he find "that gnubg was totally unable to evaluate it"?
Further, positions where the advantage lies with the player on roll are quite commonplace and certainly not confined to backgames.

PersianLord

Quote from: dorbel on June 30, 2008, 12:59:52 PM
PersianLord has still not replied to, or even addressed the valid questions raised by inim. let me repeat these questions. Why does PersianLord think that "gnubg is often helpless in evaluating massive backgames"? How did he find "that gnubg was totally unable to evaluate it"?
Further, positions where the advantage lies with the player on roll are quite commonplace and certainly not confined to backgames.

The question is how many positions are statistically enough to support my claim? I can play yeti and can show you at least 100 positions in which gnubg attributes to both sides ~+1 equities, which is mathematically wrong. If this may be of help, I'm ready to do it.

The leftist's feelings of inferiority run so deep that he cannot tolerate any classification of some things as successful or superior and other things as failed or inferior. This also underlies the rejection by many leftists of the concept of mental illness and of the utility of IQ tests.  - T.K

inim

#12
Quote from: playBunny on June 30, 2008, 06:31:56 AM
1-point match, Opening roll 5-2

    1. Cubeful 3-ply    24/22 13/8                   Eq.:  -0.009
        49.6%  13.3%   0.4% -  50.4%  12.9%   0.4%

Equities are always zero or positive except, of course, when they are negative. ;)

The -0.009 is just the difference to the equity you had before you rolled 5-2, mulitplied with a scale dependent constant. It basically says that 5-2 is so bad an opening roll that it actually reduces your chances to win.

This becomes more obvious when one transforms gnu's EMG equity into the equivalent (and for many more intuitive) form of MWC, match winning chanches. In this 1pt match, the MWC would have changed from ~50% to ~49.996%. Obviously you can not have a chance less than 0% to win the match -- it means you already lost it.

Depending on the value range you project (normalize) equity into you can epress the MWC %ages an positive or negative values at your liking. You may normalize 0..1, or -1 .. 1, or even π .. 2π.

I stand corrected insofar that in Equivalent-to-Money-Game equity normalization, or EMG as gnubg calls it, indeed normalizes to -1 .. 1, so 0 is the neutral position (before the match starts) and minimum equity is indeed -1. Personally I prefer the normalization 0..1, which more closely resembles the MWC notation.

This space is available for rent by advertisers. Call 0900-INIMITE today, and see your sales skyrocketing in no time! New customers receive free Vl@9rå and a penis enlargement set as a bonus! We support banners, flash banners, and scrollers. Discrete handling by our HQ on the Dutch Antilles.

playBunny

Quote from: inim on June 30, 2008, 06:27:40 PM
The -0.009 is just the difference to the equity you had before you rolled 5-2, mulitplied with a scale dependent constant. It basically says that 5-2 is so bad an opening roll that it actually reduces your chances to win.

My understanding of -0.009 is that if you play out a sufficiently large number of games from that position (your GnuBgesque play against GnubG) and you've got money on every game then you will walk away with less money than you arrived. That's negative equity in real world terms. I've never used MWC much and it's almost meaningless to me.

QuoteThe -0.009 is just the difference to the equity you had before you rolled 5-2,

We differ in our understanding of this. The -0.009 is the absolute equity of the position after the move, regardless of what went before. If that position had been reached with different dice then it would still have the same equity. Evaluations do not know, nor do they care to know, the history of the position.

inim

#14
Quote from: playBunny on June 30, 2008, 08:11:02 PM
My understanding of -0.009 is that if you play out a sufficiently large number of games from that position  [...] you will walk away with less money than you arrived. That's negative equity in real world terms. I've never used MWC much and it's almost meaningless to me.

Both EMG and MWC normalizations have their merits, and both are options for display in gnubg.

Your statement is correct, you long term will lose money. But as the amount is scaled over a range of 2, it doesn't of course mean that would be 0.009 monetary units in a real money game. It is more like a % of the amount of money in your average pot after considering cubes, gammons etc. Nevertheless, EMG allows quick assesment for overall win/lose situation.

IMO, the notation of MWC is superior if you e.g. look at cube market windows, which very naturally are expressed as MWC. As I do that more often than playing for money, it's my prefered form.

Quote from: playBunny on June 30, 2008, 08:11:02 PM
We differ in our understanding of this. The -0.009 is the absolute equity of the position after the move, regardless of what went before. If that position had been reached with different dice then it would still have the same equity. Evaluations do not know, nor do they care to know, the history of the position.

I'm unsure whether the figure gnubg actually displays in this slot is a delta or an absolute, need to check that. It for sure displays deltas somewhere in the ranked list of moves in some position, may have mixed that up. Several derived values, such as gnubg's definition of of luck, use equity deltas in their formulas as well (as thresholds), because large deltas correspond to large positional swings.

The sweet thing in the special case of the first roll is: the delta to the previous score of 0.000, and the absoulute new EMG equity are the same number, here -0.009. So we are both right for THIS roll, at least :)
This space is available for rent by advertisers. Call 0900-INIMITE today, and see your sales skyrocketing in no time! New customers receive free Vl@9rå and a penis enlargement set as a bonus! We support banners, flash banners, and scrollers. Discrete handling by our HQ on the Dutch Antilles.

playBunny

Quote from: inim on June 30, 2008, 08:34:28 PM
IMO, the notation of MWC is superior if you e.g. look at cube market windows, which very naturally are expressed as MWC. As I do that more often than playing for money, it's my prefered form.

As the cube is where I'm weakest and I've only recently started exploring the market window dialogue, without that much comprehension, unfortunately, I'll have to give that a go. Thanks for the tip. :)

QuoteI'm unsure whether the figure gnubg actually displays in this slot is a delta or an absolute, need to check that. It for sure displays deltas somewhere in the ranked list of moves in some position, may have mixed that up.
Yes, the first move is absolute and subsequent moves show their absolute equity plus the delta.

QuoteThe sweet thing in the special case of the first roll is: the delta to the previous score of 0.000, and the absoulute new EMG equity are the same number, here -0.009. So we are both right for THIS roll, at least :)

Yep! :cool:

blitzxz

No one has commented the positions yet.....

playBunny

QuoteFurther, positions where the advantage lies with the player on roll are quite commonplace and certainly not confined to backgames

I agree with you that there are positions in other phases of the game where the equity is positive for whoever's one roll but how frequent is "commonplace"? Of those positions how many are situations, ie. not confined to just a move or two but for a whole period? And how often, too, is the change on the order of +90% to -90%. Wuold you seriously suggest that the player attempting to contain a checker has 90% chances just because they're on roll but on the next roll those chances drop to 10%, or some other such massive swing?

I think the best answer comes from GnuBg itself. You'll see some totally daft moves in some backgames. I had one against it a while back where it made a stacke of 5 or 6 checkers. It looked like a beginner playing "No blots, please". There's no way that I could explain that in terms of it attempting to contain my loose blot but it did make a kind of sense if it was misrecognising the position and getting its neurals in a twist.

inim

#18
Quote from: playBunny on July 05, 2008, 04:32:43 PM
I think the best answer comes from GnuBg itself. [...] It looked like a beginner playing "No blots, please". There's no way that I could explain that in terms of it attempting to contain my loose blot but it did make a kind of sense if it was misrecognising the position and getting its neurals in a twist.

The best answer comes from Joseph, who trained the initial gnubg net, and it hasn't changed a lot since:  http://pages.quicksilver.net.nz/pepe/ngb/index.html. He was completely aware of the issue with containment, and he experiented with a dedicated NN for such positions (see chapter "The Crashed Net"). However, he found that when containment improved other positions degraded in the net. The crashed net is still in the gnubg sources, but not activated.

So this is really an open research issue, and the the gnubg team would surely be happy to hear about any approach to solve this problem. You can contribute even if you are not a pro coder, the tools to train and verify nets are useable with slightly advanced PC skills already. Some math skills required, however.
This space is available for rent by advertisers. Call 0900-INIMITE today, and see your sales skyrocketing in no time! New customers receive free Vl@9rå and a penis enlargement set as a bonus! We support banners, flash banners, and scrollers. Discrete handling by our HQ on the Dutch Antilles.

playBunny

I actually would like to learn how to train the net and have already studied the code but logical I am, mathematical I'm not. Time is also an issue. There are plenty of neural network texts that delight in explaining the maths but I've not come across any that give the idiot's guide to impementing them. I wouldn't even know where to start. I'd love to create a neural net to play other games. Ludo would be a nice small project, for instance. If I knew enough about it then extending GnuBg to play other gammon variants would be an interesting future project.