Evaluations are often wrong by large emouts of equity. In complex situations
I'd never trust an evaluation, unless the answer is relatively obvious, even for my eyes. How reliable evaluations are depends on how familiar gnubg is with this type of situation. gnubg's neural net was trained in many, many sessions playing against himself (or ist it herself?). Situations that rarely happend during this training are therefore less well understood by gnubg.
Just a nice example: Try to play against gnubg with this starting position:

I never lost against gnubg yet with this starting position. If you later analyse gnubg's moves and cube decisions thoroughly, you will se how gnubg blundered his way through the match.
Therfore: The more uncommon a position is to gnubg, the more likely gnubg will not find the correct answer by simple evaluations.
Use a rollout!And yes, I support rubreg's statement: There is no best setting, it's a question of CPU time and the importance of a trustful result. I usually start with a few trials (324, 648), look how significant the difference between the decisions is and prolong the rollout (for the best candidates) until I get a significant output. For Nack Ballards and Paul Weavers
Rollout Project I conducted up to 31,104 trials for a single decision. Such a rollout takes one to two weeks

My preferred (and most trusted) settings are:
- no truncation!
- don't stop depending on STD
- don't stop depending on j.s.d
- I'd recommend cubeful rollouts
- use variance reduction
- use quasi random dice
- strength of the cube decisons: always "world class" (=2-ply)
For checker play I rarely use "expert" (=0-ply) for "quick and dirty" rollouts, when the position is easy to understand for gnubg, "world class" for all other rollouts.
I choose the amount of trials depending on the required precision, often starting with only 324 or 648, later 1296. If the decision is still very close, I usuallsy accept, that the difference is too small to say. (For opening moves I sometime do more trials, 2592+).
Hope that helped

Hardy
