Changeset 12876 for branches/HeuristicLab.Problems.GrammaticalOptimization-gkr/HeuristicLab.Algorithms.Bandits/Policies/ChernoffIntervalEstimationPolicy.cs
- Timestamp:
- 08/17/15 19:13:19 (8 years ago)
- File:
-
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
branches/HeuristicLab.Problems.GrammaticalOptimization-gkr/HeuristicLab.Algorithms.Bandits/Policies/ChernoffIntervalEstimationPolicy.cs
r11806 r12876 35 35 var avgReward = aInfo.SumReward / aInfo.Tries; 36 36 37 // page 5 of "A simple distribution-free appr aoch to the max k-armed bandit problem"37 // page 5 of "A simple distribution-free approach to the max k-armed bandit problem" 38 38 // var alpha = Math.Log(2 * totalTries * k / delta); 39 39 double alpha = Math.Log(2.0) + Math.Log(totalTries) + Math.Log(k) - Math.Log(delta);
Note: See TracChangeset
for help on using the changeset viewer.