source: branches/HeuristicLab.Problems.GrammaticalOptimization-gkr/HeuristicLab.Algorithms.Bandits/Policies @ 13264
Name | Size | Rev | Age | Author | Last Change |
---|---|---|---|---|---|
UCTPolicy.cs | 1.6 KB | 11981 | 10 years | gkronber | #2283: cleanup and included HeuristicLab.dlls to create a … |
UCBPolicy.cs | 1.6 KB | 12876 | 9 years | gkronber | #2283: implemented first crude version of extreme hunter algorithm in … |
UCBNormalPolicy.cs | 1.6 KB | 11832 | 10 years | gkronber | linear value function approximation and good results for poly-10 benchmark |
UCB1TunedPolicy.cs | 2.0 KB | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |
UCB1Policy.cs | 1.7 KB | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |
ThresholdAscentPolicy.cs | 4.7 KB | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |
SingleArmPolicy.cs | 755 bytes | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |
RandomPolicy.cs | 715 bytes | 11806 | 10 years | gkronber | #2283: separated value-states from done-states in GenericGrammarPolicy … |
ModifiedUCTPolicy.cs | 1.7 KB | 11806 | 10 years | gkronber | #2283: separated value-states from done-states in GenericGrammarPolicy … |
IntervalEstimationPolicy.cs | 1.6 KB | 12876 | 9 years | gkronber | #2283: implemented first crude version of extreme hunter algorithm in … |
GenericThompsonSamplingPolicy.cs | 1.3 KB | 11974 | 10 years | gkronber | #2283: eurocast experiments |
GaussianThompsonSamplingPolicy.cs | 3.1 KB | 11974 | 10 years | gkronber | #2283: eurocast experiments |
ExtremeHunterPolicy.cs | 3.5 KB | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |
EpsGreedyPolicy.cs | 2.0 KB | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |
ChernoffIntervalEstimationPolicy.cs | 2.3 KB | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |
BoltzmannExplorationWithCoolingPolicy.cs | 2.3 KB | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |
BoltzmannExplorationPolicy.cs | 2.4 KB | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |
BernoulliThompsonSamplingPolicy.cs | 1.2 KB | 11832 | 10 years | gkronber | linear value function approximation and good results for poly-10 benchmark |
ActiveLearningPolicy.cs | 1.8 KB | 12893 | 9 years | gkronber | #2283: experiments on grammatical optimization algorithms (maxreward … |