Skip to content

Code used for simulations in DSAA'16 submission

Notifications You must be signed in to change notification settings

pareshnakhe/Exp3.T

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Exp3.T

This project contains the code used for the simulations in the paper: "Trend Detection based Regret Minimization for Bandit Problems" - Nakhe and Reiffenhäuser.

The code essentially implements four algorithms, namely

  1. Standard Exp3
  2. Exp3.S
  3. Exp3.R
  4. Exp3D (algorithm proposed in the paper).

The performance of these algorithms is compared for two different reward models, namely a. dynamic stochastic regime b. adverserial regime with gap

These models represent a generalization of the conventional models.

About

Code used for simulations in DSAA'16 submission

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages