Please, visit https://odalric-ambrymmaillard.github.io
Rémi Bardenet, Odalric-Ambrym Maillard. In Bernoulli Journal, 2014. Abstract: Concentration inequalities quantify the deviation of a random variable from a fixed value. In spite of numerous applications, such as opinion surveys or ecological counting...
Read moreOdalric-Ambrym Maillard, Shie Mannor In International Conference on Machine Learning, 2014. Abstract: We consider a multi-armed bandit problem where the reward distributions are indexed by two sets –one for arms, one for type– and can be partitioned into...
Read moreOdalric-Ambrym Maillard In Algorithmic Learning Theory, 2013. Abstract: We study a variant of the standard stochastic multi-armed bandit problem when one is not interested in the arm with the best mean, but instead in the arm maximizing some coherent...
Read moreOlivier Cappé, Aurélien Garivier, Odalric-Ambrym Maillard, Rémi Munos, Gilles Stoltz. In The Annals of Statistics, 2013. Abstract: We consider optimal sequential allocation in the context of the so-called stochastic multi-armed bandit model. We describe...
Read morePhuong Nguyen, Odalric-Ambrym Maillard, Daniil Ryabko,Ronald Ortner. In International Conference on Artificial Intelligence and Statistics, 2013. Abstract: We consider a reinforcement learning setting where the learner also has to deal with the problem...
Read moreOdalric-Ambrym Maillard, Phuong Nguyen, Ronald Ortner, Daniil Ryabko. In Proceedings of the 30th international conference on machine learning, ICML 2013, 2013. Abstract: We consider an agent interacting with an environment in a single stream of actions,...
Read more
By Odalric-Ambrym Maillard. In Proceedings of the 25th conference on advances in Neural Information Processing Systems, NIPS '12, 2012. Abstract: This paper aims to take a step forwards making the term ''intrinsic motivation'' from reinforcement learning...
Read moreBy Alexandra Carpentier and Odalric-Ambrym Maillard. In Proceedings of the 25th conference on advances in Neural Information Processing Systems, NIPS '12, 2012. Abstract: In the setting of active learning for the multi-armed bandit, where the goal of...
Read moreBy Odalric-Ambrym Maillard and Rémi Munos, In Journal of Machine Learning Research 2012, vol:13, pp:2735-2772. Abstract: We investigate a method for regression that makes use of a randomly generated subspace G_P (of finite dimension P) of a given large...
Read moreThis page is dedicated to start discussions about the article "Selecting the State-Representation in Reinforcement Learning". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract:...
Read more