By Odalric-Ambrym Maillard. In Proceedings of the 25th conference on advances in Neural Information Processing Systems, NIPS '12, 2012. Abstract: This paper aims to take a step forwards making the term ''intrinsic motivation'' from reinforcement learning...
This page is dedicated to start discussions about the article "Compressed Least Squares Regression". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract: "We consider the problem...
This page is dedicated to start discussions about the article "Finite-Sample Analysis of Bellman Residual Minimization". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract: "We...
This page is dedicated to start discussions about the article "Selecting the State-Representation in Reinforcement Learning". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract:...
By Odalric-Ambrym Maillard and Rémi Munos, In Journal of Machine Learning Research 2012, vol:13, pp:2735-2772. Abstract: We investigate a method for regression that makes use of a randomly generated subspace G_P (of finite dimension P) of a given large...
Phuong Nguyen, Odalric-Ambrym Maillard, Daniil Ryabko,Ronald Ortner. In International Conference on Artificial Intelligence and Statistics, 2013. Abstract: We consider a reinforcement learning setting where the learner also has to deal with the problem...
Odalric-Ambrym Maillard In Algorithmic Learning Theory, 2013. Abstract: We study a variant of the standard stochastic multi-armed bandit problem when one is not interested in the arm with the best mean, but instead in the arm maximizing some coherent...
Rémi Bardenet, Odalric-Ambrym Maillard. In Bernoulli Journal, 2014. Abstract: Concentration inequalities quantify the deviation of a random variable from a fixed value. In spite of numerous applications, such as opinion surveys or ecological counting...
By Alexandra Carpentier and Odalric-Ambrym Maillard. In Proceedings of the 25th conference on advances in Neural Information Processing Systems, NIPS '12, 2012. Abstract: In the setting of active learning for the multi-armed bandit, where the goal of...
This page is dedicated to start discussions about the article "LSTD with Random Projections". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract: "We consider the problem of reinforcement...
Olivier Cappé, Aurélien Garivier, Odalric-Ambrym Maillard, Rémi Munos, Gilles Stoltz. In The Annals of Statistics, 2013. Abstract: We consider optimal sequential allocation in the context of the so-called stochastic multi-armed bandit model. We describe...
This page is dedicated to start discussions about the article "Scrambled Objects for Least-Squares Regression". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract: "We consider...
This page is dedicated to start discussions about the article "Complexity versus Agreement for Many Views". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract: "The paper considers...
Odalric-Ambrym Maillard, Shie Mannor In International Conference on Machine Learning, 2014. Abstract: We consider a multi-armed bandit problem where the reward distributions are indexed by two sets –one for arms, one for type– and can be partitioned into...
This page is dedicated to start discussions about the article "Adaptive bandits: Towards the best history-dependent strategy". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract:...
Odalric-Ambrym Maillard, Phuong Nguyen, Ronald Ortner, Daniil Ryabko. In Proceedings of the 30th international conference on machine learning, ICML 2013, 2013. Abstract: We consider an agent interacting with an environment in a single stream of actions,...
This page is dedicated to start discussions about the article "A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing...
This page is dedicated to start discussions about the article "Sparse Recovery with Brownian Sensing". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract: " We consider the problem...
This page is dedicated to start discussions about the article "Online Learning in Adversarial Lipschitz Environments". Feel free to post any comment, sugggestion, question, correction, extension... I will enjoy discussing this with you. Abstract: "We...
Blogs and videos:
Seminars:
Research centers:
Schools: