Hi, my name is Abdeslam Boularias. I am a Postdoctoral Fellow at the Robotics Institute of Carnegie Mellon University. I work with
Drew Bagnell and Anthony Stentz. Previously, I was a research scientist at the Max Planck Institute for Intelligent Systems in Tübingen. I worked with Jan Peters, in the Empirical Inference department, which was directed by Bernhard Schölkopf. From January 2006 to July 2010, I was a PhD student at Laval University under the supervision of Brahim Chaib-draa. My thesis focused on planning under uncertainty, reinforcement learning, imitation learning, and multi-agent systems. 
My current research interests focus on machine learning techniques for robotics. Here is a link to my CV, my contact coordinates, and some (old) news

Referred Journal and Conference Papers
  • Abdeslam Boularias, J. Andrew Bagnell and Anthony Stentz (2014). "Efficient Optimization for Autonomous Robotic Manipulation of Natural Objects".
    In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI), Quebec City, Quebec, Canada, 2014.
    [PDF][video 1][video 2][unedited videos of all the experiments]
    [BibTex]
  • Claudio Persello, Abdeslam Boularias, Michele Dalponte, Terje Gobakken, Erik Naesset and Bernhard Schölkopf (2014). "Cost-Sensitive Active Learning With Lookahead: Optimizing Field Surveys for Remote Sensing Data Classification".
    In IEEE Transactions on Geoscience and Remote Sensing 99: 1-13, 2014.
    [PDF]
    [BibTex]
  • Katharina Mülling, Abdeslam Boularias, Betty Mohler, Bernhard Schölkopf and Jan Peters (2014). "Learning Strategies in Table Tennis using Inverse Reinforcement Learning".
    In Biological Cybernetics , 2014.
    [PDF]
    [BibTex]
  • Abdeslam Boularias and Brahim Chaib-draa (2013). "Apprenticeship Learning with Few Examples".
    In Neurocomputing , 2013.
    [PDF]
    [BibTex]
  • Abdeslam Boularias, Oliver Kroemer and Jan Peters (2012). "Algorithms for Learning Markov Field Policies".
    In Advances in Neural Information Processing Systems 26 (NIPS), Lake Tahoe, NV, USA, 2012.
    [PDF][poster (PDF)]
    [BibTex]
  • Samory Kpotufe and Abdeslam Boularias (2012). "Gradient Weights help Nonparametric Regressors".
    In Advances in Neural Information Processing Systems 26 (NIPS), Lake Tahoe, NV, USA, 2012.
    [PDF][poster (PDF)]
    [BibTex]
  • Abdeslam Boularias, Oliver Kroemer and Jan Peters (2012). "Structured Apprenticeship Learning".
    In Proceedings of the European Conference on Machine Learning (ECML), Bristol, UK, 2012.
    [PDF]
    [BibTex]
  • Yu Nishiyama, Abdeslam Boularias, Arthur Gretton and Kenji Fukumizu (2012). "Hilbert Space Embeddings of POMDPs".
    In Uncertainty in Artificial Intelligence (UAI), Catalina, CA, USA, 2012.
    [PDF][poster (PDF)]
    [BibTex]
  • Abdeslam Boularias, Oliver Kroemer and Jan Peters (2011). "Learning Robot Grasping from 3-D Images with Markov Random Fields".
    In Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), San Francisco, CA, USA, 2011.
    [PDF][video]
    [BibTex]
  • Zhikun Wang, Abdeslam Boularias, Katharina Muelling and Jan Peters (2011). "Balancing Safety and Exploitability in Opponent Modeling".
    In Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI), San Francisco, CA, USA, 2011.
    [PDF]
    [BibTex]
  • Abdeslam Boularias, Jens Kober and Jan Peters (2011). "Relative Entropy Inverse Reinforcement Learning".
    In Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS), Fort Lauderdale, FL, USA, 2011. Volume 15 of JMLR: W&CP 15.
    [PDF][poster (PDF)]
    [BibTex]
  • Abdeslam Boularias and Brahim Chaib-draa (2010). "Bootstrapping Apprenticeship Learning".
    In Advances in Neural Information Processing Systems 24 (NIPS), Vancouver, Canada, 2010.
    [PDF][poster (PDF)]
    [BibTex]
  • Abdeslam Boularias and Brahim Chaib-draa (2010). "Apprenticeship Learning via Soft Local Homomorphisms".
    In Proceedings of 2010 IEEE International Conference on Robotics and Automation (ICRA), Anchorage, USA, 2010.
    [PDF]
    [BibTex]
  • Abdeslam Boularias and Brahim Chaib-draa (2009). "Predictive Representations for Policy Gradient in POMDPs".
    In Proceedings of the Twenty-sixth International Conference on Machine Learning (ICML), Montreal, Canada, 2009.
    [PDF][poster (PDF)]
    [BibTex]
  • Abdeslam Boularias and Brahim Chaib-draa (2008). "Exact Dynamic Programming for Decentralized POMDPs with Lossless Policy Compression".
    In Proceedings of the International Conference on Automated Planning and Scheduling (ICAPS), Sydney, Australia, 2008.
    [PDF][poster (PDF)]
    [BibTex]
  • Abdeslam Boularias, Masoumeh Izadi and Brahim Chaib-draa (2008). "Prediction-directed Compression of POMDPs".
    In Proceedings of the International Conference on Machine Learning and Applications (ICMLA), San Diego, CA, USA, 2008.
    [PDF]
    [BibTex]
  • Abdeslam Boularias (2008). " A Predictive Model for Imitation Learning in Partially Observable Environments".
    In Proceedings of the International Conference on Machine Learning and Applications (ICMLA), San Diego, CA, USA, 2008.
    [PDF]
    [BibTex]
  • Abdeslam Boularias, Masoumeh Izadi and Brahim Chaib-draa (2008). "State Space Compression with Predictive Representations".
    In Proceedings of 21st International Florida Artificial Intelligence Research Society Conference (FLAIRS), Coconut Grove, FL, USA, 2008.
    [PDF]
    [BibTex]
  • Andriy Burkov, Abdeslam Boularias and Brahim Chaib-draa (2007). "Competition and Coordination in Stochastic Games".
    In Proceedings of the 2007 Twentieth Canadian Conference on Artificial Intelligence (Canadian AI), Montreal, Canada, May 28-30, 2007.
    [PDF]
    [BibTex]

Referred Workshop Papers

  • Katharina Mülling, Abdeslam Boularias, Betty Mohler, Bernhard Schölkopf and Jan Peters (2013). "Inverse Reinforcement Learning for Strategy Extraction".
    In Proceedings of ECML'13 Workshop on Machine Learning and Data Mining for Sports Analytics, Prague, Czech Republic, 2013.
    [PDF]
  • Yu Nishiyama, Abdeslam Boularias, Arthur Gretton and Kenji Fukumizu (2012). "Kernel Bellman Equations in POMDPs".
    In Proceedings of the Technical Committee on Infomation-Based Induction Sciences and Machine Learning (IBISML), Tokyo, Japan, 2012.
  • Abdeslam Boularias, Oliver Kroemer and Jan Peters (2012). "Structured Apprenticeship Learning".
    In The 10th European Workshop on Reinforcement Learning (EWRL), Edinburgh, UK, 2012.
    [ poster (PDF)] [ slides (PDF)]
  • Abdeslam Boularias, Hamid R. Chinaei and Brahim Chaib-draa (2010). "Learning the Reward Model of Dialogue POMDPs".
    In NIPS'10 Workshop on Machine Learning for Assistive Technology (MLAT) , Whistler, Canada, 2010.
    [PDF]
  • Abdeslam Boularias and Brahim Chaib-draa (2009). "Policy Transfer in Apprenticeship Learning".
    In NIPS'09 Workshop on Transfer Learning for Structured Data , Whistler, Canada, 2009.
    [poster (PDF)]
  • Abdeslam Boularias and Brahim Chaib-draa (2009). "Learning Probabilistic Models via Bayesian Inverse Planning".
    In NIPS'09 Workshop on Probabilistic Approaches for Robotics and Control , Whistler, Canada, 2009.
    [poster (PDF)]
  • Abdeslam Boularias and Brahim Chaib-draa (2008). "Planning in Decentralized POMDPs with Predictive Policy Representations".
    In Proceedings of ICAPS'08 Multiagent Planning Workshop (MASPLAN), Sydney, Australia, 2008.
    [PDF]
  • Abdeslam Boularias and Brahim Chaib-draa (2007). "Les Représentations Prédictives des États et des Politiques".
    In Actes des Quatrièmes Journées Francophones Modèles Formels de l'Interaction (MFI), Paris, France, May 30-June 1, 2007.
    [PDF]

Thesis

  • Abdeslam Boularias. "Predictive Representations For Sequential Decision Making Under Uncertainty". PhD Thesis. July 2010.
    [PDF] [ slides (PDF)]