Speaker: Subramanian Ramamoorthy
Speaker Affiliation: School of Informatics, The University of Edinburgh
Host: Lorenzo Rosasco, Armando Tacchella
Host Affiliation:DIBRIS, Universita' degli studi di Genova.
Date: 2015-06-25
Time: 14:30
Location: Conference Room 363bis, DIBRIS Valletta Puggia. Via Dodecaneso 35, Genova, IT.
Abstract
We are motivated by the problem of building interactively intelligent robots. One attribute of such an autonomous system is the ability to make predictions about the actions and intentions of other agents in a dynamic environment, and to adapt its own decisions accordingly. This kind of ability is especially important when robots are called upon to work closely together with human users and operators. I will begin my talk by briefly describing some robotic systems we have built that exhibit this ability. This includes mobile robots that can navigate in crowded spaces and humanoid robots that can cooperate with human co-workers. Underpinning such systems are a variety of algorithmic tools for behaviour prediction, categorization and decision-making. I will present three recent results from my group’s work in this area. Firstly, we will look at the problem of adaptation of an interface to a diverse population of users with varying levels of skill and other personal traits. I will outline a latent variable model and a Bayesian algorithm for selecting action sets that constitute a best response to the agent’s belief about the user profile. I will report on experiments with this model involving both simulated and human users, showing that our adaptive solution outperforms alternate static solutions and adaptive baselines such as EXP-3. Next, I will outline a model for ad hoc multi-agent interaction without prior coordination, which extends the above insights to an explicitly strategic setting. By conceptualizing the interaction as a stochastic Bayesian game, the choice problem is formulated in terms of types in an incomplete information game, allowing for a learning algorithm that combines the benefits of Harsanyi’s notion of types and Bellman’s notion of optimality in sequential decisions. These theoretical arguments will be supported by some preliminary results from experiments involving human-machine interaction, such as in prisoner’s dilemma, where we show a better rate of coordination than alternate multi-agent learning algorithms. Where do these behavioural types come from? One explanation is that decision processes admit categorization in terms of behavioural equivalence. I will conclude by discussing our current work on categorizing decision processes in terms of their behavioural equivalence, in the form of an algorithm for clustering Markov Decision Processes with a view to enabling transfer and policy reuse. This is a step towards answering the question of why we expect there to be compact libraries of types that are exploited by techniques such as those mentioned above.
Bio
Dr. Subramanian Ramamoorthy is a Reader (Associate Professor) in Robotics at the School of Informatics, University of Edinburgh, where he has been since 2007. He is the Coordinator of the EPSRC Robotarium Research Facility, and Executive Committee Member for the Centre for Doctoral Training in Robotics and Autonomous Systems. Previously, he received a PhD in Electrical and Computer Engineering from The University of Texas at Austin. He is an elected Member of the Young Academy of Scotland at the Royal Society of Edinburgh. His current research is focussed on problems of autonomous learning and decision-making under uncertainty, by long-lived agents and agent teams interacting within dynamic environments. This work is motivated by applications in autonomous robotics, human-robot interaction, intelligent interfaces and other autonomous agents in mixed human-machine environments. These problems are solved using a combination of methods involving layered representations based on geometric/topological abstractions, game theoretic and behavioural models of inter-dependent decision making, and machine learning with emphasis on issues of transfer, online and reinforcement learning. His work has been recognised by nominations for Best Paper Awards at major international conferences - ICRA 2008, IROS 2010, ICDL 2012 and EACL 2014. He serves in editorial and programme committee roles for conferences and journals in the areas of AI and Robotics. He leads Team Edinferno, the first UK entry in the Standard Platform League at the RoboCup International Competition. This work has received media coverage, including by BBC News and The Telegraph, and has resulted in many public engagement activities, such as at the Royal Society Summer Science Exhibition, Edinburgh International Science festival and Edinburgh Festival Fringe. Before joining the School of Informatics, he was a Staff Engineer with National Instruments Corp., where he contributed to five products in the areas of motion control, computer vision and dynamic simulation. This work resulted in seven US patents and numerous industry awards for product innovation.