Pfeiffer, Thomas
A lunar reconnaissance drone for cooperative exploration and high-resolution mapping of extreme locations
Tonasso, Roméo, Tataru, Daniel, Rauch, Hippolyte, Pozsgay, Vincent, Pfeiffer, Thomas, Uythoven, Erik, Rodríguez-Martínez, David
An efficient characterization of scientifically significant locations is essential prior to the return of humans to the Moon. The highest resolution imagery acquired from orbit of south-polar shadowed regions and other relevant locations remains, at best, an order of magnitude larger than the characteristic length of most of the robotic systems to be deployed. This hinders the planning and successful implementation of prospecting missions and poses a high risk for the traverse of robots and humans, diminishing the potential overall scientific and commercial return of any mission. We herein present the design of a lightweight, compact, autonomous, and reusable lunar reconnaissance drone capable of assisting other ground-based robotic assets, and eventually humans, in the characterization and high-resolution mapping (~0.1 m/px) of particularly challenging and hard-to-access locations on the lunar surface. The proposed concept consists of two main subsystems: the drone and its service station. With a total combined wet mass of 100 kg, the system is capable of 11 flights without refueling the service station, enabling almost 9 km of accumulated flight distance. The deployment of such a system could significantly impact the efficiency of upcoming exploration missions, increasing the distance covered per day of exploration and significantly reducing the need for recurrent contacts with ground stations on Earth.
Decision Market Based Learning For Multi-agent Contextual Bandit Problems
Wang, Wenlong, Pfeiffer, Thomas
Information is often stored in a distributed and proprietary form, and agents who own information are often self-interested and require incentives to reveal their information. Suitable mechanisms are required to elicit and aggregate such distributed information for decision making. In this paper, we use simulations to investigate the use of decision markets as mechanisms in a multi-agent learning system to aggregate distributed information for decision-making in a contextual bandit problem. The system utilises strictly proper decision scoring rules to assess the accuracy of probabilistic reports from agents, which allows agents to learn to solve the contextual bandit problem jointly. Our simulations show that our multi-agent system with distributed information can be trained as efficiently as a centralised counterpart with a single agent that receives all information. Moreover, we use our system to investigate scenarios with deterministic decision scoring rules which are not incentive compatible. We observe the emergence of more complex dynamics with manipulative behaviour, which agrees with existing theoretical analyses.
Replication Markets: Results, Lessons, Challenges and Opportunities in AI Replication
Liu, Yang, Gordon, Michael, Wang, Juntao, Bishop, Michael, Chen, Yiling, Pfeiffer, Thomas, Twardy, Charles, Viganola, Domenico
The last decade saw the emergence of systematic large-scale replication projects in the social and behavioral sciences, (Camerer et al., 2016, 2018; Ebersole et al., 2016; Klein et al., 2014, 2018; Collaboration, 2015). These projects were driven by theoretical and conceptual concerns about a high fraction of "false positives" in the scientific publications (Ioannidis, 2005) (and a high prevalence of "questionable research practices" (Simmons, Nelson, and Simonsohn, 2011). Concerns about the credibility of research findings are not unique to the behavioral and social sciences; within Computer Science, Artificial Intelligence (AI) and Machine Learning (ML) are areas of particular concern (Lucic et al., 2018; Freire, Bonnet, and Shasha, 2012; Gundersen and Kjensmo, 2018; Henderson et al., 2018). Given the pioneering role of the behavioral and social sciences in the promotion of novel methodologies to improve the credibility of research, it is a promising approach to analyze the lessons learned from this field and adjust strategies for Computer Science, AI and ML In this paper, we review approaches used in the behavioral and social sciences and in the DARPA SCORE project. We particularly focus on the role of human forecasting of replication outcomes, and how forecasting can leverage the information gained from relatively labor and resource-intensive replications. We will discuss opportunities and challenges of using these approaches to monitor and improve the credibility of research areas in Computer Science, AI, and ML.
Adaptive Polling for Information Aggregation
Pfeiffer, Thomas (Harvard University) | Gao, Xi Alice (Harvard University) | Chen, Yiling (Harvard University) | Mao, Andrew (Harvard University) | Rand, David G. (Harvard University)
The flourishing of online labor markets such as Amazon Mechanical Turk (MTurk) makes it easy to recruit many workers for solving small tasks. We study whether information elicitation and aggregation over a combinatorial space can be achieved by integrating small pieces of potentially imprecise information, gathered from a large number of workers through simple, one-shot interactions in an online labor market. We consider the setting of predicting the ranking of $n$ competing candidates, each having a hidden underlying strength parameter. At each step, our method estimates the strength parameters from the collected pairwise comparison data and adaptively chooses another pairwise comparison question for the next recruited worker. Through an MTurk experiment, we show that the adaptive method effectively elicits and aggregates information, outperforming a naive method using a random pairwise comparison question at each step.