Goto

Collaborating Authors

 complex social system




No Free Delivery Service: Epistemic limits of passive data collection in complex social systems

Neural Information Processing Systems

Rapid model validation via the train-test paradigm has been a key driver for the breathtaking progress in machine learning and AI. However, modern AI systems often depend on a combination of tasks and data collection practices that violate all assumptions ensuring test validity. Yet, without rigorous model validation we cannot ensure the intended outcomes of deployed AI systems, including positive social impact, nor continue to advance AI research in a scientifically sound way. In this paper, I will show that for widely considered inference settings in complex social systems the train-test paradigm does not only lack a justification but is indeed invalid for any risk estimator, including counterfactual and causal estimators, with high probability. These formal impossibility results highlight a fundamental epistemic issue, i.e., that for key tasks in modern AI we cannot know whether models are valid under current data collection practices. Importantly, this includes variants of both recommender systems and reasoning via large language models, and neither naïve scaling nor limited benchmarks are suited to address this issue.


No Free Delivery Service: Epistemic limits of passive data collection in complex social systems

Nickel, Maximilian

arXiv.org Machine Learning

Rapid model validation via the train-test paradigm has been a key driver for the breathtaking progress in machine learning and AI. However, modern AI systems often depend on a combination of tasks and data collection practices that violate all assumptions ensuring test validity. Yet, without rigorous model validation we cannot ensure the intended outcomes of deployed AI systems, including positive social impact, nor continue to advance AI research in a scientifically sound way. In this paper, I will show that for widely considered inference settings in complex social systems the train-test paradigm does not only lack a justification but is indeed invalid for any risk estimator, including counterfactual and causal estimators, with high probability. These formal impossibility results highlight a fundamental epistemic issue, i.e., that for key tasks in modern AI we cannot know whether models are valid under current data collection practices. Importantly, this includes variants of both recommender systems and reasoning via large language models, and neither na\"ive scaling nor limited benchmarks are suited to address this issue. I am illustrating these results via the widely used MovieLens benchmark and conclude by discussing the implications of these results for AI in social systems, including possible remedies such as participatory data curation and open science.


Researchers find horses have distinct facial expressions when they feel disappointed or frustrated

Daily Mail - Science & tech

Horses have distinct facial expressions for disappointment and frustration, according to a study. Researchers at the University of Lincoln put 30 horses through a food-reward task, which made them either disappointed or frustrated. When disappointed, the horses tended to blink a lot, lift their nostrils, stick their tongue out and make chewing movements. When frustrated, they showed more of the whites of their eyes and rotated their ears backwards. Dr Claire Ricci-Bonot, lead author of the study, said that horses are'are generally gregarious animals, living within a complex social system'.


Optimal Control of Complex Systems through Variational Inference with a Discrete Event Decision Process

Dong, Wen, Liu, Bo, Yang, Fan

arXiv.org Artificial Intelligence

Complex social systems are composed of interconnected individuals whose interactions result in group behaviors. Optimal control of a real-world complex system has many applications, including road traffic management, epidemic prevention, and information dissemination. However, such real-world complex system control is difficult to achieve because of high-dimensional and non-linear system dynamics, and the exploding state and action spaces for the decision maker. Prior methods can be divided into two categories: simulation-based and analytical approaches. Existing simulation approaches have high-variance in Monte Carlo integration, and the analytical approaches suffer from modeling inaccuracy. We adopted simulation modeling in specifying the complex dynamics of a complex system, and developed analytical solutions for searching optimal strategies in a complex network with high-dimensional state-action space. To capture the complex system dynamics, we formulate the complex social network decision making problem as a discrete event decision process. To address the curse of dimensionality and search in high-dimensional state action spaces in complex systems, we reduce control of a complex system to variational inference and parameter learning, introduce Bethe entropy approximation, and develop an expectation propagation algorithm. Our proposed algorithm leads to higher system expected rewards, faster convergence, and lower variance of value function in a real-world transportation scenario than state-of-the-art analytical and sampling approaches.


Why predicting the future is more than just horseplay

Christian Science Monitor | Science

April 24, 2017 --Three years out of a PhD in physics in 1953, John Kelly Jr. published a breakthrough paper about insider information in horse racing in an unlikely place: the Bell Labs Technical Journal. By the time it was in print, the paper's title had been scrubbed of its references to gambling – the AT&T executives didn't care for Bell Labs to be so directly associated with horse racing – but the content remained. Dr. Kelly had not just cracked the mathematics underlying a type of gambling, but he had also revealed deeper patterns about the nature of prediction. When the odds posted by the track are different from the odds determined using insider information, Kelly's formula explains how to take those differences and place the best bets possible, mathematically speaking. The formula is powerful in its simplicity.