AITopics | Scientific Discovery

Collaborating Authors

Scientific Discovery

"The problem of giving rules for producing true scientific statements has been replaced by the problem of finding efficient heuristic rules for culling the reasonable candidates for an explanation from an appropriate set of possible candidates [and finding methods for constructing the candidates]."
– B. Buchanan, quoted in Lindley Darden. Recent Work in Computational Scientific Discovery.

News Overviews Instructional Materials AI-Alerts Classics

DRNets can solve Sudoku, speed scientific discovery

#artificialintelligenceSep-17-2021, 15:23:33 GMT

Say you're driving with a friend in a familiar neighborhood, and the friend asks you to turn at the next intersection. The friend doesn't say which way to turn, but since you both know it's a one-way street, it's understood. That type of reasoning is at the heart of a new artificial-intelligence framework – tested successfully on overlapping Sudoku puzzles – that could speed discovery in materials science, renewable energy technology and other areas. An interdisciplinary research team led by Carla Gomes, the Ronald C. and Antonia V. Nielsen Professor of Computing and Information Science in the Cornell Ann S. Bowers College of Computing and Information Science, has developed Deep Reasoning Networks (DRNets), which combine deep learning – even with a relatively small amount of data – with an understanding of the subject's boundaries and rules, known as "constraint reasoning." Di Chen, a computer science doctoral student in Gomes' group, is first author of "Automating Crystal-Structure Phase Mapping by Combining Deep Learning with Constraint Reasoning," published Sept. 16 in Nature Machine Intelligence.

discovery, drnet, xrd pattern, (14 more...)

#artificialintelligence

AI-Alerts: 2021 > 2021-09 > AAAI AI-Alert for Sep 21, 2021 (1.00)

Country: North America > United States > California (0.05)

Industry:

Energy > Renewable (0.72)
Leisure & Entertainment > Games > Sudoku (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.42)

Add feedback

Everything you need to know about Hypothesis Testing in Machine Learning

#artificialintelligenceSep-10-2021, 22:00:15 GMT

Hypothesis testing is done to confirm our observation about the population using sample data, within the desired error level.

hypothesis, hypothesis testing, significance level, (13 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.69)

Add feedback

Facebook Researcher's New Algorithm Ushers New Paradigm Of Image Recognition

#artificialintelligenceSep-6-2021, 14:55:05 GMT

"VICReg could be used to model the dependencies between a video clip and the frame that comes after, therefore learning to predict the future in a video." Humans have an innate capability to identify objects in the wild, even from a blurred glimpse of the thing. We do this efficiently by remembering only high-level features that get the job done (identification) and ignoring the details unless required. In the context of deep learning algorithms that do object detection, contrastive learning explored the premise of representation learning to obtain a large picture instead of doing the heavy lifting by devouring pixel-level details. But, contrastive learning has its own limitations.

learning, representation, vicreg, (10 more...)

#artificialintelligence

Country:

North America > United States > New York (0.05)
Asia > India (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.41)
Information Technology > Artificial Intelligence > Cognitive Science > Creativity & Intelligence (0.41)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.40)

Add feedback

From Individual Control to Social Protection: New Paradigms for Privacy Law In The Age of Predictive Analytics

#artificialintelligenceAug-19-2021, 14:50:36 GMT

By Dennis D. Hirsch, Published on 01/01/20

individual control, new paradigm, social protection, (2 more...)

#artificialintelligence

Industry:

Law > Civil Rights & Constitutional Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.69)
Information Technology > Artificial Intelligence > Cognitive Science > Creativity & Intelligence (0.69)

Add feedback

Automatically Steering Experiments Toward Scientific Discovery

#artificialintelligenceAug-11-2021, 21:10:22 GMT

Kevin Yager (front) and Masafumi Fukuto at Brookhaven Lab's National Synchrotron Light Source II, where they've been implementing a method of autonomous experimentation. In the popular view of traditional science, scientists are in the lab hovering over their experiments, micromanaging every little detail. For example, they may iteratively test a wide variety of material compositions, synthesis and processing protocols, and environmental conditions to see how these parameters influence material properties. In each iteration, they analyze the collected data, looking for patterns and relying on their scientific knowledge and intuition to select useful follow-on measurements. This manual approach consumes limited instrument time and the attention of human experts who could otherwise focus on the bigger picture.

beamline, experiment, parameter space, (14 more...)

#artificialintelligence

Country:

North America > United States (0.48)
Europe > Poland > Masovia Province > Warsaw (0.04)
Europe > France (0.04)

Industry:

Energy (0.96)
Government > Regional Government (0.49)
Media > News (0.40)

Technology:

Information Technology > Communications > Social Media (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.40)

Add feedback

A unified framework for bandit multiple testing

Xu, Ziyu, Wang, Ruodu, Ramdas, Aaditya

arXiv.org Machine LearningJul-15-2021

In bandit multiple hypothesis testing, each arm corresponds to a different null hypothesis that we wish to test, and the goal is to design adaptive algorithms that correctly identify large set of interesting arms (true discoveries), while only mistakenly identifying a few uninteresting ones (false discoveries). One common metric in non-bandit multiple testing is the false discovery rate (FDR). We propose a unified, modular framework for bandit FDR control that emphasizes the decoupling of exploration and summarization of evidence. We utilize the powerful martingale-based concept of ``e-processes'' to ensure FDR control for arbitrary composite nulls, exploration rules and stopping times in generic problem settings. In particular, valid FDR control holds even if the reward distributions of the arms could be dependent, multiple arms may be queried simultaneously, and multiple (cooperating or competing) agents may be querying arms, covering combinatorial semi-bandit type settings as well. Prior work has considered in great detail the setting where each arm's reward distribution is independent and sub-Gaussian, and a single arm is queried at each step. Our framework recovers matching sample complexity guarantees in this special case, and performs comparably or better in practice. For other settings, sample complexities will depend on the finer details of the problem (composite nulls being tested, exploration algorithm, data dependence structure, stopping rule) and we do not explore these; our contribution is to show that the FDR guarantee is clean and entirely agnostic to these details.

algorithm, fdr control, hypothesis, (14 more...)

arXiv.org Machine Learning

2107.07322

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > France (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.45)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)

Add feedback

Generalized Multivariate Signs for Nonparametric Hypothesis Testing in High Dimensions

Majumdar, Subhabrata, Chatterjee, Snigdhansu

arXiv.org Machine LearningJul-2-2021

High-dimensional data, where the dimension of the feature space is much larger than sample size, arise in a number of statistical applications. In this context, we construct the generalized multivariate sign transformation, defined as a vector divided by its norm. For different choices of the norm function, the resulting transformed vector adapts to certain geometrical features of the data distribution. Building up on this idea, we obtain one-sample and two-sample testing procedures for mean vectors of high-dimensional data using these generalized sign vectors. These tests are based on U-statistics using kernel inner products, do not require prohibitive assumptions, and are amenable to a fast randomization-based implementation. Through experiments in a number of data settings, we show that tests using generalized signs display higher power than existing tests, while maintaining nominal type-I error rates. Finally, we provide example applications on the MNIST and Minnesota Twin Studies genomic data.

asymptotic distribution, hypothesis, test statistic, (15 more...)

arXiv.org Machine Learning

2107.01103

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.40)

Add feedback

A fuzzy take on the logical issues of statistical hypothesis testing

Booth, Matthew, Paillusson, Fabien

arXiv.org Artificial IntelligenceJun-24-2021

Statistical Hypothesis Testing (SHT) is a class of inference methods whereby one makes use of empirical data to test a hypothesis and often emit a judgment about whether to reject it or not. In this paper we focus on the logical aspect of this strategy, which is largely independent of the adopted school of thought, at least within the various frequentist approaches. We identify SHT as taking the form of an unsound argument from Modus Tollens in classical logic, and, in order to rescue SHT from this difficulty, we propose that it can instead be grounded in t-norm based fuzzy logics. We reformulate the frequentists' SHT logic by making use of a fuzzy extension of modus Tollens to develop a model of truth valuation for its premises. Importantly, we show that it is possible to preserve the soundness of Modus Tollens by exploring the various conventions involved with constructing fuzzy negations and fuzzy implications (namely, the S and R conventions). We find that under the S convention, it is possible to conduct the Modus Tollens inference argument using Zadeh's compositional extension and any possible t-norm. Under the R convention we find that this is not necessarily the case, but that by mixing R-implication with S-negation we can salvage the product t-norm, for example. In conclusion, we have shown that fuzzy logic is a legitimate framework to discuss and address the difficulties plaguing frequentist interpretations of SHT.

fuzzy take, logical issue, statistical hypothesis testing, (1 more...)

arXiv.org Artificial Intelligence

doi: 10.3390/philosophies6010021

2106.13241

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.60)

Add feedback

Statistics Fundamentals (7/9) Hypothesis Testing

#artificialintelligenceMay-26-2021, 15:56:57 GMT

Statistics Fundamentals (7/9) Hypothesis Testing Statistical Hypothesis Testing: Theory and Python Welcome to Statistics Fundamentals 7, Hypothesis Testing. This course is for beginners who are interested in statistical analysis. Description Welcome to Statistics Fundamentals 7, Hypothesis Testing. This course is for beginners who are interested in statistical analysis. And anyone who is not a beginner but wants to go over from the basics is also welcome!

hypothesis testing, inferential statistics, statistical analysis, (6 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.83)

Industry: Education (0.66)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)

Add feedback

Living in the wilderness: hypothesis testing in a world that disagrees with statistical theory

#artificialintelligenceMay-23-2021, 17:10:35 GMT

Sometimes it seems paradoxical to call the famous bell curve "normal". Among all the assumptions made by traditional statistical theory, the normality assumption is notorious for the frequency it doesn't hold. My aim in this article is to show a way to test hypotheses when the normality assumption of traditional hypothesis tests is violated. In this scenario, we can't rely on theoretical results, so we need to depart from theory's ivory tower and double the bet on our data. To get there, first I briefly review what hypothesis testing is, focusing on an intuitive grasp of the reasoning behind it (no equations allowed!). Then I proceed to a case study motivated by a business problem where the normality assumption doesn't hold. This makes matters concrete and will direct our discussion. After the problem is explained, I will show that bootstrapping is a good way to fill the gaps left by theory without changing anything in the reasoning at the heart of hypothesis testing. In particular, I will show that bootstrapping leads to the right conclusion about the test. I conclude this article with a critical evaluation of bootstrapping and similar methods, pointing out their pros and cons. Many data scientists have trouble understanding hypothesis testing.

credit score, hypothesis, test statistic, (17 more...)

#artificialintelligence

Genre: Research Report > Experimental Study (0.47)

Industry: Banking & Finance (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.75)

Add feedback