AITopics | Materials

Collaborating Authors

Materials

Generalization Metrics for Practical Quantum Advantage in Generative Models

Gili, Kaitlin, Mauri, Marta, Perdomo-Ortiz, Alejandro

arXiv.org Artificial IntelligenceMay-11-2023

As the quantum computing community gravitates towards understanding the practical benefits of quantum computers, having a clear definition and evaluation scheme for assessing practical quantum advantage in the context of specific applications is paramount. Generative modeling, for example, is a widely accepted natural use case for quantum computers, and yet has lacked a concrete approach for quantifying success of quantum models over classical ones. In this work, we construct a simple and unambiguous approach to probe practical quantum advantage for generative modeling by measuring the algorithm's generalization performance. Using the sample-based approach proposed here, any generative model, from state-of-the-art classical generative models such as GANs to quantum models such as Quantum Circuit Born Machines, can be evaluated on the same ground on a concrete well-defined framework. In contrast to other sample-based metrics for probing practical generalization, we leverage constrained optimization problems (e.g., cardinality-constrained problems) and use these discrete datasets to define specific metrics capable of unambiguously measuring the quality of the samples and the model's generalization capabilities for generating data beyond the training set but still within the valid solution space. Additionally, our metrics can diagnose trainability issues such as mode collapse and overfitting, as we illustrate when comparing GANs to quantum-inspired models built out of tensor networks. Our simulation results show that our quantum-inspired models have up to a $68 \times$ enhancement in generating unseen unique and valid samples compared to GANs, and a ratio of 61:2 for generating samples with better quality than those observed in the training set. We foresee these metrics as valuable tools for rigorously defining practical quantum advantage in the domain of generative modeling.

artificial intelligence, generalization, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2201.0877

Country:

North America > United States (0.45)
Europe > United Kingdom (0.45)

Genre: Research Report > New Finding (1.00)

Industry:

Banking & Finance (0.67)
Materials > Chemicals > Industrial Gases > Liquified Gas (0.45)
Materials > Chemicals > Commodity Chemicals > Petrochemicals > LNG (0.45)
Energy > Oil & Gas > Midstream (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.85)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

Autocorrelations Decay in Texts and Applicability Limits of Language Models

Mikhaylovskiy, Nikolay, Churilov, Ilya

arXiv.org Artificial IntelligenceMay-11-2023

To avoid any terminological doubt, when we write "models of the language", we refer to any models that explain some linguistic phenomena, while "language models" refer to probabilistic language models as defined in Subsection 2.3 Probabilistic Language Models. While not long ago probabilistic language models were just models that assign probabilities to sequences of words [4], now they are the cornerstone of any task in computational linguistics through few-shot learning [6], prompt engineering [38] or fine-tuning [13]. On the other hand, current language models fail to catch long-range dependencies in the text consistently. For example, text generation with maximum likelihood target leads to rapid text degeneration, and consistent text generation requires probabilistic sampling and other tricks [22]. Large language models such as GPT-3 [6] push the boundary of "short text" rather far (specifically, to 2048 tokens), but do not remove the problem. Our contributions in this work are the following: We explain how the laws of autocorrelations decay in texts are related to applicability of language models to long texts; We pioneer the use of pretrained word vectors for autocorrelation computations that allows us to study a widest range of autocorrelation distances; We show that the autocorrelations in literary texts decay according to power laws for all these distances; We show that distributional semantics typically provides coherent autocorrelations decay exponents for texts translated to multiple languages, unlike earlier flawed approaches; We show that the behavior of autocorrelations decay in generated texts is quantitatively and often qualitatively different from the literary texts.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2305.06615

Country:

Europe > United Kingdom (0.45)
North America > Canada (0.04)
South America > French Guiana > Guyane > Cayenne (0.04)
(14 more...)

Genre:

Personal > Interview (1.00)
Research Report (0.82)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Law (1.00)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.92)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.66)

Add feedback

Rhino: An Autonomous Robot for Mapping Underground Mine Environments

Tatsch, Christopher, Jnr, Jonas Amoama Bredu, Covell, Dylan, Tulu, Ihsan Berk, Gu, Yu

arXiv.org Artificial IntelligenceMay-11-2023

There are many benefits for exploring and exploiting underground mines, but there are also significant risks and challenges. One such risk is the potential for accidents caused by the collapse of the pillars, and roofs which can be mitigated through inspections. However, these inspections can be costly and may put the safety of the inspectors at risk. To address this issue, this work presents Rhino, an autonomous robot that can navigate underground mine environments and generate 3D maps. These generated maps will allow mine workers to proactively respond to potential hazards and prevent accidents. The system being developed is a skid-steer, four-wheeled unmanned ground vehicle (UGV) that uses a LiDAR and IMU to perform long-duration autonomous navigation and generation of maps through a LIO-SAM framework. The system has been tested in different environments and terrains to ensure its robustness and ability to operate for extended periods of time while also generating 3D maps.

artificial intelligence, rhino, robot, (18 more...)

arXiv.org Artificial Intelligence

2305.06958

Country:

North America > United States > West Virginia > Monongalia County > Morgantown (0.04)
North America > United States > Virginia (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Japan > Honshū > Kansai > Hyogo Prefecture > Kobe (0.04)

Genre: Research Report (0.50)

Industry: Materials > Metals & Mining (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.88)

Add feedback

On Pitfalls of $\textit{RemOve-And-Retrain}$: Data Processing Inequality Perspective

Song, Junhwa, Cha, Keumgang, Seo, Junghoon

arXiv.org Artificial IntelligenceMay-10-2023

Approaches for appraising feature importance approximations, alternatively referred to as attribution methods, have been established across an extensive array of contexts. The development of resilient techniques for performance benchmarking constitutes a critical concern in the sphere of explainable deep learning. This study scrutinizes the dependability of the RemOve-And-Retrain (ROAR) procedure, which is prevalently employed for gauging the performance of feature importance estimates. The insights gleaned from our theoretical foundation and empirical investigations reveal that attributions containing lesser information about the decision function may yield superior results in ROAR benchmarks, contradicting the original intent of ROAR. This occurrence is similarly observed in the recently introduced variant RemOve-And-Debias (ROAD), and we posit a persistent pattern of blurriness bias in ROAR attribution metrics. Our findings serve as a warning against indiscriminate use on ROAR metrics. The code is available as open source.

attribution method, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2304.13836

Genre: Research Report > New Finding (1.00)

Industry:

Materials > Chemicals > Industrial Gases > Liquified Gas (0.68)
Materials > Chemicals > Commodity Chemicals > Petrochemicals > LNG (0.68)
Energy > Oil & Gas > Midstream (0.68)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science (0.93)
(2 more...)

Add feedback

Structured Sentiment Analysis as Transition-based Dependency Parsing

Fernández-González, Daniel

arXiv.org Artificial IntelligenceMay-9-2023

Structured sentiment analysis (SSA) aims to automatically extract people's opinions from a text in natural language and adequately represent that information in a graph structure. One of the most accurate methods for performing SSA was recently proposed and consists of approaching it as a dependency parsing task. Although we can find in the literature how transition-based algorithms excel in dependency parsing in terms of accuracy and efficiency, all proposed attempts to tackle SSA following that approach were based on graph-based models. In this article, we present the first transition-based method to address SSA as dependency parsing. Specifically, we design a transition system that processes the input text in a left-to-right pass, incrementally generating the graph structure containing all identified opinions. To effectively implement our final transition-based model, we resort to a Pointer Network architecture as a backbone. From an extensive evaluation, we demonstrate that our model offers the best performance to date in practically all cases among prior dependency-based methods, and surpass recent task-specific techniques on the most challenging datasets. We additionally include an in-depth analysis and empirically prove that the overall time-complexity cost of our approach is quadratic in the sentence length, being more efficient than top-performing graph-based parsers.

computational linguistic, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2305.05311

Country:

Europe (1.00)
North America > United States > Minnesota (0.28)

Genre: Research Report (1.00)

Industry:

Materials > Chemicals > Industrial Gases > Liquified Gas (0.67)
Materials > Chemicals > Commodity Chemicals > Petrochemicals > LNG (0.67)
Energy > Oil & Gas > Midstream (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Neurosymbolic Artificial Intelligence (NSAI) based Algorithm for predicting the Impact Strength of Additive Manufactured Polylactic Acid (PLA) Specimens

Mishra, Akshansh, Jatti, Vijaykumar S

arXiv.org Artificial IntelligenceMay-7-2023

In this study, we introduce application of Neurosymbolic Artificial Intelligence (NSAI) for predicting the impact strength of additive manufactured polylactic acid (PLA) components, representing the first-ever use of NSAI in the domain of additive manufacturing. The NSAI model amalgamates the advantages of neural networks and symbolic AI, offering a more robust and accurate prediction than traditional machine learning techniques. Experimental data was collected and synthetically augmented to 1000 data points, enhancing the model's precision. The Neurosymbolic model was developed using a neural network architecture comprising input, two hidden layers, and an output layer, followed by a decision tree regressor representing the symbolic component. The model's performance was benchmarked against a Simple Artificial Neural Network (ANN) model by assessing mean squared error (MSE) and R-squared (R2) values for both training and validation datasets. The results reveal that the Neurosymbolic model surpasses the Simple ANN model, attaining lower MSE and higher R2 values for both training and validation sets. This innovative application of the Neurosymbolic approach in estimating the impact strength of additive manufactured PLA components underscores its potential for optimizing the additive manufacturing process. Future research could investigate further refinements to the Neurosymbolic model, extend its application to other materials and additive manufacturing processes, and incorporate real-time monitoring and control for enhanced process optimization.

artificial intelligence, machine learning, neurosymbolic model, (13 more...)

arXiv.org Artificial Intelligence

2305.05668

Country:

Europe > Italy > Lombardy > Milan (0.04)
Asia > India > Maharashtra > Pune (0.04)

Genre: Research Report > New Finding (0.34)

Industry:

Machinery > Industrial Machinery (1.00)
Materials > Chemicals > Commodity Chemicals > Bioplastics (0.61)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Machine learning helps researchers separate compostable from conventional plastic waste

AIHubMay-5-2023, 15:10:54 GMT

Disposable plastics are everywhere: Food containers, coffee cups, plastic bags. Some of these plastics, called compostable plastics, can be engineered to biodegrade under controlled conditions. However, they often look identical to conventional plastics, get recycled incorrectly and, as a result, contaminate plastic waste streams and reduce recycling efficiency. Similarly, recyclable plastics are often mistaken for compostable ones, resulting in polluted compost. Researchers at University College London (UCL) have published a paper in Frontiers in Sustainability in which they used machine learning to automatically sort different types of compostable and biodegradable plastics and differentiate them from conventional plastics.

help researcher separate compostable, miodownik, plastic, (13 more...)

AIHub

AI-Alerts: 2023 > 2023-05 > AAAI AI-Alert for May 10, 2023 (1.00)

Industry:

Materials (0.80)
Water & Waste Management > Solid Waste Management (0.75)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Materials Informatics: An Algorithmic Design Rule

Bishnoi, Bhupesh

arXiv.org Artificial IntelligenceMay-5-2023

We have researched the organic semiconductor's enigmas through the material informatics approach. By applying diverse neural network topologies, logical axiom, and inferencing information science, we have developed data-driven procedures for novel organic semiconductor discovery for the semiconductor industry and knowledge extraction for the material science community. We have reviewed and corresponded with various algorithms for the neural network design topology for the material informatics dataset, as shown in Figure 1, a generalized neural network topology. We have used four chemical compound space databases for model training and validation in this research notebook. The first one is the general quantum chemistry structures and properties of 134-kilo molecules (QM9) of computed geometric, energetic, electronic, and thermodynamic properties for 134-kilo stable small organic molecules made up of C, H, O, N, F for the novel design of new drugs and materials.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2305.03797

Country: North America > United States (1.00)

Genre:

Research Report (0.82)
Workflow (0.67)
Instructional Material (0.67)

Industry:

Materials > Chemicals (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Energy > Energy Storage (1.00)
(4 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

Incorporating Background Knowledge in Symbolic Regression using a Computer Algebra System

Fox, Charles, Tran, Neil, Nacion, Nikki, Sharlin, Samiha, Josephson, Tyler R.

arXiv.org Artificial IntelligenceMay-4-2023

Since John Koza pioneered the paradigm of programming by means of natural selection, many applications for SR in scientific discovery have emerged [1]. Unlike other applications of machine learning techniques, scientific research demands explanation and verification, both of which are made more feasible by the generation of human-interpretable mathematical models (as opposed to fitting a model with thousands of parameters) [2-4]. Furthermore, SR can be effective even with very small datasets ( 10 items) such as those produced by difficult or expensive experiments which are not easily repeated. The mathematical expressions produced by SR can easily be extrapolated to untested or otherwise unreachable domains within a dataset (such as extreme pressures or temperatures). For decades, SR has discovered interesting models from data in many applications including inferring process models at the Dow Chemical Company [5], rainfall-runoff modeling [6], and rediscovering equations for double-pendulum motion [7]. Symbolic regression has been applied across all scales of scientific investigation, including the atomistic (interatomic potentials [8]), macroscopic (computational fluid dynamics [9]), and cosmological (dark matter overdensity [10]) scales. Some techniques facilitate search through billions of candidate expressions, such as the space of nonlinear descriptors of material properties [11]. While most applications of SR in science focus on identifying empirical patterns in data, such "data-only" approaches do not account for potential insights from background theory. In fact, some SR works emphasize their capabilities of discovery "without any prior knowledge about physics, kinematics, or geometry" [7].

artificial intelligence, evolutionary algorithm, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2301.11919

Country: North America > United States > Maryland (0.46)

Genre: Research Report (0.82)

Industry: Materials > Chemicals (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.89)

Add feedback

Joint Graph Learning and Model Fitting in Laplacian Regularized Stratified Models

Cheng, Ziheng, Zhang, Junzi, Agrawal, Akshay, Boyd, Stephen

arXiv.org Artificial IntelligenceMay-4-2023

Laplacian regularized stratified models (LRSM) are models that utilize the explicit or implicit network structure of the sub-problems as defined by the categorical features called strata (e.g., age, region, time, forecast horizon, etc.), and draw upon data from neighboring strata to enhance the parameter learning of each sub-problem. They have been widely applied in machine learning and signal processing problems, including but not limited to time series forecasting, representation learning, graph clustering, max-margin classification, and general few-shot learning. Nevertheless, existing works on LRSM have either assumed a known graph or are restricted to specific applications. In this paper, we start by showing the importance and sensitivity of graph weights in LRSM, and provably show that the sensitivity can be arbitrarily large when the parameter scales and sample sizes are heavily imbalanced across nodes. We then propose a generic approach to jointly learn the graph while fitting the model parameters by solving a single optimization problem. We interpret the proposed formulation from both a graph connectivity viewpoint and an end-to-end Bayesian perspective, and propose an efficient algorithm to solve the problem. Convergence guarantees of the proposed optimization algorithm is also provided despite the lack of global strongly smoothness of the Laplacian regularization term typically required in the existing literature, which may be of independent interest. Finally, we illustrate the efficiency of our approach compared to existing methods by various real-world numerical examples.

artificial intelligence, machine learning, optimization problem, (17 more...)

arXiv.org Artificial Intelligence

2305.02573

Country:

North America > United States > Wyoming (0.04)
North America > United States > Virginia (0.04)
North America > United States > Montana (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.68)
Materials (0.46)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.86)

Add feedback