AITopics | Model-Based Reasoning

Collaborating Authors

Model-Based Reasoning

News Overviews Instructional Materials AI-Alerts Classics

Natural Language to Code Translation with Execution

Shi, Freda, Fried, Daniel, Ghazvininejad, Marjan, Zettlemoyer, Luke, Wang, Sida I.

arXiv.org Artificial IntelligenceNov-1-2022

Generative models of code, pretrained on large corpora of programs, have shown great success in translating natural language to code (Chen et al., 2021; Austin et al., 2021; Li et al., 2022, inter alia). While these models do not explicitly incorporate program semantics (i.e., execution results) during training, they are able to generate correct solutions for many problems. However, choosing a single correct program from a generated set for each problem remains challenging. In this work, we introduce execution result--based minimum Bayes risk decoding (MBR-EXEC) for program selection and show that it improves the few-shot performance of pretrained code models on natural-language-to-code tasks. We select output programs from a generated candidate set by marginalizing over program implementations that share the same semantics. Because exact equivalence is intractable, we execute each program on a small number of test inputs to approximate semantic equivalence. Across datasets, execution or simulated execution significantly outperforms the methods that do not involve program semantics. We find that MBR-EXEC consistently improves over all execution-unaware selection methods, suggesting it as an effective approach for natural language to code translation. We open-source our code at github.com/facebookresearch/mbr-exec and data at dl.fbaipublicfiles.com/mbr-exec/mbr-exec-release.zip

computational linguistic, large language model, natural language, (14 more...)

arXiv.org Artificial Intelligence

2204.11454

Country:

Europe > Germany > Berlin (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(12 more...)

Genre:

Research Report > New Finding (0.46)
Instructional Material > Course Syllabus & Notes (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)

Add feedback

Tutorial: Julia for Scientific Machine Learning – TAMIDS Scientific Machine Learning Lab

#artificialintelligenceOct-27-2022, 02:36:11 GMT

Julia (https://julialang.org/) is a generic programming language designed for high-performance computing. It solves the "two language problem" of scientific computing. Julia is dynamically typed like scripting language such as Python and can be compiled into native machine code. Besides, composability via multiple dispatches makes Julia ideal for integration across packages. SciML (https://sciml.ai/) is an open-source software for scientific machine learning based on the Julia language that combines machine learning and scientific computing by integrating numerous standalone packages.

application, tamid scientific machine learning lab, tutorial, (5 more...)

#artificialintelligence

Country:

North America > United States > Texas > Brazos County > College Station (0.40)
Europe > Portugal > Braga > Braga (0.08)
Asia > Taiwan (0.08)

Genre: Instructional Material (0.38)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Modeling Document-level Temporal Structures for Building Temporal Dependency Graphs

Choubey, Prafulla Kumar, Huang, Ruihong

arXiv.org Artificial IntelligenceOct-21-2022

We propose to leverage news discourse profiling to model document-level temporal structures for building temporal dependency graphs. Our key observation is that the functional roles of sentences used for profiling news discourse signify different time frames relevant to a news story and can, therefore, help to recover the global temporal structure of a document. Our analyses and experiments with the widely used knowledge distillation technique show that discourse profiling effectively identifies distant inter-sentence event and (or) time expression pairs that are temporally related and otherwise difficult to locate.

computational linguistic, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2210.11787

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Georgia > Fulton County > Atlanta (0.05)
Asia > South Korea (0.04)
(17 more...)

Genre: Research Report (0.64)

Industry: Media > News (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.61)
Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

Improving aircraft performance using machine learning: a review

Clainche, Soledad Le, Ferrer, Esteban, Gibson, Sam, Cross, Elisabeth, Parente, Alessandro, Vinuesa, Ricardo

arXiv.org Artificial IntelligenceOct-20-2022

Climate change and increasing resource scarcity are challenges that Europe needs to face in the coming decades. All this has a direct impact on air transport, which is struggling to maintain its performance and competitiveness while ensuring a development focused on sustainable mobility. Research and innovation are essential to maintain the capabilities of the aviation industry, driven by the rise of new markets and new competitors as a result of globalization. A new longterm vision for the aeronautics sector is essential to ensure its successful advancement. In this line, new requirements for the future aviation industry have been defined by the ACARE Flightpath 2050, a Group of Recognized Personalities in the aeronautic sector, including stakeholders from the aeronautics industry, air traffic management, airports, airlines, energy providers and the research community. Aeronautics and air transport comprises both: air vehicle and system technology.

data mining, machine learning, reinforcement learning, (22 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.ast.2023.108354

2210.11481

Country:

North America > United States (1.00)
Europe > United Kingdom > England (0.28)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)

Industry:

Transportation > Air (1.00)
Energy > Oil & Gas > Upstream (1.00)
Aerospace & Defense > Aircraft (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
(7 more...)

Add feedback

Physics-Informed Graph Learning

Peng, Ciyuan, Xia, Feng, Saikrishna, Vidya, Liu, Huan

arXiv.org Artificial IntelligenceOct-20-2022

An expeditious development of graph learning in recent years has found innumerable applications in several diversified fields. Of the main associated challenges are the volume and complexity of graph data. The graph learning models suffer from the inability to efficiently learn graph information. In order to indemnify this inefficacy, physics-informed graph learning (PIGL) is emerging. PIGL incorporates physics rules while performing graph learning, which has enormous benefits. This paper presents a systematic review of PIGL methods. We begin with introducing a unified framework of graph learning models followed by examining existing PIGL methods in relation to the unified framework. We also discuss several future challenges for PIGL. This survey paper is expected to stimulate innovative research and development activities pertaining to PIGL.

artificial intelligence, machine learning, neural network, (18 more...)

arXiv.org Artificial Intelligence

2202.10679

Country:

Oceania > Australia (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Arizona > Maricopa County > Tempe (0.04)

Genre: Overview (0.88)

Industry:

Energy (0.69)
Information Technology (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

NSF-funded project to develop probabilistic scientific machine learning – TAMIDS Scientific Machine Learning Lab

#artificialintelligenceOct-15-2022, 06:20:51 GMT

Across engineering and scientific disciplines, machine learning is the main method for analyzing and identifying patterns in big data and making informed decisions around that data. Recently, a new area within artificial intelligence called scientific machine learning has emerged, which introduces physics laws into machine learning models. Scientific machine learning combines the areas of artificial intelligence and scientific computation. Because scientific machine learning algorithms are informed and constrained by physics laws, they do not rely only on data and can even make predictions where there is no data. However, there has been little work on probabilistic methods in scientific machine learning, meaning that current algorithms cannot model uncertainty in the data or the physics.

engineering, scientific machine, scientific machine learning lab, (10 more...)

#artificialintelligence

Country:

North America > United States > Texas > Brazos County > College Station (0.40)
Europe > Portugal > Braga > Braga (0.13)
Europe > Finland (0.10)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Probabilistic Model of Activity Recognition with Loose Clothing

Shen, Tianchen, Di Giulio, Irene, Howard, Matthew

arXiv.org Artificial IntelligenceSep-23-2022

Human activity recognition has become an attractive research area with the development of on-body wearable sensing technology. With comfortable electronic-textiles, sensors can be embedded into clothing so that it is possible to record human movement outside the laboratory for long periods. However, a long-standing issue is how to deal with motion artefacts introduced by movement of clothing with respect to the body. Surprisingly, recent empirical findings suggest that cloth-attached sensor can actually achieve higher accuracy of activity recognition than rigid-attached sensor, particularly when predicting from short time-windows. In this work, a probabilistic model is introduced in which this improved accuracy and resposiveness is explained by the increased statistical distance between movements recorded via fabric sensing. The predictions of the model are verified in simulated and real human motion capture experiments, where it is evident that this counterintuitive effect is closely captured.

artificial intelligence, machine learning, sensor, (18 more...)

arXiv.org Artificial Intelligence

2209.11579

Country:

Europe > United Kingdom > England > Greater London > London (0.05)
North America > United States (0.04)
North America > Canada (0.04)
Asia > China (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.40)

Add feedback

A Robust Scientific Machine Learning for Optimization: A Novel Robustness Theorem

#artificialintelligenceSep-17-2022, 08:14:45 GMT

Scientific machine learning (SciML) is a field of increasing interest in several different application fields. In an optimization context, SciML-based tools have enabled the development of more efficient optimization methods. However, implementing SciML tools for optimization must be rigorously evaluated and performed with caution. This work proposes the deductions of a robustness test that guarantees the robustness of multiobjective SciML-based optimization by showing that its results respect the universal approximator theorem. The test is applied in the framework of a novel methodology which is evaluated in a series of benchmarks illustrating its consistency. Moreover, the proposed methodology results are compared with feasible regions of rigorous optimization, which requires a significantly higher computational effort.

novel robustness theorem, optimization, robust scientific machine learning, (2 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Physics-based Digital Twins for Autonomous Thermal Food Processing: Efficient, Non-intrusive Reduced-order Modeling

Kannapinn, Maximilian, Pham, Minh Khang, Schäfer, Michael

arXiv.org Artificial IntelligenceSep-7-2022

One possible way of making thermal processing controllable is to gather real-time information on the product's current state. Often, sensory equipment cannot capture all relevant information easily or at all. Digital Twins close this gap with virtual probes in real-time simulations, synchronized with the process. This paper proposes a physics-based, data-driven Digital Twin framework for autonomous food processing. We suggest a lean Digital Twin concept that is executable at the device level, entailing minimal computational load, data storage, and sensor data requirements. This study focuses on a parsimonious experimental design for training non-intrusive reduced-order models (ROMs) of a thermal process. A correlation ($R=-0.76$) between a high standard deviation of the surface temperatures in the training data and a low root mean square error in ROM testing enables efficient selection of training data. The mean test root mean square error of the best ROM is less than 1 Kelvin (0.2 % mean average percentage error) on representative test sets. Simulation speed-ups of Sp $\approx$ 1.8E4 allow on-device model predictive control. The proposed Digital Twin framework is designed to be applicable within the industry. Typically, non-intrusive reduced-order modeling is required as soon as the modeling of the process is performed in software, where root-level access to the solver is not provided, such as commercial simulation software. The data-driven training of the reduced-order model is achieved with only one data set, as correlations are utilized to predict the training success a priori.

artificial intelligence, machine learning, real time system, (19 more...)

arXiv.org Artificial Intelligence

2209.03062

Country: North America > United States (0.46)

Genre:

Research Report > New Finding (0.68)
Research Report > Experimental Study (0.46)

Industry:

Health & Medicine (1.00)
Government (1.00)
Energy > Oil & Gas > Upstream (1.00)
Food & Agriculture > Agriculture (0.93)

Technology:

Information Technology > Architecture > Real Time Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

MetaGraspNet_v0: A Large-Scale Benchmark Dataset for Vision-driven Robotic Grasping via Physics-based Metaverse Synthesis

Chen, Yuhao, Zeng, E. Zhixuan, Gilles, Maximilian, Wong, Alexander

arXiv.org Artificial IntelligenceAug-30-2022

There has been increasing interest in smart factories powered by robotics systems to tackle repetitive, laborious tasks. One impactful yet challenging task in robotics-powered smart factory applications is robotic grasping: using robotic arms to grasp objects autonomously in different settings. Robotic grasping requires a variety of computer vision tasks such as object detection, segmentation, grasp prediction, pick planning, etc. While significant progress has been made in leveraging of machine learning for robotic grasping, particularly with deep learning, a big challenge remains in the need for large-scale, high-quality RGBD datasets that cover a wide diversity of scenarios and permutations. To tackle this big, diverse data problem, we are inspired by the recent rise in the concept of metaverse, which has greatly closed the gap between virtual worlds and the physical world. Metaverses allow us to create digital twins of real-world manufacturing scenarios and to virtually create different scenarios from which large volumes of data can be generated for training models. In this paper, we present MetaGraspNet: a large-scale benchmark dataset for vision-driven robotic grasping via physics-based metaverse synthesis. The proposed dataset contains 100,000 images and 25 different object types and is split into 5 difficulties to evaluate object detection and segmentation model performance in different grasping scenarios. We also propose a new layout-weighted performance metric alongside the dataset for evaluating object detection and segmentation performance in a manner that is more appropriate for robotic grasp applications compared to existing general-purpose performance metrics. Our benchmark dataset is available open-source on Kaggle, with the first phase consisting of detailed object detection, segmentation, layout annotations, and a layout-weighted performance metric script.

benchmark dataset, dataset, scenario, (13 more...)

arXiv.org Artificial Intelligence

2112.14663

Country:

Asia > China > Guangxi Province > Nanning (0.05)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(4 more...)

Genre: Research Report (0.41)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Model-Based Reasoning (0.62)

Add feedback