AITopics

2402.08708

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.05)
North America > United States > Colorado > El Paso County > Colorado Springs (0.04)
Europe > Denmark > Capital Region > Kongens Lyngby (0.04)

Genre: Research Report (0.64)

Industry:

Energy (0.93)
Materials > Chemicals (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)

Zheng, Junhao, Qiu, Shengjie, Ma, Qianli

Concept-1K: A Novel Benchmark for Instance Incremental Learning

arXiv.org Artificial IntelligenceFeb-13-2024

Incremental learning (IL) is essential to realize the human-level intelligence in the neural network. However, existing IL scenarios and datasets are unqualified for assessing forgetting in PLMs, giving an illusion that PLMs do not suffer from catastrophic forgetting. To this end, we propose a challenging IL scenario called instance-incremental learning (IIL) and a novel dataset called Concept-1K, which supports an order of magnitude larger IL steps. Based on the experiments on Concept-1K, we reveal that billion-parameter PLMs still suffer from catastrophic forgetting, and the forgetting is affected by both model scale, pretraining, and buffer size. Furthermore, existing IL methods and a popular finetuning technique, LoRA, fail to achieve satisfactory performance. Our study provides a novel scenario for future studies to explore the catastrophic forgetting of PLMs and encourage more powerful techniques to be designed for alleviating the forgetting in PLMs. The data, code and scripts are publicly available at https://github.com/zzz47zzz/pretrained-lm-for-incremental-learning.

accuracy, buffer size, concept-1k, (12 more...)

2402.08526

Country:

North America > United States > California (0.04)
North America > Canada > Ontario > Toronto (0.04)
Europe > United Kingdom (0.04)
(5 more...)

Genre:

Instructional Material (1.00)
Research Report > New Finding (0.93)

Industry:

Media > Television (1.00)
Media > Film (1.00)
Materials (1.00)
(21 more...)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Cartesian atomic cluster expansion for machine learning interatomic potentials

Cheng, Bingqing

Machine learning interatomic potentials are revolutionizing large-scale, accurate atomistic modelling in material science and chemistry. These potentials often use atomic cluster expansion or equivariant message passing with spherical harmonics as basis functions. However, the dependence on Clebsch-Gordan coefficients for maintaining rotational symmetry leads to computational inefficiencies and redundancies. We propose an alternative: a Cartesian-coordinates-based atomic density expansion. This approach provides a complete description of atomic environments while maintaining interaction body orders. Additionally, we integrate low-dimensional embeddings of various chemical elements and inter-atomic message passing. The resulting potential, named Cartesian Atomic Cluster Expansion (CACE), exhibits good accuracy, stability, and generalizability. We validate its performance in diverse systems, including bulk water, small molecules, and 25-element high-entropy alloys.

artificial intelligence, cace, machine learning, (17 more...)

2402.07472

Country:

Europe (0.28)
North America > United States > California > Alameda County > Berkeley (0.14)

Genre: Research Report (0.40)

Industry:

Materials > Chemicals (0.50)
Energy > Oil & Gas (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Fischer, Georg K. J., Bergau, Max, Gómez-Rosal, D. Adriana, Wachaja, Andreas, Gräter, Johannes, Odenweller, Matthias, Piechottka, Uwe, Hoeflinger, Fabian, Gosala, Nikhil, Wetzel, Niklas, Büscher, Daniel, Valada, Abhinav, Burgard, Wolfram

Evaluation of a Smart Mobile Robotic System for Industrial Plant Inspection and Supervision

Automated and autonomous industrial inspection is a longstanding research field, driven by the necessity to enhance safety and efficiency within industrial settings. In addressing this need, we introduce an autonomously navigating robotic system designed for comprehensive plant inspection. This innovative system comprises a robotic platform equipped with a diverse array of sensors integrated to facilitate the detection of various process and infrastructure parameters. These sensors encompass optical (LiDAR, Stereo, UV/IR/RGB cameras), olfactory (electronic nose), and acoustic (microphone array) capabilities, enabling the identification of factors such as methane leaks, flow rates, and infrastructural anomalies. The proposed system underwent individual evaluation at a wastewater treatment site within a chemical plant, providing a practical and challenging environment for testing. The evaluation process encompassed key aspects such as object detection, 3D localization, and path planning. Furthermore, specific evaluations were conducted for optical methane leak detection and localization, as well as acoustic assessments focusing on pump equipment and gas leak localization.

artificial intelligence, machine learning, reinforcement learning, (21 more...)

2402.07691

Country:

Europe > Germany (0.29)
North America > United States (0.14)

Genre:

Overview (0.68)
Research Report (0.50)

Industry:

Materials > Chemicals (1.00)
Energy > Power Industry > Utilities > Nuclear (0.46)
Energy > Oil & Gas > Midstream (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Zhong, Victor, Misra, Dipendra, Yuan, Xingdi, Côté, Marc-Alexandre

Policy Improvement using Language Feedback Models

We introduce Language Feedback Models (LFMs) that identify desirable behaviour - actions that help achieve tasks specified in the instruction - for imitation learning in instruction following. To train LFMs, we obtain feedback from Large Language Models (LLMs) on visual trajectories verbalized to language descriptions. First, by using LFMs to identify desirable behaviour to imitate, we improve in task-completion rate over strong behavioural cloning baselines on three distinct language grounding environments (Touchdown, ScienceWorld, and ALFWorld). Second, LFMs outperform using LLMs as experts to directly predict actions, when controlling for the number of LLM output tokens. Third, LFMs generalize to unseen environments, improving task-completion rate by 3.5-12.0% through one round of adaptation. Finally, LFM can be modified to provide human-interpretable feedback without performance loss, allowing human verification of desirable behaviour for imitation learning.

feedback model, instruction, llm, (13 more...)

2402.07876

Genre: Research Report (0.64)

Industry:

Transportation (0.46)
Materials (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

Wang, Haoyu, Ma, Guozheng, Meng, Ziqiao, Qin, Zeyu, Shen, Li, Zhang, Zhong, Wu, Bingzhe, Liu, Liu, Bian, Yatao, Xu, Tingyang, Wang, Xueqian, Zhao, Peilin

Self-alignment is an effective way to reduce the cost of human annotation while ensuring promising model capability. However, most current methods complete the data collection and training steps in a single round, which may overlook the continuously improving ability of self-aligned models. This gives rise to a key query: What if we do multi-time bootstrapping self-alignment? Does this strategy enhance model performance or lead to rapid degradation? In this paper, our pioneering exploration delves into the impact of bootstrapping self-alignment on large language models. Our findings reveal that bootstrapping self-alignment markedly surpasses the single-round approach, by guaranteeing data diversity from in-context learning. To further exploit the capabilities of bootstrapping, we investigate and adjust the training order of data, which yields improved performance of the model. Drawing on these findings, we propose Step-On-Feet Tuning (SOFT) which leverages model's continuously enhanced few-shot ability to boost zero or one-shot performance. Based on easy-to-hard training recipe, we propose SOFT+ which further boost self-alignment's performance. Our experiments demonstrate the efficiency of SOFT (SOFT+) across various classification and generation tasks, highlighting the potential of bootstrapping self-alignment on continually enhancing model alignment performance.

iclexample, internal thought, reliable assistant, (12 more...)

2402.0761

Country:

North America > Canada (0.14)
Asia > China (0.05)
Europe > Spain (0.04)
(9 more...)

Genre:

Personal (1.00)
Research Report > New Finding (0.87)

Industry:

Leisure & Entertainment (1.00)
Law (1.00)
Health & Medicine > Consumer Health (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Hweij, Zaina Abu, Liang, Florence, Zhang, Sophie

Noninvasive Acute Compartment Syndrome Diagnosis Using Random Forest Machine Learning

Acute compartment syndrome (ACS) is an orthopedic emergency, caused by elevated pressure within a muscle compartment, that leads to permanent tissue damage and eventually death. Diagnosis of ACS relies heavily on patient-reported symptoms, a method that is clinically unreliable and often supplemented with invasive intracompartmental pressure measurements that can malfunction in motion settings. This study proposes an objective and noninvasive diagnostic for ACS. The device detects ACS through a random forest machine learning model that uses surrogate pressure readings from force-sensitive resistors (FSRs) placed on the skin. To validate the diagnostic, a data set containing FSR measurements and the corresponding simulated intracompartmental pressure was created for motion and motionless scenarios. The diagnostic achieved up to 98% accuracy. The device excelled in key performance metrics, including sensitivity and specificity, with a statistically insignificant performance difference in motion present cases. Manufactured for 73 USD, our device may be a cost-effective solution. These results demonstrate the potential of noninvasive ACS diagnostics to meet clinical accuracy standards in real world settings.

accuracy, compartment syndrome, syndrome, (13 more...)

2401.10386

Country:

North America > United States > Washington > Clark County > Camas (0.04)
North America > United States > California > Orange County > Irvine (0.04)
Europe > Croatia > Primorje-Gorski Kotar County > Rijeka (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Materials > Paper & Forest Products > Forest Products (0.61)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Chen, Hao, Flores, Gonzalo E. Constante, Li, Can

Physics-Informed Neural Networks with Hard Linear Equality Constraints

arXiv.org Artificial IntelligenceFeb-11-2024

These equations are derived from fundamental principles and mechanistic laws, such as the physical laws in thermodynamics and transport phenomena. High-fidelity models with these equations can serve as digital representations of the physical systems in the real world. However, the physically accurate representation is accompanied by a heightened mathematical complexity that elevates the computational expense of simulation. This impedes the use of high-fidelity physical models especially in applications where it is essential to simulate a system repeatedly in a timely manner. To efficiently generate simulation outputs, data-driven approaches have sought to substitute a high-fidelity physical model with a surrogate model (Misener and Biegler, 2023; Bhosekar and Ierapetritou, 2018; Bradley et al., 2022; Williams and Cremaschi, 2021), A surrogate model stands for a reducedorder model that aims for a computationally efficient approximation at the cost of a certain level of accuracy. This approach provides a more practical means of inferring a system's responses under a great variety of conditions.

artificial intelligence, constraint, machine learning, (21 more...)

2402.07251

Genre: Research Report (0.82)

Industry:

Materials > Chemicals > Commodity Chemicals > Petrochemicals (1.00)
Energy > Oil & Gas (1.00)
Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Kuang, Taojie, Liu, Pengfei, Ren, Zhixiang

The Impact of Domain Knowledge and Multi-Modality on Intelligent Molecular Property Prediction: A Systematic Survey

arXiv.org Artificial IntelligenceFeb-11-2024

The precise prediction of molecular properties is essential for advancements in drug development, particularly in virtual screening and compound optimization. The recent introduction of numerous deep learning-based methods has shown remarkable potential in enhancing molecular property prediction (MPP), especially improving accuracy and insights into molecular structures. Yet, two critical questions arise: does the integration of domain knowledge augment the accuracy of molecular property prediction and does employing multi-modal data fusion yield more precise results than unique data source methods? To explore these matters, we comprehensively review and quantitatively analyze recent deep learning methods based on various benchmarks. We discover that integrating molecular information will improve both MPP regression and classification tasks by upto 3.98% and 1.72%, respectively. We also discover that the utilizing 3-dimensional information with 1-dimensional and 2-dimensional information simultaneously can substantially enhance MPP upto 4.2%. The two consolidated insights offer crucial guidance for future advancements in drug discovery.

molecular property prediction, property prediction, representation, (14 more...)

2402.07249

Country:

Asia > China (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Materials > Chemicals (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

VanGessel, Francis G., Perry, Efrem, Mohan, Salil, Barham, Oliver M., Cavolowsky, Mark

NLP for Knowledge Discovery and Information Extraction from Energetics Corpora

arXiv.org Artificial IntelligenceFeb-10-2024

The study of energetics necessarily involves numerous scientific domains, spanning shock physics and detonation science, fluid dynamics, material science, thermodynamics, and chemical synthesis. The plethora of sub-disciplines of math, physics, chemistry, and engineering pose a challenge to practitioners who would wish to amass an expertise of energetics. Furthermore, maintaining awareness of advancements in energetics research is complicated by the exponential rate at which new research is published across scientific disciplines, including energetics. Thus, the development of automated and intelligent approaches for extracting knowledge from papers, reports, textbooks, and patents related to energetics could aid researchers and accelerate progress in energetics science. Natural Language Processing (NLP) is a sub-field of linguistics, computer science, and Machine Learning (ML) involving the interactions between computers and human (natural) languages. NLP techniques are used to analyze and generate human language, allowing computers to read, interpret, and understand text and speech. In the context of energetics research, NLP can be used to analyze large volumes of textual data, such as scientific papers, technical reports, and patents, in order to extract relevant information about the concepts that underlie and explain energetics phenomenon. Furthermore, NLP can enable natural language understanding that could be further applied to text mining journal articles and performing numerous natural language tasks such as classification, summarization, and recommendation. Overall, the use of NLP in energetics research has the potential to enhance our understanding of energetic materials and phenomenon, and assist in the development novel propellants, explosives, and pyrotechnics.

dataset, language model, public release, (15 more...)

2402.06964

Country:

North America > United States > Maryland (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report > New Finding (0.46)

Industry: Materials > Chemicals (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(4 more...)