AITopics

2407.09719

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Virginia > Fairfax County > Reston (0.04)
(5 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report (0.83)

Industry:

Law (1.00)
Materials > Construction Materials (0.54)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)

MIT Technology ReviewJul-11-2024, 13:10:00 GMT

The Download: automating warehouse tasks, and problems with recycling plastics

Before almost any item reaches your door, it traverses the global supply chain on a pallet. More than 2 billion pallets are in circulation in the United States alone, and 400 billion worth of goods are exported on them annually. However, loading boxes onto these pallets is a task stuck in the past: Heavy loads and repetitive movements leave workers at high risk of injury, and in the rare instances when robots are used, they take months to program using handheld computers that have changed little since the 1980s. Jacobi Robotics, a startup spun out of the labs of the University of California, Berkeley, says it can vastly speed up that process with AI. If successful, Jacobi aims to replace the legacy methods customers are currently using to train their bots, whittling down the time it takes to code a paletting process from months to a single day.

artificial intelligence, recycling plastic, warehouse task, (4 more...)

MIT Technology Review

Country: North America > United States > California > Alameda County > Berkeley (0.27)

Industry:

Water & Waste Management > Solid Waste Management (0.73)
Materials (0.56)

Technology: Information Technology > Artificial Intelligence > Robots (0.60)

arXiv.org Artificial IntelligenceJul-11-2024

Three-layer deep learning network random trees for fault detection in chemical production process

Lu, Ming, Gao, Zhen, Zou, Ying, Chen, Zuguo, Li, Pei

With the development of technology, the chemical production process is becoming increasingly complex and large-scale, making fault detection particularly important. However, current detective methods struggle to address the complexities of large-scale production processes. In this paper, we integrate the strengths of deep learning and machine learning technologies, combining the advantages of bidirectional long and short-term memory neural networks, fully connected neural networks, and the extra trees algorithm to propose a novel fault detection model named three-layer deep learning network random trees (TDLN-trees). First, the deep learning component extracts temporal features from industrial data, combining and transforming them into a higher-level data representation. Second, the machine learning component processes and classifies the features extracted in the first step. An experimental analysis based on the Tennessee Eastman process verifies the superiority of the proposed method.

artificial intelligence, machine learning, tdln-tree, (16 more...)

2405.00311

Country:

Asia (0.46)
North America > United States > Tennessee (0.25)

Genre: Research Report (1.00)

Industry:

Materials > Chemicals (0.86)
Energy > Oil & Gas > Downstream (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceJul-11-2024

An Improved Traditional Chinese Evaluation Suite for Foundation Model

Tam, Zhi-Rui, Pai, Ya-Ting, Lee, Yen-Wei, Chen, Jun-Da, Chu, Wei-Min, Cheng, Sega, Shuai, Hong-Han

We present TMMLU+, a new benchmark designed for Traditional Chinese language understanding. TMMLU+ is a multi-choice question-answering dataset with 66 subjects from elementary to professional level. It is six times larger and boasts a more balanced subject distribution than its predecessor, Taiwan Massive Multitask Language Understanding (TMMLU). We also benchmark closed-source models and 26 open-weight Chinese large language models (LLMs) of parameters ranging from 1.8B to 72B on the proposed TMMLU+. Our findings reveal that (1.) Traditional Chinese models still trail behind their Simplified Chinese counterparts, highlighting a need for more focused advancements in LLMs catering to Traditional Chinese. (2.) Current LLMs still fall short of human performance in average scores, indicating a potential need for future research to delve deeper into social science and humanities subjects. (3.) Among all the tokenization compression metrics examined, we identify that only the fertility score uniquely demonstrates strong correlations with our benchmark results. We foresee that TMMLU+ will pinpoint areas for future model improvement, thereby narrowing the gap between machine and human linguistic capabilities and supporting researchers in developing Traditional Chinese LLMs. Our dataset, along with the benchmark source code, is accessible at huggingface.co/datasets/ikala/tmmluplus.

english translation, language model, preprint, (14 more...)

2403.01858

Country:

Asia > Taiwan (0.26)
North America > United States (0.14)
Europe > Spain (0.14)
(6 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Law (1.00)
Government (1.00)
Banking & Finance > Insurance (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Machine Learning and Explainable AI Framework Tailored for Unbalanced Experimental Catalyst Discovery

Semnani, Parastoo, Bogojeski, Mihail, Bley, Florian, Zhang, Zizheng, Wu, Qiong, Kneib, Thomas, Herrmann, Jan, Weisser, Christoph, Patcas, Florina, Müller, Klaus-Robert

The successful application of machine learning (ML) in catalyst design relies on high-quality and diverse data to ensure effective generalization to novel compositions, thereby aiding in catalyst discovery. However, due to complex interactions, catalyst design has long relied on trial-and-error, a costly and labor-intensive process leading to scarce data that is heavily biased towards undesired, low-yield catalysts. Despite the rise of ML in this field, most efforts have not focused on dealing with the challenges presented by such experimental data. To address these challenges, we introduce a robust machine learning and explainable AI (XAI) framework to accurately classify the catalytic yield of various compositions and identify the contributions of individual components. This framework combines a series of ML practices designed to handle the scarcity and imbalance of catalyst data. We apply the framework to classify the yield of various catalyst compositions in oxidative methane coupling, and use it to evaluate the performance of a range of ML models: tree-based models, logistic regression, support vector machines, and neural networks. These experiments demonstrate that the methods used in our framework lead to a significant improvement in the performance of all but one of the evaluated models. Additionally, the decision-making process of each ML model is analyzed by identifying the most important features for predicting catalyst performance using XAI methods. Our analysis found that XAI methods, providing class-aware explanations, such as Layer-wise Relevance Propagation, identified key components that contribute specifically to high-yield catalysts. These findings align with chemical intuition and existing literature, reinforcing their validity. We believe that such insights can assist chemists in the development and identification of novel catalysts with superior performance.

catalyst, feature importance, relevance, (17 more...)

2407.18935

Country:

Europe > Germany > Berlin (0.04)
Europe > Germany > Lower Saxony > Gottingen (0.04)
Europe > Germany > Saarland > Saarbrücken (0.04)
Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Materials > Chemicals > Specialty Chemicals (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates

Claxton, Owen, Malone, Connor, Carson, Helen, Ford, Jason, Bolton, Gabe, Shames, Iman, Milford, Michael

Visual Place Recognition (VPR) systems often have imperfect performance, which affects robot navigation decisions. This research introduces a novel Multi-Layer Perceptron (MLP) integrity monitor for VPR which demonstrates improved performance and generalizability over the previous state-of-the-art SVM approach, removing per-environment training and reducing manual tuning requirements. We test our proposed system in extensive real-world experiments, where we also present two real-time integrity-based VPR verification methods: an instantaneous rejection method for a robot navigating to a goal zone (Experiment 1); and a historical method that takes a best, verified, match from its recent trajectory and uses an odometer to extrapolate forwards to a current position estimate (Experiment 2). Noteworthy results for Experiment 1 include a decrease in aggregate mean along-track goal error from ~9.8m to ~3.1m in missions the robot pursued to completion, and an increase in the aggregate rate of successful mission completion from ~41% to ~55%. Experiment 2 showed a decrease in aggregate mean along-track localization error from ~2.0m to ~0.5m, and an increase in the aggregate precision of localization attempts from ~97% to ~99%. Overall, our results demonstrate the practical usefulness of a VPR integrity monitor in real-world robotics to improve VPR localization and consequent navigation performance.

integrity monitor, place recognition, visual place recognition, (11 more...)

2407.08162

Country:

Oceania > Australia > Queensland > Brisbane (0.04)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
Europe > Switzerland (0.04)

Genre: Research Report > New Finding (0.54)

Industry: Materials > Metals & Mining (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.55)

Quintanilla, Paulina, Fernández, Francisco, Mancilla, Cristobal, Rojas, Matías, Estrada, Mauricio, Navia, Daniel

Digital twin with automatic disturbance detection for real-time optimization of a semi-autogenous grinding (SAG) mill

This work describes the development and validation of a digital twin for a semi-autogenous grinding (SAG) mill controlled by an expert system. The digital twin consists of three modules emulating a closed-loop system: fuzzy logic for the expert control, a state-space model for regulatory control, and a recurrent neural network for the SAG mill process. The model was trained with 68 hours of data and validated with 8 hours of test data. It predicts the mill's behavior within a 2.5-minute horizon with a 30-second sampling time. The disturbance detection evaluates the need for retraining, and the digital twin shows promise for supervising the SAG mill with the expert control system. Future work will focus on integrating this digital twin into real-time optimization strategies with industrial validation.

equation, expert control system, opération, (14 more...)

2407.06216

Country:

North America > United States > New York > New York County > New York City (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.05)
North America > United States > New York > Montgomery County > Amsterdam (0.04)
(3 more...)

Genre: Research Report (0.82)

Industry:

Materials (0.46)
Energy > Renewable (0.36)

Technology:

Information Technology > Architecture > Real Time Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.49)

Yu, Simon Chi Lok, He, Jie, Minervini, Pasquale, Pan, Jeff Z.

Evaluating the Adversarial Robustness of Retrieval-Based In-Context Learning for Large Language Models

With the emergence of large language models, such as LLaMA and OpenAI GPT-3, In-Context Learning (ICL) gained significant attention due to its effectiveness and efficiency. However, ICL is very sensitive to the choice, order, and verbaliser used to encode the demonstrations in the prompt. Retrieval-Augmented ICL methods try to address this problem by leveraging retrievers to extract semantically related examples as demonstrations. While this approach yields more accurate results, its robustness against various types of adversarial attacks, including perturbations on test samples, demonstrations, and retrieved data, remains under-explored. Our study reveals that retrieval-augmented models can enhance robustness against test sample attacks, outperforming vanilla ICL with a 4.87% reduction in Attack Success Rate (ASR); however, they exhibit overconfidence in the demonstrations, leading to a 2% increase in ASR for demonstration attacks. Adversarial training can help improve the robustness of ICL methods to adversarial attacks; however, such a training scheme can be too costly in the context of LLMs. As an alternative, we introduce an effective training-free adversarial defence method, DARD, which enriches the example pool with those attacked samples. We show that DARD yields improvements in performance and robustness, achieving a 15% reduction in ASR over the baselines. Code and data are released to encourage further research: https://github.com/simonucl/adv-retreival-icl

demonstration, robustness, semanticscholar, (15 more...)

2405.15984

Country:

North America > Canada > Newfoundland and Labrador > Newfoundland (0.14)
Europe > Slovenia (0.04)
North America > United States > Nevada (0.04)
(24 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Materials > Metals & Mining (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.94)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Baldwin, Martha, Meisel, Nicholas A., McComb, Christopher

Smooth Like Butter: Evaluating Multi-Lattice Transitions in Property-Augmented Latent Spaces

Additive manufacturing has revolutionized structural optimization by enhancing component strength and reducing material requirements. One approach used to achieve these improvements is the application of multi-lattice structures, where the macro-scale performance relies on the detailed design of mesostructural lattice elements. Many current approaches to designing such structures use data-driven design to generate multi-lattice transition regions, making use of machine learning models that are informed solely by the geometry of the mesostructures. However, it remains unclear if the integration of mechanical properties into the dataset used to train such machine learning models would be beneficial beyond using geometric data alone. To address this issue, this work implements and evaluates a hybrid geometry/property Variational Autoencoder (VAE) for generating multi-lattice transition regions. In our study, we found that hybrid VAEs demonstrate enhanced performance in maintaining stiffness continuity through transition regions, indicating their suitability for design tasks requiring smooth mechanical properties.

lattice structure, transition region, unit cell, (12 more...)

doi: 10.1089/3dp.2023.0316

2407.08074

Country:

North America > United States > Pennsylvania > Centre County > University Park (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Genre:

Research Report > New Finding (0.67)
Research Report > Experimental Study (0.47)

Industry: Materials (0.56)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

arXiv.org Artificial IntelligenceJul-9-2024

Induction Heads as an Essential Mechanism for Pattern Matching in In-context Learning

Crosbie, J., Shutova, E.

As Large language models have shown a remarkable a significant milestone in this area, Elhage et al. ability to learn and perform complex tasks through (2021) demonstrated the existence of induction in-context learning (ICL) (Brown et al., 2020; Touvron heads in Transformer LMs. These heads scan the et al., 2023b). In ICL, the model receives context for previous instances of the current token a demonstration context and a query question as using a prefix matching mechanism, which identifies a prompt for prediction. Unlike supervised learning, if and where a token has appeared before. ICL utilises the pretrained model's capabilities If a matching token is found, the head employs to recognise and replicate patterns within the a copying mechanism to increase the probability demonstration context, thereby enabling accurate of the subsequent token, facilitating exact or approximate predictions for the query without the use of gradient repetition of sequences and embodying updates.

foo, mur, res, (17 more...)

2407.07011

Country:

Asia > Singapore (0.04)
North America > United States > New York (0.04)
North America > Canada > Ontario > Toronto (0.04)
(20 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law (1.00)
Government (1.00)
Energy > Renewable > Biofuel > Ethanol (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)