AITopics | Parts Supplier

Collaborating Authors

Parts Supplier

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Ou, Yixin, Yao, Yunzhi, Zhang, Ningyu, Jin, Hui, Sun, Jiacheng, Deng, Shumin, Li, Zhenguo, Chen, Huajun

arXiv.org Artificial IntelligenceFeb-16-2025

Despite exceptional capabilities in knowledge-intensive tasks, Large Language Models (LLMs) face a critical gap in understanding how they internalize new knowledge, particularly how to structurally embed acquired knowledge in their neural computations. We address this issue through the lens of knowledge circuit evolution, identifying computational subgraphs that facilitate knowledge storage and processing. Our systematic analysis of circuit evolution throughout continual pre-training reveals several key findings: (1) the acquisition of new knowledge is influenced by its relevance to pre-existing knowledge; (2) the evolution of knowledge circuits exhibits a distinct phase shift from formation to optimization; (3) the evolution of knowledge circuits follows a deep-to-shallow pattern. These insights not only advance our theoretical understanding of the mechanisms of new knowledge acquisition in LLMs, but also provide potential implications for improving continual pre-training strategies to enhance model performance. Code and data will be available at https://github.com/zjunlp/DynamicKnowledgeCircuits.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2502.11196

Country:

North America > United States > Wisconsin (1.00)
North America > United States > Virginia (1.00)
North America > United States > Tennessee (1.00)
(21 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Semiconductors & Electronics (1.00)
Media (1.00)
Leisure & Entertainment (1.00)
(16 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Exploring Visual Embedding Spaces Induced by Vision Transformers for Online Auto Parts Marketplaces

Armijo, Cameron, Rivas, Pablo

arXiv.org Artificial IntelligenceFeb-8-2025

This study examines the capabilities of the Vision Transformer (ViT) model in generating visual embeddings for images of auto parts sourced from online marketplaces, such as Craigslist and OfferUp. By focusing exclusively on single-modality data, the analysis evaluates ViT's potential for detecting patterns indicative of illicit activities. The workflow involves extracting high-dimensional embeddings from images, applying dimensionality reduction techniques like Uniform Manifold Approximation and Projection (UMAP) to visualize the embedding space, and using K-Means clustering to categorize similar items. Representative posts nearest to each cluster centroid provide insights into the composition and characteristics of the clusters. While the results highlight the strengths of ViT in isolating visual patterns, challenges such as overlapping clusters and outliers underscore the limitations of single-modal approaches in this domain. This work contributes to understanding the role of Vision Transformers in analyzing online marketplaces and offers a foundation for future advancements in detecting fraudulent or illegal activities.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2502.05756

Country: North America > United States (0.46)

Genre: Research Report > New Finding (1.00)

Industry: Automobiles & Trucks > Parts Supplier (0.63)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

STEER: Simple Temporal Regularization For Neural ODEs Arnab Ghosh Harkirat Singh Behl Emilien Dupont University of Oxford

Neural Information Processing SystemsJan-27-2025, 11:09:18 GMT

Training Neural Ordinary Differential Equations (ODEs) is often computationally expensive. Indeed, computing the forward pass of such models involves solving an ODE which can become arbitrarily complex during training. Recent works have shown that regularizing the dynamics of the ODE can partially alleviate this. In this paper we propose a new regularization technique: randomly sampling the end time of the ODE during training. The proposed regularization is simple to implement, has negligible overhead and is effective across a wide variety of tasks. Further, the technique is orthogonal to several other methods proposed to regularize the dynamics of ODEs and as such can be used in conjunction with them. We show through experiments on normalizing flows, time series models and image recognition that the proposed regularization can significantly decrease training time and even improve performance over baseline models.

artificial intelligence, arxiv preprint arxiv, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.51)
North America (0.46)

Industry:

Materials > Chemicals (0.41)
Industrial Conglomerates (0.41)
Automobiles & Trucks > Parts Supplier (0.41)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Vehicle Suspension Recommendation System: Multi-Fidelity Neural Network-based Mechanism Design Optimization

Lee, Sumin, Kang, Namwoo

arXiv.org Artificial IntelligenceOct-3-2024

Mechanisms are designed to perform functions in various fields. Often, there is no unique mechanism that performs a well-defined function. For example, vehicle suspensions are designed to improve driving performance and ride comfort, but different types are available depending on the environment. This variability in design makes performance comparison difficult. Additionally, the traditional design process is multi-step, gradually reducing the number of design candidates while performing costly analyses to meet target performance. Recently, AI models have been used to reduce the computational cost of FEA. However, there are limitations in data availability and different analysis environments, especially when transitioning from low-fidelity to high-fidelity analysis. In this paper, we propose a multi-fidelity design framework aimed at recommending optimal types and designs of mechanical mechanisms. As an application, vehicle suspension systems were selected, and several types were defined. For each type, mechanism parameters were generated and converted into 3D CAD models, followed by low-fidelity rigid body dynamic analysis under driving conditions. To effectively build a deep learning-based multi-fidelity surrogate model, the results of the low-fidelity analysis were analyzed using DBSCAN and sampled at 5% for high-cost flexible body dynamic analysis. After training the multi-fidelity model, a multi-objective optimization problem was formulated for the performance metrics of each suspension type. Finally, we recommend the optimal type and design based on the input to optimize ride comfort-related performance metrics. To validate the proposed methodology, we extracted basic design rules of Pareto solutions using data mining techniques. We also verified the effectiveness and applicability by comparing the results with those obtained from a conventional deep learning-based design process.

artificial intelligence, machine learning, performance metric, (18 more...)

arXiv.org Artificial Intelligence

2410.03045

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Industry:

Automobiles & Trucks > Parts Supplier (0.50)
Materials > Metals & Mining (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep Learning-Based Prediction of Suspension Dynamics Performance in Multi-Axle Vehicles

Lin, Kai Chun, Lin, Bo-Yi

arXiv.org Artificial IntelligenceOct-3-2024

This paper presents a deep learning-based framework for predicting the dynamic performance of suspension systems in multi-axle vehicles, emphasizing the integration of machine learning with traditional vehicle dynamics modeling. A Multi-Task Deep Belief Network Deep Neural Network (MTL-DBN-DNN) was developed to capture the relationships between key vehicle parameters and suspension performance metrics. The model was trained on data generated from numerical simulations and demonstrated superior prediction accuracy compared to conventional DNN models. A comprehensive sensitivity analysis was conducted to assess the impact of various vehicle and suspension parameters on dynamic suspension performance. Additionally, the Suspension Dynamic Performance Index (SDPI) was introduced as a holistic measure to quantify overall suspension performance, accounting for the combined effects of multiple parameters.

artificial intelligence, machine learning, vehicle, (13 more...)

arXiv.org Artificial Intelligence

2410.02566

Genre: Research Report > New Finding (0.69)

Industry: Automobiles & Trucks > Parts Supplier (0.39)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Bus group Ryobi sets up forex fund to survive population fall

The Japan TimesSep-3-2024, 03:56:00 GMT

A unit of Ryobi Holdings, a bus route operator in rural Japan, is setting up an artificial intelligence-powered hedge fund specializing in forex to survive a sharp drop in the country's population outside Tokyo. Ryobi has enlisted Kyosuke Suzuki, a former currency trader at Societe Generale, to set up the fund by the end of the year. The hedge fund, part of Ryobi's subsidiary, Ryobi Systems, will begin investing around the beginning of 2025. Ryobi was founded 114 years ago as a short railway connecting towns and villages in Okayama Prefecture. Today, it is at the front line of the country's demographic crisis, grappling with aging customers and shrinking communities.

artificial intelligence, ryobi, survive population fall, (4 more...)

The Japan Times

Country:

Asia > Japan > Honshū > Chūgoku > Okayama Prefecture > Okayama (0.38)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.31)

Industry:

Banking & Finance > Trading (1.00)
Automobiles & Trucks > Parts Supplier (1.00)

Technology: Information Technology > Artificial Intelligence (0.67)

Add feedback

Japan's Nidec lifts annual profit outlook after first quarter

The Japan TimesJul-23-2024, 12:48:00 GMT

Japanese electric motor maker Nidec raised its full-year operating profit forecast by 4.3% on Tuesday off the back of a recovery in demand for hard drive motors amid efforts to raise its profitability and a weaker yen. The company raised its expectations for operating profit in the financial year to the end of March 2025 to 240 billion ( 1.53 billion), versus an earlier forecast of 230 billion and an average analyst view of 246.9 billion. Operating profit for the April-June quarter totaled 60.3 billion, largely in line with 60.2 billion in the same period a year earlier and compared with the average estimate of 58.4 billion in a survey of four analysts by LSEG. "Nidec expects rapid demand expansion of power generators which are essential to data centers," the company said in a statement. It said it saw demand for hard drive disk motors recovering and that for water-cooling modules for artificial intelligence expanding rapidly. The company said last month its business in water-cooling modules for generative AI data centers could expand to 1 trillion in sales in the future.

artificial intelligence, nidec lift annual profit outlook, water-cooling module, (4 more...)

The Japan Times

Country: Asia > Japan (0.40)

Industry:

Automobiles & Trucks > Parts Supplier (0.94)
Information Technology > Services (0.84)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning

Singh, Nishesh, Ramesh, Sidharth, Shankar, Abhishek, Duttagupta, Jyotishka, D'Souza, Leander Stephen, Singh, Sanjay

arXiv.org Artificial IntelligenceJul-4-2024

Planetary exploration requires traversal in environments with rugged terrains. In addition, Mars rovers and other planetary exploration robots often carry sensitive scientific experiments and components onboard, which must be protected from mechanical harm. This paper deals with an active suspension system focused on chassis stabilisation and an efficient traversal method while encountering unavoidable obstacles. Soft Actor-Critic (SAC) was applied along with Proportional Integral Derivative (PID) control to stabilise the chassis and traverse large obstacles at low speeds. The model uses the rover's distance from surrounding obstacles, the height of the obstacle, and the chassis' orientation to actuate the control links of the suspension accurately. Simulations carried out in the Gazebo environment are used to validate the proposed active system.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2406.18899

Country: North America > United States (1.00)

Genre: Research Report (0.64)

Industry:

Automobiles & Trucks > Parts Supplier (0.50)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Learning to Model and Plan for Wheeled Mobility on Vertically Challenging Terrain

Datar, Aniket, Pan, Chenhui, Xiao, Xuesu

arXiv.org Artificial IntelligenceSep-23-2023

Most autonomous navigation systems assume wheeled robots are rigid bodies and their 2D planar workspaces can be divided into free spaces and obstacles. However, recent wheeled mobility research, showing that wheeled platforms have the potential of moving over vertically challenging terrain (e.g., rocky outcroppings, rugged boulders, and fallen tree trunks), invalidate both assumptions. Navigating off-road vehicle chassis with long suspension travel and low tire pressure in places where the boundary between obstacles and free spaces is blurry requires precise 3D modeling of the interaction between the chassis and the terrain, which is complicated by suspension and tire deformation, varying tire-terrain friction, vehicle weight distribution and momentum, etc. In this paper, we present a learning approach to model wheeled mobility, i.e., in terms of vehicle-terrain forward dynamics, and plan feasible, stable, and efficient motion to drive over vertically challenging terrain without rolling over or getting stuck. We present physical experiments on two wheeled robots and show that planning using our learned model can achieve up to 60% improvement in navigation success rate and 46% reduction in unstable chassis roll and pitch angles.

artificial intelligence, machine learning, robot, (15 more...)

arXiv.org Artificial Intelligence

2306.11611

Genre: Research Report (0.50)

Industry: Automobiles & Trucks > Parts Supplier (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.93)
Information Technology > Artificial Intelligence > Robots > Locomotion (0.88)

Add feedback

Panel Data Nowcasting: The Case of Price-Earnings Ratios

Babii, Andrii, Ball, Ryan T., Ghysels, Eric, Striaukas, Jonas

arXiv.org Machine LearningJul-5-2023

The paper uses structured machine learning regressions for nowcasting with panel data consisting of series sampled at different frequencies. Motivated by the problem of predicting corporate earnings for a large cross-section of firms with macroeconomic, financial, and news time series sampled at different frequencies, we focus on the sparse-group LASSO regularization which can take advantage of the mixed frequency time series panel data structures. Our empirical results show the superior performance of our machine learning panel data regression models over analysts' predictions, forecast combinations, firm-specific time series regression models, and standard machine learning methods.

artificial intelligence, machine learning, regression, (17 more...)

arXiv.org Machine Learning

2307.02673

Country: North America > United States > North Carolina (0.28)

Genre: Research Report > New Finding (1.00)

Industry:

Semiconductors & Electronics (1.00)
Information Technology (1.00)
Industrial Conglomerates (1.00)
(10 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

Add feedback