AITopics | Problem-Independent Architectures

Collaborating Authors

Problem-Independent Architectures

News Overviews Instructional Materials AI-Alerts Classics

Towards Neural Architecture Search for Transfer Learning in 6G Networks

Orucu, Adam, Moradi, Farnaz, Ebrahimi, Masoumeh, Johnsson, Andreas

arXiv.org Artificial IntelligenceJun-4-2024

Abstract--The future 6G network is envisioned to be AI-native, and as such, ML models will be pervasive in support of optimizing performance, reducing energy consumption, and in coping with increasing complexity and heterogeneity. A key challenge is automating the process of finding optimal model architectures satisfying stringent requirements stemming from varying tasks, dynamicity and available resources in the infrastructure and deployment positions. In this paper, we describe and review the state-of-the-art in Neural Architecture Search and Transfer Learning and their applicability in networking. Further, we identify open research challenges and set directions with a specific focus on three main requirements with elements unique to the future network, namely combining NAS and TL, multi-objective search, and tabular data. Artificial Intelligence (AI) and Machine Learning (ML) are technologies which are envisioned to have a prominent Transfer Learning (TL) is a key technology that can play a role in the future AI-native 6G network.

architecture, architecture search, search space, (13 more...)

arXiv.org Artificial Intelligence

2406.02333

Country:

Europe > Sweden > Uppsala County > Uppsala (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)

Genre: Overview (0.89)

Industry: Energy (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.83)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.64)

Add feedback

Generating 3D Terrain with 2D Cellular Automata

Fachada, Nuno, Rodrigues, António R., de Andrade, Diogo, Lopes, Phil

arXiv.org Artificial IntelligenceJun-1-2024

This paper presents an initial exploration on the use of 2D cellular automata (CA) for generating 3D terrains through a simple yet effective additive approach. By experimenting with multiple CA transition rules, this preliminary investigation yielded aesthetically interesting landscapes, hinting at the technique's potential applicability for real-time terrain generation in games.

heightmap, landscape, terrain, (11 more...)

arXiv.org Artificial Intelligence

2406.00443

Country:

North America > United States > New York > New York County > New York City (0.05)
Europe > Portugal > Lisbon > Lisbon (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games > Computer Games (0.47)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.73)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Add feedback

Causal-Aware Graph Neural Architecture Search under Distribution Shifts

Li, Peiwen, Wang, Xin, Zhang, Zeyang, Qin, Yijian, Zhang, Ziwei, Wang, Jialong, Li, Yang, Zhu, Wenwu

arXiv.org Artificial IntelligenceMay-26-2024

Graph neural architecture search (Graph NAS) has emerged as a promising approach for autonomously designing graph neural network architectures by leveraging the correlations between graphs and architectures. However, the existing methods fail to generalize under distribution shifts that are ubiquitous in real-world graph scenarios, mainly because the graph-architecture correlations they exploit might be spurious and varying across distributions. In this paper, we propose to handle the distribution shifts in the graph architecture search process by discovering and exploiting the causal relationship between graphs and architectures to search for the optimal architectures that can generalize under distribution shifts. The problem remains unexplored with the following critical challenges: 1) how to discover the causal graph-architecture relationship that has stable predictive abilities across distributions, 2) how to handle distribution shifts with the discovered causal graph-architecture relationship to search the generalized graph architectures. To address these challenges, we propose a novel approach, Causal-aware Graph Neural Architecture Search (CARNAS), which is able to capture the causal graph-architecture relationship during the architecture search process and discover the generalized graph architecture under distribution shifts. Specifically, we propose Disentangled Causal Subgraph Identification to capture the causal subgraphs that have stable prediction abilities across distributions. Then, we propose Graph Embedding Intervention to intervene on causal subgraphs within the latent space, ensuring that these subgraphs encapsulate essential features for prediction while excluding non-causal elements. Additionally, we propose Invariant Architecture Customization to reinforce the causal invariant nature of the causal subgraphs, which are utilized to tailor generalized graph architectures. Extensive experiments on synthetic and real-world datasets demonstrate that our proposed CARNAS achieves advanced out-of-distribution generalization ability by discovering the causal relationship between graphs and architectures during the search process.

architecture, dataset, distribution shift, (14 more...)

arXiv.org Artificial Intelligence

2405.16489

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Genre: Research Report > Promising Solution (0.54)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)
Health & Medicine > Therapeutic Area > Immunology (0.46)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Misaka: Interactive Swarm Testbed for Smart Grid Distributed Algorithm Test and Evaluation

Zhang, Tingliang, Zhong, Haiwang, Tan, Zhenfei, Yan, Xinfei

arXiv.org Artificial IntelligenceApr-25-2024

In this paper, we present Misaka, a visualized swarm testbed for smart grid algorithm evaluation, also an extendable open-source open-hardware platform for developing tabletop tangible swarm interfaces. The platform consists of a collection of custom-designed 3 omni-directional wheels robots each 10 cm in diameter, high accuracy localization through a microdot pattern overlaid on top of the activity sheets, and a software framework for application development and control, while remaining affordable (per unit cost about 30 USD at the prototype stage). We illustrate the potential of tabletop swarm user interfaces through a set of smart grid algorithm application scenarios developed with Misaka.

algorithm, interface, misaka, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ICPSAsia48933.2020.9208421

2404.17125

Country: Asia > China > Beijing > Beijing (0.05)

Genre: Research Report (0.40)

Industry: Energy > Power Industry (1.00)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.41)

Add feedback

Implementation and Evaluation of a Gradient Descent-Trained Defensible Blackboard Architecture System

Milbrath, Jordan, Rivard, Jonathan, Straub, Jeremy

arXiv.org Artificial IntelligenceApr-17-2024

A variety of forms of artificial intelligence systems have been developed. Two well-known techniques are neural networks and rule-fact expert systems. The former can be trained from presented data while the latter is typically developed by human domain experts. A combined implementation that uses gradient descent to train a rule-fact expert system has been previously proposed. A related system type, the Blackboard Architecture, adds an actualization capability to expert systems. This paper proposes and evaluates the incorporation of a defensible-style gradient descent training capability into the Blackboard Architecture. It also introduces the use of activation functions for defensible artificial intelligence systems and implements and evaluates a new best path-based training algorithm.

expert system, neural network, training iteration, (14 more...)

arXiv.org Artificial Intelligence

2404.11714

Country:

Europe > United Kingdom > England > West Midlands > Coventry (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > North Dakota > Cass County > Fargo (0.04)
(6 more...)

Genre: Research Report (0.82)

Industry:

Health & Medicine (1.00)
Education (0.93)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Blackboard Systems (1.00)
(2 more...)

Add feedback

Tensor-Networks-based Learning of Probabilistic Cellular Automata Dynamics

Casagrande, Heitor P., Xing, Bo, Munro, William J., Guo, Chu, Poletti, Dario

arXiv.org Artificial IntelligenceApr-17-2024

Algorithms developed to solve many-body quantum problems, like tensor networks, can turn into powerful quantum-inspired tools to tackle problems in the classical domain. In this work, we focus on matrix product operators, a prominent numerical technique to study many-body quantum systems, especially in one dimension. It has been previously shown that such a tool can be used for classification, learning of deterministic sequence-to-sequence processes and of generic quantum processes. We further develop a matrix product operator algorithm to learn probabilistic sequence-to-sequence processes and apply this algorithm to probabilistic cellular automata. This new approach can accurately learn probabilistic cellular automata processes in different conditions, even when the process is a probabilistic mixture of different chaotic rules. In addition, we find that the ability to learn these dynamics is a function of the bit-wise difference between the rules and whether one is much more likely than the other.

cellular automata, probability, training sweep, (15 more...)

arXiv.org Artificial Intelligence

2404.11768

Country:

Asia > Singapore (0.06)
Asia > Japan > Kyūshū & Okinawa > Okinawa (0.04)
Asia > Middle East > Israel (0.04)
Asia > China (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.86)

Add feedback

Enhancing Robot Navigation Efficiency Using Cellular Automata with Active Cells

Alzoubi, Saleem, Miraz, Mahdi H.

arXiv.org Artificial IntelligenceApr-2-2024

Enhancing robot navigation efficiency is a crucial objective in modern robotics. Robots relying on external navigation systems are often susceptible to electromagnetic interference (EMI) and encounter environmental disturbances, resulting in orientation errors within their surroundings. Therefore, the study employed an internal navigation system to enhance robot navigation efficacy under interference conditions, based on the analysis of the internal parameters and the external signals. This article presents details of the robot's autonomous operation, which allows for setting the robot's trajectory using an embedded map. The robot's navigation process involves counting the number of wheel revolutions as well as adjusting wheel orientation after each straight path section. In this article, an autonomous robot navigation system has been presented that leverages an embedded control navigation map utilising cellular automata with active cells which can effectively navigate in an environment containing various types of obstacles. By analysing the neighbouring cells of the active cell, the cellular environment determines which cell should become active during the robot's next movement step. This approach ensures the robot's independence from external control inputs. Furthermore, the accuracy and speed of the robot's movement have been further enhanced using a hexagonal mosaic for navigation surface mapping. This concept of utilising on cellular automata with active cells has been extended to the navigation of a group of robots on a shared navigation surface, taking into account the intersections of the robots' trajectories over time. To achieve this, a distance control module has been used that records the travelled trajectories in terms of wheel turns and revolutions.

navigation, neighbourhood, robot, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.33166/AETiC.2024.02.005

2404.01885

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
Europe > Germany (0.04)
Asia > Malaysia (0.04)
(16 more...)

Genre: Research Report (0.82)

Industry: Information Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.92)

Add feedback

Latent Neural Cellular Automata for Resource-Efficient Image Restoration

Menta, Andrea, Archetti, Alberto, Matteucci, Matteo

arXiv.org Artificial IntelligenceMar-22-2024

Neural cellular automata represent an evolution of the traditional cellular automata model, enhanced by the integration of a deep learning-based transition function. This shift from a manual to a data-driven approach significantly increases the adaptability of these models, enabling their application in diverse domains, including content generation and artificial life. However, their widespread application has been hampered by significant computational requirements. In this work, we introduce the Latent Neural Cellular Automata (LNCA) model, a novel architecture designed to address the resource limitations of neural cellular automata. Our approach shifts the computation from the conventional input space to a specially designed latent space, relying on a pre-trained autoencoder. We apply our model in the context of image restoration, which aims to reconstruct high-quality images from their degraded versions. This modification not only reduces the model's resource consumption but also maintains a flexible framework suitable for various applications. Our model achieves a significant reduction in computational requirements while maintaining high reconstruction fidelity. This increase in efficiency allows for inputs up to 16 times larger than current state-of-the-art neural cellular automata models, using the same resources.

artificial intelligence, deep learning, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2403.15525

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Italy > Lombardy > Milan (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Alberta > Census Division No. 15 > Improvement District No. 9 > Banff (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Unsupervised Graph Neural Architecture Search with Disentangled Self-supervision

Zhang, Zeyang, Wang, Xin, Zhang, Ziwei, Shen, Guangyao, Shen, Shiqi, Zhu, Wenwu

arXiv.org Artificial IntelligenceMar-8-2024

The existing graph neural architecture search (GNAS) methods heavily rely on supervised labels during the search process, failing to handle ubiquitous scenarios where supervisions are not available. In this paper, we study the problem of unsupervised graph neural architecture search, which remains unexplored in the literature. The key problem is to discover the latent graph factors that drive the formation of graph data as well as the underlying relations between the factors and the optimal neural architectures. Handling this problem is challenging given that the latent graph factors together with architectures are highly entangled due to the nature of the graph and the complexity of the neural architecture search process. To address the challenge, we propose a novel Disentangled Self-supervised Graph Neural Architecture Search (DSGAS) model, which is able to discover the optimal architectures capturing various latent graph factors in a self-supervised fashion based on unlabeled graph data. Specifically, we first design a disentangled graph super-network capable of incorporating multiple architectures with factor-wise disentanglement, which are optimized simultaneously. Then, we estimate the performance of architectures under different factors by our proposed self-supervised training with joint architecture-graph disentanglement. Finally, we propose a contrastive search with architecture augmentations to discover architectures with factor-specific expertise. Extensive experiments on 11 real-world datasets demonstrate that the proposed model is able to achieve state-of-the-art performance against several baseline methods in an unsupervised manner.

architecture, architecture search, neural architecture search, (15 more...)

arXiv.org Artificial Intelligence

2403.05064

Country:

Asia > China > Beijing > Beijing (0.04)
Europe > Greece > Attica > Athens (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.93)
Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

Interactive Continual Learning Architecture for Long-Term Personalization of Home Service Robots

Ayub, Ali, Nehaniv, Chrystopher, Dautenhahn, Kerstin

arXiv.org Artificial IntelligenceMar-5-2024

For robots to perform assistive tasks in unstructured home environments, they must learn and reason on the semantic knowledge of the environments. Despite a resurgence in the development of semantic reasoning architectures, these methods assume that all the training data is available a priori. However, each user's environment is unique and can continue to change over time, which makes these methods unsuitable for personalized home service robots. Although research in continual learning develops methods that can learn and adapt over time, most of these methods are tested in the narrow context of object classification on static image datasets. In this paper, we combine ideas from continual learning, semantic reasoning, and interactive machine learning literature and develop a novel interactive continual learning architecture for continual learning of semantic knowledge in a home environment through human-robot interaction. The architecture builds on core cognitive principles of learning and memory for efficient and real-time learning of new knowledge from humans. We integrate our architecture with a physical mobile manipulator robot and perform extensive system evaluations in a laboratory environment over two months. Our results demonstrate the effectiveness of our architecture to allow a physical robot to continually adapt to the changes in the environment from limited data provided by the users (experimenters), and use the learned knowledge to perform object fetching tasks.

architecture, learning, robot, (16 more...)

arXiv.org Artificial Intelligence

2403.03462

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Robots > Robots in the Home (0.62)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.60)

Add feedback