AITopics

2304.11376

Country:

Europe > Belgium > Brussels-Capital Region > Brussels (0.05)
North America > United States > Oregon > Lane County > Eugene (0.04)
Asia > Pakistan > Punjab > Lahore Division > Lahore (0.04)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Leisure & Entertainment > Games (1.00)
Education > Educational Setting > Higher Education (0.67)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)

Ghojogh, Benyamin, Ghodsi, Ali

Recurrent Neural Networks and Long Short-Term Memory Networks: Tutorial and Survey

arXiv.org Artificial IntelligenceApr-22-2023

Several solutions This is a tutorial paper on Recurrent Neural Network were proposed for this issue, some of which are close-toidentity (RNN), Long Short-Term Memory Network weight matrix (Mikolov et al., 2015), long delays (LSTM), and their variants. We start with a (Lin et al., 1995), leaky units (Jaeger et al., 2007; Sutskever dynamical system and backpropagation through & Hinton, 2010), and echo state networks (Jaeger & Haas, time for RNN. Then, we discuss the problems 2004; Jaeger, 2007). of gradient vanishing and explosion in longterm dependencies. We explain close-to-identity Sequence modeling requires both short-term and long-term weight matrix, long delays, leaky units, and echo dependencies. For example, consider the sentence "The state networks for solving this problem. Then, police is chasing the thief".

artificial intelligence, machine learning, schmidhuber, (18 more...)

2304.11461

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Europe > France (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(2 more...)

Genre:

Instructional Material > Course Syllabus & Notes (0.66)
Research Report (0.50)

Industry: Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceApr-21-2023

Event Tables for Efficient Experience Replay

Kompella, Varun, Walsh, Thomas J., Barrett, Samuel, Wurman, Peter, Stone, Peter

Experience replay (ER) is a crucial component of many deep reinforcement learning (RL) systems. However, uniform sampling from an ER buffer can lead to slow convergence and unstable asymptotic behaviors. This paper introduces Stratified Sampling from Event Tables (SSET), which partitions an ER buffer into Event Tables, each capturing important subsequences of optimal behavior. We prove a theoretical advantage over the traditional monolithic buffer approach and combine SSET with an existing prioritized sampling strategy to further improve learning speed and stability. Empirical results in challenging MiniGrid domains, benchmark RL environments, and a high-fidelity car racing simulator demonstrate the advantages and versatility of SSET over existing ER buffer sampling approaches.

machine learning, reinforcement learning, sset, (16 more...)

2211.00576

Country: North America > United States > Texas > Travis County > Austin (0.14)

Genre:

Research Report (1.00)
Instructional Material (1.00)

Industry:

Automobiles & Trucks (0.68)
Leisure & Entertainment > Sports > Motorsports (0.67)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.92)

arXiv.org Artificial IntelligenceApr-21-2023

Self Pre-training with Masked Autoencoders for Medical Image Classification and Segmentation

Zhou, Lei, Liu, Huidong, Bae, Joseph, He, Junjun, Samaras, Dimitris, Prasanna, Prateek

Masked Autoencoder (MAE) has recently been shown to be effective in pre-training Vision Transformers (ViT) for natural image analysis. By reconstructing full images from partially masked inputs, a ViT encoder aggregates contextual information to infer masked image regions. We believe that this context aggregation ability is particularly essential to the medical image domain where each anatomical structure is functionally and mechanically connected to other structures and regions. Because there is no ImageNet-scale medical image dataset for pre-training, we investigate a self pre-training paradigm with MAE for medical image analysis tasks. Our method pre-trains a ViT on the training set of the target data instead of another dataset. Thus, self pre-training can benefit more scenarios where pre-training data is hard to acquire. Our experimental results show that MAE self pre-training markedly improves diverse medical image tasks including chest X-ray disease classification, abdominal CT multi-organ segmentation, and MRI brain tumor segmentation. Code is available at https://github.com/cvlab-stonybrook/SelfMedMAE

artificial intelligence, machine learning, segmentation, (12 more...)

2203.05573

Country: North America > United States (0.15)

Genre:

Instructional Material > Online (0.40)
Instructional Material > Course Syllabus & Notes (0.40)
Research Report > New Finding (0.34)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

The GuardianApr-20-2023, 18:49:03 GMT

Fresh concerns raised over sources of training material for AI systems

Fresh fears have been raised about the training material used for some of the largest and most powerful artificial intelligence models, after several investigations exposed the fascist, pirated and malicious sources from which the data is harvested. One such dataset is the Colossal Clean Crawled Corpus, or C4, assembled by Google from more than 15m websites and used to train both the search engine's LaMDA AI as well as Meta's GPT competitor, LLaMA. The dataset is public, but its scale has made it difficult to examine the contents: it is supposedly a "clean" version of a more expansive dataset, Common Crawl, with "noisy" content, offensive language and racist slurs removed from the material. But an investigation by the Washington Post reveals that C4's "cleanliness" is only skin deep. While it draws on websites such as the Guardian – which makes up 0.05% of the entire dataset - and Wikipedia, as well as large databases such as Google Patents and the scientific journal hub PLOS, it also contains less reputable sites. The white nationalist site VDARE is in the database, one of the 1,000 largest sites, as is the far-right news site Breitbart.

dataset, fresh concern, training material, (7 more...)

The Guardian

Genre: Instructional Material (0.62)

Industry:

Media > News (0.57)
Law Enforcement & Public Safety > Terrorism (0.57)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.34)

Pursnani, Vinay, Sermet, Yusuf, Demir, Ibrahim

Performance of ChatGPT on the US Fundamentals of Engineering Exam: Comprehensive Assessment of Proficiency and Potential Implications for Professional Environmental Engineering Practice

In recent years, advancements in artificial intelligence (AI) have led to the development of large language models like GPT-4, demonstrating potential applications in various fields, including education. This study investigates the feasibility and effectiveness of using ChatGPT, a GPT-4 based model, in achieving satisfactory performance on the Fundamentals of Engineering (FE) Environmental Exam. This study further shows a significant improvement in the model's accuracy when answering FE exam questions through noninvasive prompt modifications, substantiating the utility of prompt modification as a viable approach to enhance AI performance in educational contexts. Furthermore, the findings reflect remarkable improvements in mathematical capabilities across successive iterations of ChatGPT models, showcasing their potential in solving complex engineering problems. Our paper also explores future research directions, emphasizing the importance of addressing AI challenges in education, enhancing accessibility and inclusion for diverse student populations, and developing AI-resistant exam questions to maintain examination integrity. By evaluating the performance of ChatGPT in the context of the FE Environmental Exam, this study contributes valuable insights into the potential applications and limitations of large language models in educational settings. As AI continues to evolve, these findings offer a foundation for further research into the responsible and effective integration of AI models across various disciplines, ultimately optimizing the learning experience and improving student outcomes.

chatgpt, environmental exam, exam, (16 more...)

2304.12198

Country:

North America > United States > Iowa > Johnson County > Iowa City (0.14)
North America > United States > Pennsylvania (0.04)

Genre:

Research Report > New Finding (1.00)
Instructional Material (0.88)
Research Report > Experimental Study (0.68)

Industry:

Health & Medicine (1.00)
Education > Curriculum > Subject-Specific Education (0.69)
Education > Educational Setting > Higher Education (0.46)
Education > Assessment & Standards > Student Performance (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

The e-Bike Motor Assembly: Towards Advanced Robotic Manipulation for Flexible Manufacturing

Rozo, Leonel, Kupcsik, Andras G., Schillinger, Philipp, Guo, Meng, Krug, Robert, van Duijkeren, Niels, Spies, Markus, Kesper, Patrick, Hoppe, Sabrina, Ziesche, Hanna, Bürger, Mathias, Arras, Kai O.

Robotic manipulation is currently undergoing a profound paradigm shift due to the increasing needs for flexible manufacturing systems, and at the same time, because of the advances in enabling technologies such as sensing, learning, optimization, and hardware. This demands for robots that can observe and reason about their workspace, and that are skillfull enough to complete various assembly processes in weakly-structured settings. Moreover, it remains a great challenge to enable operators for teaching robots on-site, while managing the inherent complexity of perception, control, motion planning and reaction to unexpected situations. Motivated by real-world industrial applications, this paper demonstrates the potential of such a paradigm shift in robotics on the industrial case of an e-Bike motor assembly. The paper presents a concept for teaching and programming adaptive robots on-site and demonstrates their potential for the named applications. The framework includes: (i) a method to teach perception systems onsite in a self-supervised manner, (ii) a general representation of object-centric motion skills and force-sensitive assembly skills, both learned from demonstration, (iii) a sequencing approach that exploits a human-designed plan to perform complex tasks, and (iv) a system solution for adapting and optimizing skills online. The aforementioned components are interfaced through a four-layer software architecture that makes our framework a tangible industrial technology. To demonstrate the generality of the proposed framework, we provide, in addition to the motivating e-Bike motor assembly, a further case study on dense box packing for logistics automation.

artificial intelligence, demonstration, machine learning, (20 more...)

2304.10595

Country:

North America > United States > New Jersey > Hudson County > Secaucus (0.04)
North America > United States > California (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(6 more...)

Genre:

Research Report (0.81)
Instructional Material (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Heteromated Decision-Making: Integrating Socially Assistive Robots in Care Relationships

Paluch, Richard, Aal, Tanja, Cerna, Katerina, Randall, Dave, Müller, Claudia

Technological development continues to advance, with consequences for the use of robots in health care. For this reason, this workshop contribution aims at consideration of how socially assistive robots can be integrated into care and what tasks they can take on. This also touches on the degree of autonomy of these robots and the balance of decision support and decision making in different situations. We want to show that decision making by robots is mediated by the balance between autonomy and safety. Our results are based on Design Fiction and Zine-Making workshops we conducted with scientific experts. Ultimately, we show that robots' actions take place in social groups. A robot does not typically decide alone, but its decision-making is embedded in group processes. The concept of heteromation, which describes the interconnection of human and machine actions, offers fruitful possibilities for exploring how robots can be integrated into caring relationships.

artificial intelligence, proceedings, robot, (14 more...)

2304.10116

Country:

North America > United States > New York > New York County > New York City (0.05)
Europe > Germany > North Rhine-Westphalia > Arnsberg Region > Siegen (0.05)
North America > United States > Hawaii (0.04)
(13 more...)

Genre:

Instructional Material (0.68)
Research Report > New Finding (0.34)

Industry:

Health & Medicine > Therapeutic Area (0.69)
Health & Medicine > Health Care Providers & Services (0.68)
Health & Medicine > Health Care Technology (0.46)

Technology: Information Technology > Artificial Intelligence > Robots > Robots in the Home (0.72)

Alberts, Alex, Bilionis, Ilias

Physics-informed Information Field Theory for Modeling Physical Systems with Uncertainty Quantification

Data-driven approaches coupled with physical knowledge are powerful techniques to model systems. The goal of such models is to efficiently solve for the underlying field by combining measurements with known physical laws. As many systems contain unknown elements, such as missing parameters, noisy data, or incomplete physical laws, this is widely approached as an uncertainty quantification problem. The common techniques to handle all the variables typically depend on the numerical scheme used to approximate the posterior, and it is desirable to have a method which is independent of any such discretization. Information field theory (IFT) provides the tools necessary to perform statistics over fields that are not necessarily Gaussian. We extend IFT to physics-informed IFT (PIFT) by encoding the functional priors with information about the physical laws which describe the field. The posteriors derived from this PIFT remain independent of any numerical scheme and can capture multiple modes, allowing for the solution of problems which are ill-posed. We demonstrate our approach through an analytical example involving the Klein-Gordon equation. We then develop a variant of stochastic gradient Langevin dynamics to draw samples from the joint posterior over the field and model parameters. We apply our method to numerical examples with various degrees of model-form error and to inverse problems involving nonlinear differential equations. As an addendum, the method is equipped with a metric which allows the posterior to automatically quantify model-form uncertainty. Because of this, our numerical experiments show that the method remains robust to even an incorrect representation of the physics given sufficient data. We numerically demonstrate that the method correctly identifies when the physics cannot be trusted, in which case it automatically treats learning the field as a regression problem.

artificial intelligence, machine learning, posterior, (16 more...)

doi: 10.1016/j.jcp.2023.112100

2301.07609

Country:

North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report (1.00)
Overview (0.67)
Instructional Material > Course Syllabus & Notes (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Pimenta, Pedro Foletto, Avelar, Pedro H. C., Lamb, Luis C.

Solving the Kidney-Exchange Problem via Graph Neural Networks with No Supervision

arXiv.org Artificial IntelligenceApr-19-2023

This paper introduces a new learning-based approach for approximately solving the Kidney-Exchange Problem (KEP), an NP-hard problem on graphs. The problem consists of, given a pool of kidney donors and patients waiting for kidney donations, optimally selecting a set of donations to optimize the quantity and quality of transplants performed while respecting a set of constraints about the arrangement of these donations. The proposed technique consists of two main steps: the first is a Graph Neural Network (GNN) trained without supervision; the second is a deterministic non-learned search heuristic that uses the output of the GNN to find paths and cycles. To allow for comparisons, we also implemented and tested an exact solution method using integer programming, two greedy search heuristics without the machine learning module, and the GNN alone without a heuristic. We analyze and compare the methods and conclude that the learning-based two-stage approach is the best solution quality, outputting approximate solutions on average 1.1 times more valuable than the ones from the deterministic heuristic alone.

dataset, graph, node, (16 more...)

2304.09975

Country:

South America > Brazil > Rio Grande do Sul > Porto Alegre (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre:

Research Report > New Finding (0.68)
Instructional Material > Course Syllabus & Notes (0.48)

Industry: Health & Medicine > Therapeutic Area > Nephrology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)