AITopics

2210.00676

Country:

North America > Canada > Quebec > Montreal (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.61)

Hallmen, Tobias, Mertes, Silvan, Schiller, Dominik, André, Elisabeth

An Efficient Multitask Learning Architecture for Affective Vocal Burst Analysis

arXiv.org Artificial IntelligenceSep-28-2022

Affective speech analysis is an ongoing topic of research. A relatively new problem in this field is the analysis of vocal bursts, which are nonverbal vocalisations such as laughs or sighs. Current state-of-the-art approaches to address affective vocal burst analysis are mostly based on wav2vec2 or HuBERT features. In this paper, we investigate the use of the wav2vec successor data2vec in combination with a multitask learning pipeline to tackle different analysis problems at once. To assess the performance of our efficient multitask learning architecture, we participate in the 2022 ACII Affective Vocal Burst Challenge, showing that our approach substantially outperforms the baseline established there in three different subtasks.

artificial intelligence, machine learning, vocal burst, (14 more...)

2209.13914

Country:

Europe > Germany (0.05)
Europe > Portugal (0.04)
Asia > India > Telangana > Hyderabad (0.04)
Africa > Guinea-Bissau (0.04)

Genre:

Research Report > New Finding (0.46)
Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.81)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.68)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.61)

arXiv.org Artificial IntelligenceSep-28-2022

NAAP-440 Dataset and Baseline for Neural Architecture Accuracy Prediction

Hakim, Tal

Neural architecture search (NAS) has become a common approach to developing and discovering new neural architectures for different target platforms and purposes. However, scanning the search space is comprised of long training processes of many candidate architectures, which is costly in terms of computational resources and time. Regression algorithms are a common tool to predicting a candidate architecture's accuracy, which can dramatically accelerate the search procedure. We aim at proposing a new baseline that will support the development of regression algorithms that can predict an architecture's accuracy just from its scheme, or by only training it for a minimal number of epochs. Therefore, we introduce the NAAP-440 dataset of 440 neural architectures, which were trained on CIFAR10 using a fixed recipe. Our experiments indicate that by using off-the-shelf regression algorithms and running up to 10% of the training process, not only is it possible to predict an architecture's accuracy rather precisely, but that the values predicted for the architectures also maintain their accuracy order with a minimal number of monotonicity violations. This approach may serve as a powerful tool for accelerating NAS-based studies and thus dramatically increase their efficiency. The dataset and code used in the study have been made public.

architecture, artificial intelligence, machine learning, (17 more...)

2209.06626

Country: Asia > Middle East > Israel (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.94)

Towers, David, Forshaw, Matthew, Atapour-Abarghouei, Amir, McGough, Andrew Stephen

Long-term Reproducibility for Neural Architecture Search

arXiv.org Artificial IntelligenceJul-18-2022

It is a sad reflection of modern academia that code is often ignored after publication -- there is no academic 'kudos' for bug fixes / maintenance. Code is often unavailable or, if available, contains bugs, is incomplete, or relies on out-of-date / unavailable libraries. This has a significant impact on reproducibility and general scientific progress. Neural Architecture Search (NAS) is no exception to this, with some prior work in reproducibility. However, we argue that these do not consider long-term reproducibility issues. We therefore propose a checklist for long-term NAS reproducibility. We evaluate our checklist against common NAS approaches along with proposing how we can retrospectively make these approaches more long-term reproducible.

architecture search, artificial intelligence, machine learning, (16 more...)

2207.04821

Country: North America > United States > Washington > King County > Seattle (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.89)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.64)

arXiv.org Machine LearningJun-15-2022

Unsupervised Learning Architecture for Classifying the Transient Noise of Interferometric Gravitational-wave Detectors

Sakai, Yusuke, Itoh, Yousuke, Jung, Piljong, Kokeyama, Keiko, Kozakai, Chihiro, Nakahira, Katsuko T., Oshino, Shoichi, Shikano, Yutaka, Takahashi, Hirotaka, Uchiyama, Takashi, Ueshima, Gen, Washimi, Tatsuki, Yamamoto, Takahiro, Yokozawa, Takaaki

In the data obtained by laser interferometric gravitational wave detectors, transient noise with non-stationary and non-Gaussian features occurs at a high rate. This often results in problems such as detector instability and the hiding and/or imitation of gravitational-wave signals. This transient noise has various characteristics in the time--frequency representation, which is considered to be associated with environmental and instrumental origins. Classification of transient noise can offer clues for exploring its origin and improving the performance of the detector. One approach for accomplishing this is supervised learning. However, in general, supervised learning requires annotation of the training data, and there are issues with ensuring objectivity in the classification and its corresponding new classes. By contrast, unsupervised learning can reduce the annotation work for the training data and ensure objectivity in the classification and its corresponding new classes. In this study, we propose an unsupervised learning architecture for the classification of transient noise that combines a variational autoencoder and invariant information clustering. To evaluate the effectiveness of the proposed architecture, we used the dataset (time--frequency two-dimensional spectrogram images and labels) of the Laser Interferometer Gravitational-wave Observatory (LIGO) first observation run prepared by the Gravity Spy project. The classes provided by our proposed unsupervised learning architecture were consistent with the labels annotated by the Gravity Spy project, which manifests the potential for the existence of unrevealed classes.

artificial intelligence, interferometric gravitational-wave detector, machine learning, (3 more...)

arXiv.org Machine Learning

doi: 10.1038/s41598-022-13329-4

2111.10053

Genre: Research Report (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.80)

#artificialintelligenceJun-1-2022, 02:29:41 GMT

Machine Learning Architecture

The story is about the birth of Machine Learning architecture and how each of the three architects developed their own AI style. We tried to figure out how to teach AI to make architecture without mimicking existing styles.

machine learning architecture

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.83)

#artificialintelligenceMay-29-2022, 17:26:10 GMT

Adding Human Rationality to MNIST via Neural Cellular Automata

In this experiment, I used the results of the market research of my thesis made with real people to alter the kernel of a CNN. The idea was to add human rationality to solve the MNIST task using a…

accuracy, cellular automata, kernel, (14 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > United States > New Mexico (0.04)
North America > Canada (0.04)

Genre: Research Report (0.46)

Industry: Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.89)

Vandevelde, Simon, Vennekens, Joost

Problife: a Probabilistic Game of Life

arXiv.org Artificial IntelligenceJan-24-2022

This paper presents a probabilistic extension of the well-known cellular automaton, Game of Life. In Game of Life, cells are placed in a grid and then watched as they evolve throughout subsequent generations, as dictated by the rules of the game. In our extension, called ProbLife, these rules now have probabilities associated with them. Instead of cells being either dead or alive, they are denoted by their chance to live. After presenting the rules of ProbLife and its underlying characteristics, we show a concrete implementation in ProbLog, a probabilistic logic programming system. We use this to generate different images, as a form of rule-based generative art.

neighbour, probability, problife, (16 more...)

2201.09521

Country:

Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.05)
North America > United States > Wisconsin > Dane County > Madison (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.50)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.36)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.34)

#artificialintelligenceDec-1-2021, 19:27:00 GMT

Evolution of Cresta's machine learning architecture: Migration to AWS and PyTorch

Cresta Intelligence, a California-based AI startup, makes businesses radically more productive by using Expertise AI to help sales and service teams unlock their full potential. Cresta is bringing together world-renowned AI thought-leaders, engineers, and investors to create a real-time coaching and management solution that transforms sales and increases service productivity, weeks after application deployment. Cresta enables customers such as Intuit, Cox Communications, and Porsche to realize a 20% improvement in sales conversion rate, 25% greater average order value, and millions of dollars in additional annual revenue. This post discusses Cresta's journey as they moved from a multi-cloud environment to consolidating their machine learning (ML) workloads on AWS. It also gives a high-level view of their legacy and current training and inference architectures.

cresta, pytorch, workload, (17 more...)

Country:

North America > United States > Massachusetts (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)

Industry:

Retail > Online (0.40)
Information Technology > Services (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.56)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.40)

#artificialintelligenceNov-12-2021, 16:54:41 GMT

Learning from Mistakes – A Framework for Neural Architecture Search

Learning from one's mistakes is an effective human learning technique where the learners focus more on the topics where mistakes were made, so as to deepen their understanding. In this paper, we investigate if this human learning strategy can be applied in machine learning. We propose a novel machine learning method called Learning From Mistakes (LFM), wherein the learner improves its ability to learn by focusing more on the mistakes during revision. We formulate LFM as a three-stage optimization problem: 1) learner learns; 2) learner re-learns focusing on the mistakes, and; 3) learner validates its learning. We develop an efficient algorithm to solve the LFM problem.

learning, neural architecture search

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Problem-Independent Architectures (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)