AITopics

2302.14017

Country:

North America > United States (0.46)
Asia > Middle East (0.28)

Genre:

Overview (1.00)
Research Report > New Finding (0.92)

Industry:

Information Technology (0.67)
Semiconductors & Electronics (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceFeb-26-2023, 11:45:09 GMT

Attention is All you Need. Unveiling the Science Behind ChatGPT

This article provides an overview of the ChatGPT language model, which has made significant contributions to the field of natural language processing. We discuss the limitations of traditional neural network architectures and introduce the transformer architecture, which uses self-attention mechanisms to handle long-term dependencies and variable-length inputs. We explain the key mechanisms behind ChatGPT, including attention, scale dot-product attention, multi-head attention, position-wise feed-forward networks, embeddings, softmax, and positional encoding. We also discuss the applications of attention and the importance of training, including training data and batching, hardware and schedule, optimizer, and regularization. Finally, we present the results of ChatGPT in various tasks, such as machine translation and model variations, demonstrating its potential to revolutionize the field of NLP.

input sequence, mechanism, natural language processing, (11 more...)

#artificialintelligence

Genre: Overview (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Double Matching Under Complementary Preferences

Li, Yuantong, Cheng, Guang, Dai, Xiaowu

In this paper, we propose a new algorithm for addressing the problem of matching markets with complementary preferences, where agents' preferences are unknown a priori and must be learned from data. The presence of complementary preferences can lead to instability in the matching process, making this problem challenging to solve. To overcome this challenge, we formulate the problem as a bandit learning framework and propose the Multi-agent Multi-type Thompson Sampling (MMTS) algorithm. The algorithm combines the strengths of Thompson Sampling for exploration with a double matching technique to achieve a stable matching outcome. Our theoretical analysis demonstrates the effectiveness of MMTS as it is able to achieve stability at every matching step, satisfies the incentive-compatibility property, and has a sublinear Bayesian regret over time. Our approach provides a useful method for addressing complementary preferences in real-world scenarios.

artificial intelligence, machine learning, survey article, (18 more...)

2301.1023

Country: North America > United States > California (0.28)

Genre:

Research Report (0.81)
Overview (0.67)

Industry: Energy > Oil & Gas (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings

Bano, Sophia, Casella, Alessandro, Vasconcelos, Francisco, Qayyum, Abdul, Benzinou, Abdesslam, Mazher, Moona, Meriaudeau, Fabrice, Lena, Chiara, Cintorrino, Ilaria Anita, De Paolis, Gaia Romana, Biagioli, Jessica, Grechishnikova, Daria, Jiao, Jing, Bai, Bizhe, Qiao, Yanyan, Bhattarai, Binod, Gaire, Rebati Raman, Subedi, Ronast, Vazquez, Eduard, Płotka, Szymon, Lisowska, Aneta, Sitek, Arkadiusz, Attilakos, George, Wimalasundera, Ruwan, David, Anna L, Paladini, Dario, Deprest, Jan, De Momi, Elena, Mattos, Leonardo S, Moccia, Sara, Stoyanov, Danail

Fetoscopy laser photocoagulation is a widely adopted procedure for treating Twin-to-Twin Transfusion Syndrome (TTTS). The procedure involves photocoagulation pathological anastomoses to regulate blood exchange among twins. The procedure is particularly challenging due to the limited field of view, poor manoeuvrability of the fetoscope, poor visibility, and variability in illumination. These challenges may lead to increased surgery time and incomplete ablation. Computer-assisted intervention (CAI) can provide surgeons with decision support and context awareness by identifying key structures in the scene and expanding the fetoscopic field of view through video mosaicking. Research in this domain has been hampered by the lack of high-quality data to design, develop and test CAI algorithms. Through the Fetoscopic Placental Vessel Segmentation and Registration (FetReg2021) challenge, which was organized as part of the MICCAI2021 Endoscopic Vision challenge, we released the first largescale multicentre TTTS dataset for the development of generalized and robust semantic segmentation and video mosaicking algorithms. For this challenge, we released a dataset of 2060 images, pixel-annotated for vessels, tool, fetus and background classes, from 18 in-vivo TTTS fetoscopy procedures and 18 short video clips. Seven teams participated in this challenge and their model performance was assessed on an unseen test dataset of 658 pixel-annotated images from 6 fetoscopic procedures and 6 short clips. The challenge provided an opportunity for creating generalized solutions for fetoscopic scene understanding and mosaicking. In this paper, we present the findings of the FetReg2021 challenge alongside reporting a detailed literature review for CAI in TTTS fetoscopy. Through this challenge, its analysis and the release of multi-centre fetoscopic data, we provide a benchmark for future research in this field.

artificial intelligence, image understanding, machine learning, (20 more...)

2206.12512

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > United Kingdom > England > Greater London > London (0.04)
Asia > Nepal (0.04)
(11 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine > Surgery (1.00)
Health & Medicine > Health Care Providers & Services (0.93)
Health & Medicine > Diagnostic Medicine > Imaging (0.46)
Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.68)

A Survey on Learnable Evolutionary Algorithms for Scalable Multiobjective Optimization

Liu, Songbai, Lin, Qiuzhen, Li, Jianqiang, Tan, Kay Chen

Recent decades have witnessed great advancements in multiobjective evolutionary algorithms (MOEAs) for multiobjective optimization problems (MOPs). However, these progressively improved MOEAs have not necessarily been equipped with scalable and learnable problem-solving strategies for new and grand challenges brought by the scaling-up MOPs with continuously increasing complexity from diverse aspects, mainly including expensive cost of function evaluations, many objectives, large-scale search space, time-varying environments, and multi-task. Under different scenarios, divergent thinking is required in designing new powerful MOEAs for solving them effectively. In this context, research studies on learnable MOEAs with machine learning techniques have received extensive attention in the field of evolutionary computation. This paper begins with a general taxonomy of scaling-up MOPs and learnable MOEAs, followed by an analysis of the challenges that these MOPs pose to traditional MOEAs. Then, we synthetically overview recent advances of learnable MOEAs in solving various scaling-up MOPs, focusing primarily on four attractive directions (i.e., learnable evolutionary discriminators for environmental selection, learnable evolutionary generators for reproduction, learnable evolutionary evaluators for function evaluations, and learnable evolutionary transfer modules for sharing or reusing optimization experience). The insight of learnable MOEAs is offered to readers as a reference to the general track of the efforts in this field.

evolutionary algorithm, machine learning, optimization, (14 more...)

doi: 10.1109/TEVC.2023.3250350

2206.11526

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > Slovenia > Central Slovenia > Municipality of Ljubljana > Ljubljana (0.04)
Asia > Vietnam > Long An Province > Tân An (0.04)
(4 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Education (0.67)
Transportation (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
(2 more...)

Santanam, Tejas, Trasatti, Anthony, Zhang, Hanyu, Riley, Connor, Van Hentenryck, Pascal, Krishnan, Ramayya

Changes in Commuter Behavior from COVID-19 Lockdowns in the Atlanta Metropolitan Area

This paper analyzes the impact of COVID-19 related lockdowns in the Atlanta, Georgia metropolitan area by examining commuter patterns in three periods: prior to, during, and after the pandemic lockdown. A cellular phone location dataset is utilized in a novel pipeline to infer the home and work locations of thousands of users from the Density-based Spatial Clustering of Applications with Noise (DBSCAN) algorithm. The coordinates derived from the clustering are put through a reverse geocoding process from which word embeddings are extracted in order to categorize the industry of each work place based on the workplace name and Point of Interest (POI) mapping. Frequencies of commute from home locations to work locations are analyzed in and across all three time periods. Public health and economic factors are discussed to explain potential reasons for the observed changes in commuter patterns.

artificial intelligence, machine learning, work location, (19 more...)

2302.13512

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.24)
North America > United States > Texas > Harris County > Houston (0.14)
North America > Mexico (0.04)
(2 more...)

Genre:

Research Report (0.90)
Overview (0.54)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Geographic Information Systems (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)

Towards Interpretable Federated Learning

Li, Anran, Liu, Rui, Hu, Ming, Tuan, Luu Anh, Yu, Han

Federated learning (FL) enables multiple data owners to build machine learning models collaboratively without exposing their private local data. In order for FL to achieve widespread adoption, it is important to balance the need for performance, privacy-preservation and interpretability, especially in mission critical applications such as finance and healthcare. Thus, interpretable federated learning (IFL) has become an emerging topic of research attracting significant interest from the academia and the industry alike. Its interdisciplinary nature can be challenging for new researchers to pick up. In this paper, we bridge this gap by providing (to the best of our knowledge) the first survey on IFL. We propose a unique IFL taxonomy which covers relevant works enabling FL models to explain the prediction results, support model debugging, and provide insights into the contributions made by individual data owners or data samples, which in turn, is crucial for allocating rewards fairly to motivate active and reliable participation in FL. We conduct comprehensive analysis of the representative IFL approaches, the commonly adopted performance evaluation metrics, and promising directions towards building versatile IFL techniques.

artificial intelligence, federated learning, machine learning, (16 more...)

2302.13473

Country: Asia > Singapore (0.04)

Genre:

Overview (0.68)
Research Report (0.50)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Principled and Efficient Transfer Learning of Deep Models via Neural Collapse

Li, Xiao, Liu, Sheng, Zhou, Jinxin, Lu, Xinyu, Fernandez-Granda, Carlos, Zhu, Zhihui, Qu, Qing

As model size continues to grow and access to labeled training data remains limited, transfer learning has become a popular approach in many scientific and engineering fields. This study explores the phenomenon of neural collapse (NC) in transfer learning for classification problems, which is characterized by the last-layer features and classifiers of deep networks having zero within-class variability in features and maximally and equally separated between-class feature means. Through the lens of NC, in this work the following findings on transfer learning are discovered: (i) preventing within-class variability collapse to a certain extent during model pre-training on source data leads to better transferability, as it preserves the intrinsic structures of the input data better; (ii) obtaining features with more NC on downstream data during fine-tuning results in better test accuracy. These results provide new insight into commonly used heuristics in model pre-training, such as loss design, data augmentation, and projection heads, and lead to more efficient and principled methods for fine-tuning large pre-trained models. Compared to full model fine-tuning, our proposed fine-tuning methods achieve comparable or even better performance while reducing fine-tuning parameters by at least 70% as well as alleviating overfitting.

artificial intelligence, machine learning, transfer accuracy, (18 more...)

2212.12206

Country: North America > United States (0.67)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)

Industry: Health & Medicine (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Çöltekin, Çağrı, Doğruöz, A. Seza, Çetinoğlu, Özlem

Resources for Turkish Natural Language Processing: A critical survey

arXiv.org Artificial IntelligenceFeb-25-2023

The recent (re)popularization of deep learning methods increased the importance and need for the data even further. Similarly, the other subfields of theoretical and applied linguistics have also seen a shift towards more data-driven methods. As a result, availability of large and high-quality language data is essential for both linguistic research and practical NLP applications. In this paper, we present a comprehensive and critical survey of linguistic resources for Turkish.

artificial intelligence, machine learning, natural language, (18 more...)

doi: 10.1007/s10579-022-09605-4

2204.05042

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
(49 more...)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.67)
Research Report > New Finding (0.45)

Industry:

Media > News (1.00)
Education (1.00)
Government (0.67)
Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
(5 more...)

Benkert, Ryan, Aribido, Oluwaseun Joseph, AlRegib, Ghassan

Example Forgetting: A Novel Approach to Explain and Interpret Deep Neural Networks in Seismic Interpretation

arXiv.org Artificial IntelligenceFeb-24-2023

In recent years, deep neural networks have significantly impacted the seismic interpretation process. Due to the simple implementation and low interpretation costs, deep neural networks are an attractive component for the common interpretation pipeline. However, neural networks are frequently met with distrust due to their property of producing semantically incorrect outputs when exposed to sections the model was not trained on. We address this issue by explaining model behaviour and improving generalization properties through example forgetting: First, we introduce a method that effectively relates semantically malfunctioned predictions to their respectful positions within the neural network representation manifold. More concrete, our method tracks how models "forget" seismic reflections during training and establishes a connection to the decision boundary proximity of the target class. Second, we use our analysis technique to identify frequently forgotten regions within the training volume and augment the training set with state-of-the-art style transfer techniques from computer vision. We show that our method improves the segmentation performance on underrepresented classes while significantly reducing the forgotten regions in the F3 volume in the Netherlands.

artificial intelligence, machine learning, survey article, (17 more...)

doi: 10.1109/TGRS.2022.3178112

2302.14644

Country:

North America > United States (0.28)
Asia > Japan (0.28)
Europe > Netherlands (0.24)

Genre:

Research Report > Promising Solution (0.40)
Overview > Innovation (0.40)

Industry: Energy > Oil & Gas > Upstream (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)