AITopics | Calgary

Calgary

Traveling Words: A Geometric Interpretation of Transformers

arXiv.org Artificial Intelligence, Sep-18-2023

Transformers have significantly advanced the field of natural language processing, but comprehending their internal mechanisms remains a challenge. In this paper, we introduce a novel geometric perspective that elucidates the inner mechanisms of transformer operations. Our primary contribution is illustrating how layer normalization confines the latent features to a hyper-sphere, subsequently enabling attention to mold the semantic representation of words on this surface. This geometric viewpoint seamlessly connects established properties such as iterative refinement and contextual embeddings. We validate our insights by probing a pre-trained 124M parameter GPT-2 model. Our findings reveal clear query-key attention patterns in early layers and build upon prior observations regarding the subject-specific nature of attention heads at deeper layers. Harnessing these geometric insights, we present an intuitive understanding of transformers, depicting them as processes that model the trajectory of word particles along the hyper-sphere.
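
The hyper-sphere claim in the abstract above can be checked numerically. The following is a minimal sketch, not the authors' code: it assumes layer normalization without the learned scale and shift parameters and with epsilon set to zero, and it uses random vectors in place of real GPT-2 activations. The names `layer_norm` and `l2_norm` are illustrative helpers, and 768 is the hidden size of the 124M-parameter GPT-2.

```python
import math
import random

def layer_norm(x, eps=0.0):
    """Normalize a vector to zero mean and unit variance (no learned scale or shift)."""
    d = len(x)
    mean = sum(x) / d
    var = sum((v - mean) ** 2 for v in x) / d
    return [(v - mean) / math.sqrt(var + eps) for v in x]

def l2_norm(x):
    """Euclidean length of a vector."""
    return math.sqrt(sum(v * v for v in x))

random.seed(0)
d = 768  # hidden size of the 124M-parameter GPT-2

# Three random stand-in "activations" with very different scales.
vectors = [[random.gauss(0, s) for _ in range(d)] for s in (0.1, 1.0, 10.0)]

# After normalization every vector has the same length, sqrt(d),
# so all of them lie on one hyper-sphere regardless of input scale.
norms = [l2_norm(layer_norm(v)) for v in vectors]
```

Under these assumptions every entry of `norms` equals the square root of 768, about 27.71, because the squared entries of a zero-mean, unit-variance vector of dimension d sum to d. With learned scale and shift parameters the surface is stretched and translated rather than a perfect sphere.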

layer normalization, matrix, vector, (10 more...)

arXiv.org Artificial Intelligence

2309.07315

Country:

Europe > United Kingdom > Scotland (0.04)
South America > Bolivia (0.04)
North America > United States > Utah > Salt Lake County > Murray (0.04)
(11 more...)

Genre: Research Report > New Finding (0.87)

Industry:

Leisure & Entertainment (0.93)
Health & Medicine (0.67)
Food & Agriculture > Agriculture (0.67)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Safe and Accelerated Deep Reinforcement Learning-based O-RAN Slicing: A Hybrid Transfer Learning Approach

Nagib, Ahmad M., Abou-Zeid, Hatem, Hassanein, Hossam S.

arXiv.org Artificial Intelligence, Sep-18-2023

The open radio access network (O-RAN) architecture supports intelligent network control algorithms as one of its core capabilities. Data-driven applications incorporate such algorithms to optimize radio access network (RAN) functions via RAN intelligent controllers (RICs). Deep reinforcement learning (DRL) algorithms are among the main approaches adopted in the O-RAN literature to solve dynamic radio resource management problems. However, despite the benefits introduced by the O-RAN RICs, the practical adoption of DRL algorithms in real network deployments lags behind. This is primarily due to the slow convergence and unstable performance exhibited by DRL agents upon deployment and when encountering previously unseen network conditions. In this paper, we address these challenges by proposing transfer learning (TL) as a core component of the training and deployment workflows for the DRL-based closed-loop control of O-RAN functionalities. To this end, we propose and design a hybrid TL-aided approach that leverages the advantages of both policy reuse and distillation TL methods to provide safe and accelerated convergence in DRL-based O-RAN slicing. We conduct a thorough experiment that accommodates multiple services, including real VR gaming traffic, to reflect practical scenarios of O-RAN slicing. We also propose and implement policy reuse-aided DRL, distillation-aided DRL, and non-TL-aided DRL as three separate baselines. The proposed hybrid approach shows at least 7.7% and 20.7% improvements in the average initial reward value and the percentage of converged scenarios, respectively, and a 64.6% decrease in reward variance, while maintaining fast convergence and enhancing generalizability compared with the baselines.

agent, drl agent, expert policy, (15 more...)

arXiv.org Artificial Intelligence

2309.07265

Country:

North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.14)
Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.04)
North America > Canada > Ontario > Kingston (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.46)
Personal > Honors (0.46)

Industry:

Leisure & Entertainment > Games > Computer Games (0.48)
Telecommunications > Networks (0.46)
Education > Educational Setting (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

A Survey of Privacy Attacks in Machine Learning

Rigaki, Maria, Garcia, Sebastian

arXiv.org Artificial Intelligence, Sep-16-2023

As machine learning becomes more widely used, the need to study its implications in security and privacy becomes more urgent. Although the body of work in privacy has been steadily growing over the past few years, research on the privacy aspects of machine learning has received less focus than the security aspects. Our contribution in this research is an analysis of more than 40 papers related to privacy attacks against machine learning that have been published during the past seven years. We propose an attack taxonomy, together with a threat model that allows the categorization of different attacks based on the adversarial knowledge, and the assets under attack. An initial exploration of the causes of privacy leaks is presented, as well as a detailed analysis of the different attacks. Finally, we present an overview of the most commonly proposed defenses and a discussion of the open problems and future directions identified during our analysis.

adversary, dataset, learning, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3624010

2007.07646

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > United States > New York > New York County > New York City (0.05)
(35 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

TensorFlow Chaotic Prediction and Blow Up

Andrecut, M.

arXiv.org Artificial Intelligence, Sep-14-2023

Predicting the dynamics of complex systems exhibiting high-dimensional spatiotemporal chaos is a challenging machine learning problem with important applications in physics, biology, medicine, economics, meteorology, etc. Another problem of interest is the inverse problem of inferring the connectivity network of such a system from input-output measurements. One such example is inferring the connectivity of genetic regulatory networks from measurements of gene expression data. Here we explore the feasibility of these problems using a complex system corresponding to a non-linear network model we previously discussed in [1]. This is a continuous model of non-linear random networks (NLRN), which exhibits a phase transition from ordered to chaotic dynamics as a function of the average network connectivity (in-degree). In the chaotic regime, these networks show strong sensitivity to initial conditions and quickly forget their past states, which makes their dynamics harder to predict and their connectivity harder to infer. In our approach we use the TensorFlow library [2], which is the state of the art for deep neural network training and prediction. Our numerical results show that the dynamics of the considered system can be successfully predicted for short times.

chaotic system, neural network, prediction, (10 more...)

arXiv.org Artificial Intelligence

2309.0745

Country: North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.04)

Genre: Research Report > New Finding (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification

Kawano, Harunori, Shimizu, Sota

arXiv.org Artificial Intelligence, Sep-10-2023

Wav2vec2 has achieved success in applying the Transformer architecture and self-supervised learning to speech recognition. Recently, these have come to be used not only for speech recognition but also across speech processing as a whole. This paper introduces an effective end-to-end speaker identification model that applies a Transformer-based contextual model. We explored the relationship between the hyper-parameters and the performance in order to discern the structure of an effective model. Furthermore, we propose a pooling method, Temporal Gate Pooling, with powerful learning ability for speaker identification. We applied Conformer as the encoder and BEST-RQ for pre-training, and conducted an evaluation on the VoxCeleb1 speaker identification task. The proposed method achieved an accuracy of 87.1% with 28.5M parameters, demonstrating comparable precision to wav2vec2 with 317.7M parameters. Code is available at https://github.com/HarunoriKawano/speaker-identification-with-tgp.

accuracy, identification, speaker identification, (11 more...)

arXiv.org Artificial Intelligence

2308.11241

Country:

Asia > India (0.05)
Oceania > Australia > Queensland > Brisbane (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(3 more...)

Genre: Research Report (0.83)

Technology:

Information Technology > Artificial Intelligence > Speech > Acoustic Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Linking Symptom Inventories using Semantic Textual Similarity

Kennedy, Eamonn, Vadlamani, Shashank, Lindsey, Hannah M, Peterson, Kelly S, OConnor, Kristen Dams, Murray, Kenton, Agarwal, Ronak, Amiri, Houshang H, Andersen, Raeda K, Babikian, Talin, Baron, David A, Bigler, Erin D, Caeyenberghs, Karen, Delano-Wood, Lisa, Disner, Seth G, Dobryakova, Ekaterina, Eapen, Blessen C, Edelstein, Rachel M, Esopenko, Carrie, Genova, Helen M, Geuze, Elbert, Goodrich-Hunsaker, Naomi J, Grafman, Jordan, Haberg, Asta K, Hodges, Cooper B, Hoskinson, Kristen R, Hovenden, Elizabeth S, Irimia, Andrei, Jahanshad, Neda, Jha, Ruchira M, Keleher, Finian, Kenney, Kimbra, Koerte, Inga K, Liebel, Spencer W, Livny, Abigail, Lovstad, Marianne, Martindale, Sarah L, Max, Jeffrey E, Mayer, Andrew R, Meier, Timothy B, Menefee, Deleene S, Mohamed, Abdalla Z, Mondello, Stefania, Monti, Martin M, Morey, Rajendra A, Newcombe, Virginia, Newsome, Mary R, Olsen, Alexander, Pastorek, Nicholas J, Pugh, Mary Jo, Razi, Adeel, Resch, Jacob E, Rowland, Jared A, Russell, Kelly, Ryan, Nicholas P, Scheibel, Randall S, Schmidt, Adam T, Spitz, Gershon, Stephens, Jaclyn A, Tal, Assaf, Talbert, Leah D, Tartaglia, Maria Carmela, Taylor, Brian A, Thomopoulos, Sophia I, Troyanskaya, Maya, Valera, Eve M, van der Horn, Harm Jan, Van Horn, John D, Verma, Ragini, Wade, Benjamin SC, Walker, Willian SC, Ware, Ashley L, Werner, J Kent Jr, Yeates, Keith Owen, Zafonte, Ross D, Zeineh, Michael M, Zielinski, Brandon, Thompson, Paul M, Hillary, Frank G, Tate, David F, Wilde, Elisabeth A, Dennis, Emily L

arXiv.org Artificial Intelligence, Sep-8-2023

An extensive library of symptom inventories has been developed over time to measure clinical symptoms, but this variety has led to several long-standing issues. Most notably, results drawn from different settings and studies are not comparable, which limits reproducibility. Here, we present an artificial intelligence (AI) approach using semantic textual similarity (STS) to link symptoms and scores across previously incongruous symptom inventories. We tested the ability of four pre-trained STS models to screen thousands of symptom description pairs for related content, a challenging task typically requiring expert panels. Models were tasked to predict symptom severity across four different inventories for 6,607 participants drawn from 16 international data sources. The STS approach achieved 74.8% accuracy across five tasks, outperforming other models tested. This work suggests that incorporating contextual, semantic information can assist expert decision-making processes, yielding gains for both general and disease-specific clinical assessment.
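
The pair-screening step described above can be illustrated with a toy sketch. It rests on assumptions that do not come from the paper: each symptom description is taken to be already embedded as a vector, pairs are scored by cosine similarity (a common choice for STS, not confirmed as the authors' method), and the item names, three-dimensional vectors, and 0.8 threshold are all hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def link_items(items_a, items_b, threshold):
    """Return (name_a, name_b, score) for every cross-inventory pair at or above threshold."""
    links = []
    for name_a, vec_a in items_a.items():
        for name_b, vec_b in items_b.items():
            score = cosine_similarity(vec_a, vec_b)
            if score >= threshold:
                links.append((name_a, name_b, score))
    return links

# Hypothetical embeddings; a real STS model would produce these from text.
inventory_a = {"trouble sleeping": [1.0, 0.0, 0.0], "feeling sad": [0.0, 1.0, 0.0]}
inventory_b = {"insomnia": [0.9, 0.1, 0.0], "low mood": [0.1, 0.9, 0.0]}

links = link_items(inventory_a, inventory_b, threshold=0.8)
```

With these made-up vectors, only the two intuitively matching pairs clear the threshold; the mismatched pairs score close to 0.11 and are screened out.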

inventory, symptom, university, (11 more...)

arXiv.org Artificial Intelligence

2309.04607

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.30)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
North America > United States > Virginia > Albemarle County > Charlottesville (0.14)
(50 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

A Comparative Analysis of Pretrained Language Models for Text-to-Speech

Granero-Moya, Marcel, Karanasou, Penny, Karlapati, Sri, Schnell, Bastian, Peinelt, Nicole, Moinet, Alexis, Drugman, Thomas

arXiv.org Artificial Intelligence, Sep-4-2023

State-of-the-art text-to-speech (TTS) systems have utilized pretrained language models (PLMs) to enhance prosody and create more natural-sounding speech. However, while PLMs have been extensively researched for natural language understanding (NLU), their impact on TTS has been overlooked. In this study, we aim to address this gap by conducting a comparative analysis of different PLMs for two TTS tasks: prosody prediction and pause prediction. Firstly, we trained a prosody prediction model using 15 different PLMs. Our findings revealed a logarithmic relationship between model size and quality, as well as significant performance differences between neutral and expressive prosody. Secondly, we employed PLMs for pause prediction and found that the task was less sensitive to small models. We also identified a strong correlation between our empirical results and the GLUE scores obtained for these language models. To the best of our knowledge, this is the first study of its kind to investigate the impact of different PLMs on TTS.

language model, pause prediction, prediction, (13 more...)

arXiv.org Artificial Intelligence

2309.01576

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > China (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(7 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Synthesis (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

RGI-Net: 3D Room Geometry Inference from Room Impulse Responses in the Absence of First-order Echoes

Yeon, Inmo, Choi, Jung-Woo

arXiv.org Artificial Intelligence, Sep-4-2023

Room geometry is important prior information for implementing realistic 3D audio rendering. For this reason, various room geometry inference (RGI) methods have been developed by utilizing the time of arrival (TOA) or time difference of arrival (TDOA) information in room impulse responses. However, conventional RGI techniques impose several assumptions, such as convex room shapes, a number of walls known a priori, and the visibility of first-order reflections. In this work, we introduce a deep neural network (DNN), RGI-Net, which can estimate room geometries without the aforementioned assumptions. RGI-Net learns and exploits complex relationships between high-order reflections in room impulse responses (RIRs) and, thus, can estimate room shapes even when the shape is non-convex or first-order reflections are missing in the RIRs. The network takes RIRs measured from a compact audio device equipped with a circular microphone array and a single loudspeaker, which greatly improves its practical applicability. RGI-Net includes an evaluation network that separately evaluates the presence probability of walls, so geometry inference is possible without prior knowledge of the number of walls.

geometry, high-order reflection, reflection, (15 more...)

arXiv.org Artificial Intelligence

2309.01513

Country:

North America > United States > Texas > Dallas County > Dallas (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(7 more...)

Genre: Research Report > New Finding (0.69)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Reasoning over the Air: A Reasoning-based Implicit Semantic-Aware Communication Framework

Xiao, Yong, Liao, Yiwei, Li, Yingyu, Shi, Guangming, Poor, H. Vincent, Saad, Walid, Debbah, Merouane, Bennis, Mehdi

arXiv.org Artificial Intelligence, Sep-2-2023

Semantic-aware communication is a novel paradigm that draws inspiration from human communication, focusing on the delivery of the meaning of messages. It has attracted significant interest recently due to its potential to improve the efficiency and reliability of communication and enhance users' QoE. Most existing works focus on transmitting and delivering the explicit semantic meaning that can be directly identified from the source signal. This paper investigates implicit semantic-aware communication, in which the hidden information that cannot be directly observed from the source signal must be recognized and interpreted by the intended users. To this end, a novel implicit semantic-aware communication (iSAC) architecture is proposed for representing, communicating, and interpreting the implicit semantic meaning between source and destination users. A projection-based semantic encoder is proposed to convert the high-dimensional graphical representation of explicit semantics into a low-dimensional semantic constellation space for efficient physical channel transmission. To enable the destination user to learn and imitate the implicit semantic reasoning process of the source user, a generative adversarial imitation learning-based solution, called G-RML, is proposed. Unlike existing communication solutions, the source user in G-RML does not focus only on sending as many useful messages as possible; instead, it tries to guide the destination user to learn a reasoning mechanism that maps any observed explicit semantics to the corresponding implicit semantics that are most relevant to the semantic meaning. Compared to existing solutions, our proposed G-RML requires far fewer communication and computational resources and scales well to scenarios involving the communication of rich semantic meanings consisting of a large number of concepts and relations.

communication, destination user, relation, (15 more...)

arXiv.org Artificial Intelligence

2306.11229

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Europe > Austria > Vienna (0.14)
Asia > China > Hubei Province > Wuhan (0.04)
(23 more...)

Genre:

Personal > Honors (0.69)
Research Report > Promising Solution (0.46)

Industry:

Education (1.00)
Leisure & Entertainment (0.68)
Information Technology (0.68)
Telecommunications (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.93)
(4 more...)

Through depression, illness and a heck of a birthday party, video games have been our family's glue

Dominik Diamond

The Guardian, Aug-29-2023, 10:00:30 GMT

There are now new, different rooms to fill with all the things and machines that keep a family functioning. Some of them, as ever, are games consoles. I wrote recently about the place that the ZX Spectrum occupied in our council house in Arbroath in the 80s. But we had a machine before that. A bizarre arcade box that came with a black and white TV and an assortment of Pong knockoffs.

birthday party, depression, video game, (8 more...)

The Guardian

Country:

North America > Canada > Ontario > Toronto (0.05)
North America > Canada > Nova Scotia (0.05)
North America > Canada > Alberta > Census Division No. 6 > Calgary Metropolitan Region > Calgary (0.05)
(2 more...)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology: Information Technology > Artificial Intelligence > Games (0.87)