AITopics | Edmonton

Collaborating Authors

Edmonton

Joint Dropout: Improving Generalizability in Low-Resource Neural Machine Translation through Phrase Pair Variables

Araabi, Ali, Niculae, Vlad, Monz, Christof

arXiv.org Artificial IntelligenceJul-24-2023

Although Neural Machine Translation (NMT) has made remarkable advances (Vaswani et al., 2017), it still requires large amounts of data to induce correct generalizations that characterize human intelligence (Lake et al., 2017). However, such a vast amount of data to make robust, reliable, and fair predictions is not available for low-resource NMT (Koehn and Knowles, 2017). The generalizability of NMT has been extensively studied in prior research, revealing the volatile behaviour of translation outputs when even a single token in the source sentence is modified (Belinkov and Bisk, 2018; Fadaee and Monz, 2020; Li et al., 2021). For instance, in the sentence "smallpox killed billions of people on this planet" from our IWSLT test set, when replacing the noun "smallpox" with another acute disease like "tuberculosis", the model should ideally generate a correct translation by only modifying the relevant part while keeping the rest of the sentence unchanged. However, in many instances, such a small perturbation adversely affects the translation of the entire sentence, highlighting the limited generalization and robustness of existing NMT models (Fadaee and Monz, 2020). Compositionality is regarded as the most prominent form of generalization that embodies the ability of human intelligence to generalize to new data, tasks, and domains (Schmidhuber, 1990; Lake and Baroni, 2018), while other types mostly focus on the practical considerations across domains, tasks, and languages, model robustness, and structural generalization (Hupkes et al., 2022). Research in compositional generalization has two main aspects: evaluating the current models' compositional abilities as well as improving them.

artificial intelligence, computational linguistic, natural language, (14 more...)

arXiv.org Artificial Intelligence

2307.12835

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
Asia > India (0.05)
(21 more...)

Genre:

Research Report (0.64)
Overview (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.74)
Health & Medicine > Therapeutic Area > Immunology (0.74)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

Exphormer: Sparse Transformers for Graphs

Shirzad, Hamed, Velingker, Ameya, Venkatachalam, Balaji, Sutherland, Danica J., Sinop, Ali Kemal

arXiv.org Artificial IntelligenceJul-24-2023

Graph transformers have emerged as a promising architecture for a variety of graph learning and representation tasks. Despite their successes, though, it remains challenging to scale graph transformers to large graphs while maintaining accuracy competitive with message-passing networks. In this paper, we introduce Exphormer, a framework for building powerful and scalable graph transformers. Exphormer consists of a sparse attention mechanism based on two mechanisms: virtual global nodes and expander graphs, whose mathematical characteristics, such as spectral expansion, pseduorandomness, and sparsity, yield graph transformers with complexity only linear in the size of the graph, while allowing us to prove desirable theoretical properties of the resulting transformer models. We show that incorporating Exphormer into the recently-proposed GraphGPS framework produces models with competitive empirical results on a wide variety of graph datasets, including state-of-the-art results on three datasets. We also show that Exphormer can scale to datasets on larger graphs than shown in previous graph transformer architectures. Code can be found at \url{https://github.com/hamed1375/Exphormer}.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2303.06147

Country:

North America > United States > California > Santa Clara County > Mountain View (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(5 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Explaining Autonomous Driving Actions with Visual Question Answering

Atakishiyev, Shahin, Salameh, Mohammad, Babiker, Housam, Goebel, Randy

arXiv.org Artificial IntelligenceJul-19-2023

The end-to-end learning ability of self-driving vehicles has achieved significant milestones over the last decade owing to rapid advances in deep learning and computer vision algorithms. However, as autonomous driving technology is a safety-critical application of artificial intelligence (AI), road accidents and established regulatory principles necessitate the need for the explainability of intelligent action choices for self-driving vehicles. To facilitate interpretability of decision-making in autonomous driving, we present a Visual Question Answering (VQA) framework, which explains driving actions with question-answering-based causal reasoning. To do so, we first collect driving videos in a simulation environment using reinforcement learning (RL) and extract consecutive frames from this log data uniformly for five selected action categories. Further, we manually annotate the extracted frames using question-answer pairs as justifications for the actions chosen in each scenario. Finally, we evaluate the correctness of the VQA-predicted answers for actions on unseen driving scenes. The empirical results suggest that the VQA mechanism can provide support to interpret real-time decisions of autonomous vehicles and help enhance overall driving safety.

explanation, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2307.10408

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
North America > United States > California (0.04)

Genre: Research Report > New Finding (0.88)

Industry:

Transportation > Ground > Road (1.00)
Information Technology > Robotics & Automation (1.00)
Automobiles & Trucks (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Task Space Control of Hydraulic Construction Machines using Reinforcement Learning

Lee, Hyung Joo, Brell-Cokcan, Sigrid

arXiv.org Artificial IntelligenceJul-19-2023

Teleoperation is vital in the construction industry, allowing safe machine manipulation from a distance. However, controlling machines at a joint level requires extensive training due to their complex degrees of freedom. Task space control offers intuitive maneuvering, but precise control often requires dynamic models, posing challenges for hydraulic machines. To address this, we use a data-driven actuator model to capture machine dynamics in real-world operations. By integrating this model into simulation and reinforcement learning, an optimal control policy for task space control is obtained.

artificial intelligence, machine learning, reinforcement learning, (12 more...)

arXiv.org Artificial Intelligence

2307.09246

Country:

North America > United States (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
Europe > Germany > North Rhine-Westphalia > Cologne Region > Aachen (0.04)

Genre: Research Report (0.64)

Industry:

Construction & Engineering (0.87)
Machinery > Construction Machinery & Heavy Trucks (0.53)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.86)

Add feedback

Unconstrained Online Learning with Unbounded Losses

Jacobsen, Andrew, Cutkosky, Ashok

arXiv.org Artificial IntelligenceJul-14-2023

Algorithms for online learning typically require one or more boundedness assumptions: that the domain is bounded, that the losses are Lipschitz, or both. In this paper, we develop a new setting for online learning with unbounded domains and non-Lipschitz losses. For this setting we provide an algorithm which guarantees $R_{T}(u)\le \tilde O(G\|u\|\sqrt{T}+L\|u\|^{2}\sqrt{T})$ regret on any problem where the subgradients satisfy $\|g_{t}\|\le G+L\|w_{t}\|$, and show that this bound is unimprovable without further assumptions. We leverage this algorithm to develop new saddle-point optimization algorithms that converge in duality gap in unbounded domains, even in the absence of meaningful curvature. Finally, we provide the first algorithm achieving non-trivial dynamic regret in an unbounded domain for non-Lipschitz losses, as well as a matching lower bound. The regret of our dynamic regret algorithm automatically improves to a novel $L^{*}$ bound when the losses are smooth.

artificial intelligence, machine learning, unconstrained online learning, (15 more...)

arXiv.org Artificial Intelligence

2306.04923

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
(5 more...)

Genre: Research Report (0.82)

Industry: Education > Educational Setting > Online (0.83)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.83)

Add feedback

Negated Complementary Commonsense using Large Language Models

Rezaei, Navid, Reformat, Marek Z.

arXiv.org Artificial IntelligenceJul-13-2023

Larger language models, such as GPT-3, have shown to be excellent in many tasks. However, we demonstrate that out-of-ordinary questions can throw the model off guard. This work focuses on finding answers to negated complementary questions in commonsense scenarios. We illustrate how such questions adversely affect the model responses. We propose a model-agnostic methodology to improve the performance in negated complementary scenarios. Our method outperforms few-shot generation from GPT-3 (by more than 11 points) and, more importantly, highlights the significance of studying the response of large language models in negated complementary questions. The code, data, and experiments are available under: https://github.com/navidre/negated_complementary_commonsense.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2307.06794

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(3 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback

Testing different Log Bases For Vector Model Weighting Technique

Assaf, Kamel

arXiv.org Artificial IntelligenceJul-12-2023

Information retrieval systems retrieves relevant documents based on a query submitted by the user. The documents are initially indexed and the words in the documents are assigned weights using a weighting technique called TFIDF which is the product of Term Frequency (TF) and Inverse Document Frequency (IDF). TF represents the number of occurrences of a term in a document. IDF measures whether the term is common or rare across all documents. It is computed by dividing the total number of documents in the system by the number of documents containing the term and then computing the logarithm of the quotient. By default, we use base 10 to calculate the logarithm. In this paper, we are going to test this weighting technique by using a range of log bases from 0.1 to 100.0 to calculate the IDF. Testing different log bases for vector model weighting technique is to highlight the importance of understanding the performance of the system at different weighting values. We use the documents of MED, CRAN, NPL, LISA, and CISI test collections that scientists assembled explicitly for experiments in data information retrieval systems.

information retrieval, natural language, test collection, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.5121/ijnlc.2023.12301

2307.06213

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)

Add feedback

Observation of high-energy neutrinos from the Galactic plane

Abbasi, R., Ackermann, M., Adams, J., Aguilar, J. A., Ahlers, M., Ahrens, M., Alameddine, J. M., Alves, A. A. Jr., Amin, N. M., Andeen, K., Anderson, T., Anton, G., Argüelles, C., Ashida, Y., Athanasiadou, S., Axani, S., Bai, X., V., A. Balagopal, Barwick, S. W., Basu, V., Baur, S., Bay, R., Beatty, J. J., Becker, K. -H., Tjus, J. Becker, Beise, J., Bellenghi, C., Benda, S., BenZvi, S., Berley, D., Bernardini, E., Besson, D. Z., Binder, G., Bindig, D., Blaufuss, E., Blot, S., Boddenberg, M., Bontempo, F., Book, J. Y., Borowka, J., Böser, S., Botner, O., Böttcher, J., Bourbeau, E., Bradascio, F., Braun, J., Brinson, B., Bron, S., Brostean-Kaiser, J., Burley, R. T., Busse, R. S., Campana, M. A., Carnie-Bronca, E. G., Chen, C., Chen, Z., Chirkin, D., Choi, K., Clark, B. A., Clark, K., Classen, L., Coleman, A., Collin, G. H., Connolly, A., Conrad, J. M., Coppin, P., Correa, P., Cowen, D. F., Cross, R., Dappen, C., Dave, P., De Clercq, C., DeLaunay, J. J., López, D. Delgado, Dembinski, H., Deoskar, K., Desai, A., Desiati, P., de Vries, K. D., de Wasseige, G., DeYoung, T., Diaz, A., Díaz-Vélez, J. C., Dittmer, M., Dujmovic, H., Dunkman, M., DuVernois, M. A., Ehrhardt, T., Eller, P., Engel, R., Erpenbeck, H., Evans, J., Evenson, P. A., Fan, K. L., Fazely, A. R., Fedynitch, A., Feigl, N., Fiedlschuster, S., Fienberg, A. T., Finley, C., Fischer, L., Fox, D., Franckowiak, A., Friedman, E., Fritz, A., Fürst, P., Gaisser, T. K., Gallagher, J., Ganster, E., Garcia, A., Garrappa, S., Gerhardt, L., Ghadimi, A., Glaser, C., Glauch, T., Glüsenkamp, T., Goehlke, N., Goldschmidt, A., Gonzalez, J. G., Goswami, S., Grant, D., Grégoire, T., Griswold, S., Günther, C., Gutjahr, P., Haack, C., Hallgren, A., Halliday, R., Halve, L., Halzen, F., Minh, M. Ha, Hanson, K., Hardin, J., Harnisch, A. A., Haungs, A., Helbing, K., Henningsen, F., Hettinger, E. C., Hickford, S., Hignight, J., Hill, C., Hill, G. C., Hoffman, K. D., Hoshina, K., Hou, W., Huang, F., Huber, M., Huber, T., Hultqvist, K., Hünnefeld, M., Hussain, R., Hymon, K., In, S., Iovine, N., Ishihara, A., Jansson, M., Japaridze, G. S., Jeong, M., Jin, M., Jones, B. J. P., Kang, D., Kang, W., Kang, X., Kappes, A., Kappesser, D., Kardum, L., Karg, T., Karl, M., Karle, A., Katz, U., Kauer, M., Kellermann, M., Kelley, J. L., Kheirandish, A., Kin, K., Kiryluk, J., Klein, S. R., Kochocki, A., Koirala, R., Kolanoski, H., Kontrimas, T., Köpke, L., Kopper, C., Kopper, S., Koskinen, D. J., Koundal, P., Kovacevich, M., Kowalski, M., Kozynets, T., Krupczak, E., Kun, E., Kurahashi, N., Lad, N., Gualda, C. Lagunas, Lanfranchi, J. L., Larson, M. J., Lauber, F., Lazar, J. P., Lee, J. W., Leonard, K., Leszczyńska, A., Li, Y., Lincetto, M., Liu, Q. R., Liubarska, M., Lohfink, E., Mariscal, C. J. Lozano, Lu, L., Lucarelli, F., Ludwig, A., Luszczak, W., Lyu, Y., Ma, W. Y., Madsen, J., Mahn, K. B. M., Makino, Y., Mancina, S., Mariş, I. C., Martinez-Soler, I., Maruyama, R., McCarthy, S., McElroy, T., McNally, F., Mead, J. V., Meagher, K., Mechbal, S., Medina, A., Meier, M., Meighen-Berger, S., Merckx, Y., Micallef, J., Mockler, D., Montaruli, T., Moore, R. W., Morik, K., Morse, R., Moulai, M., Mukherjee, T., Naab, R., Nagai, R., Nahnhauer, R., Naumann, U., Necker, J., Nguyen, L. V., Niederhausen, H., Nisa, M. U., Nowicki, S. C., Nygren, D., Pollmann, A. Obertacke, Oehler, M., Oeyen, B., Olivas, A., O'Sullivan, E., Pandya, H., Pankova, D. V., Park, N., Parker, G. K., Paudel, E. N., Paul, L., Heros, C. Pérez de los, Peters, L., Peterson, J., Philippen, S., Pieper, S., Pizzuto, A., Plum, M., Popovych, Y., Porcelli, A., Rodriguez, M. Prado, Pries, B., Przybylski, G. T., Raab, C., Rack-Helleis, J., Raissi, A., Rameez, M., Rawlins, K., Rea, I. C., Rechav, Z., Rehman, A., Reichherzer, P., Reimann, R., Renzi, G., Resconi, E., Reusch, S., Rhode, W., Richman, M., Riedel, B., Roberts, E. J., Robertson, S., Roellinghoff, G., Rongen, M., Rott, C., Ruhe, T., Ryckbosch, D., Cantu, D. Rysewyk, Safa, I., Saffer, J., Salazar-Gallegos, D., Sampathkumar, P., Herrera, S. E. Sanchez, Sandrock, A., Santander, M., Sarkar, S., Sarkar, S., Satalecka, K., Schaufel, M., Schieler, H., Schindler, S., Schmidt, T., Schneider, A., Schneider, J., Schröder, F. G., Schumacher, L., Schwefer, G., Sclafani, S., Seckel, D., Seunarine, S., Sharma, A., Shefali, S., Shimizu, N., Silva, M., Skrzypek, B., Smithers, B., Snihur, R., Soedingrekso, J., Sogaard, A., Soldin, D., Spannfellner, C., Spiczak, G. M., Spiering, C., Stamatikos, M., Stanev, T., Stein, R., Stettner, J., Stezelberger, T., Stokstad, B., Stürwald, T., Stuttard, T., Sullivan, G. W., Taboada, I., Ter-Antonyan, S., Thwaites, J., Tilav, S., Tischbein, F., Tollefson, K., Tönnis, C., Toscano, S., Tosi, D., Trettin, A., Tselengidou, M., Tung, C. F., Turcati, A., Turcotte, R., Turley, C. F., Twagirayezu, J. P., Ty, B., Elorrieta, M. A. Unland, Valtonen-Mattila, N., Vandenbroucke, J., van Eijndhoven, N., Vannerom, D., van Santen, J., Veitch-Michaelis, J., Verpoest, S., Walck, C., Wang, W., Watson, T. B., Weaver, C., Weigel, P., Weindl, A., Weiss, M. J., Weldert, J., Wendt, C., Werthebach, J., Weyrauch, M., Whitehorn, N., Wiebusch, C. H., Willey, N., Williams, D. R., Wolf, M., Wrede, G., Wulff, J., Xu, X. W., Yanez, J. P., Yildizci, E., Yoshida, S., Yu, S., Yuan, T., Zhang, Z., Zhelnin, P.

arXiv.org Artificial IntelligenceJul-10-2023

The origin of high-energy cosmic rays, atomic nuclei that continuously impact Earth's atmosphere, has been a mystery for over a century. Due to deflection in interstellar magnetic fields, cosmic rays from the Milky Way arrive at Earth from random directions. However, near their sources and during propagation, cosmic rays interact with matter and produce high-energy neutrinos. We search for neutrino emission using machine learning techniques applied to ten years of data from the IceCube Neutrino Observatory. We identify neutrino emission from the Galactic plane at the 4.5$\sigma$ level of significance, by comparing diffuse emission models to a background-only hypothesis. The signal is consistent with modeled diffuse emission from the Galactic plane, but could also arise from a population of unresolved point sources.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1126/science.adc9818

2307.04427

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
North America > United States > Wisconsin > Dane County > Madison (0.14)
North America > United States > California > Alameda County > Berkeley (0.14)
(50 more...)

Genre: Research Report > Experimental Study (0.47)

Industry:

Government > Regional Government > North America Government > United States Government (0.67)
Energy (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Truth Discovery in Sequence Labels from Crowds

Sabetpour, Nasim, Kulkarni, Adithya, Xie, Sihong, Li, Qi

arXiv.org Artificial IntelligenceJul-1-2023

Annotation quality and quantity positively affect the learning performance of sequence labeling, a vital task in Natural Language Processing. Hiring domain experts to annotate a corpus is very costly in terms of money and time. Crowdsourcing platforms, such as Amazon Mechanical Turk (AMT), have been deployed to assist in this purpose. However, the annotations collected this way are prone to human errors due to the lack of expertise of the crowd workers. Existing literature in annotation aggregation assumes that annotations are independent and thus faces challenges when handling the sequential label aggregation tasks with complex dependencies. To conquer the challenges, we propose an optimization-based method that infers the ground truth labels using annotations provided by workers for sequential labeling tasks. The proposed Aggregation method for Sequential Labels from Crowds ($AggSLC$) jointly considers the characteristics of sequential labeling tasks, workers' reliabilities, and advanced machine learning techniques. Theoretical analysis on the algorithm's convergence further demonstrates that the proposed $AggSLC$ halts after a finite number of iterations. We evaluate $AggSLC$ on different crowdsourced datasets for Named Entity Recognition (NER) tasks and Information Extraction tasks in biomedical (PICO), as well as a simulated dataset. Our results show that the proposed method outperforms the state-of-the-art aggregation methods. To achieve insights into the framework, we study the effectiveness of $AggSLC$'s components through ablation studies.

annotation, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2109.0447

Country:

North America > United States > Iowa > Story County > Ames (0.04)
North America > United States > Pennsylvania > Northampton County > Bethlehem (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.86)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (0.89)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.89)
(4 more...)

Add feedback

Game Level Blending using a Learned Level Representation

Atmakuri, Venkata Sai Revanth, Cooper, Seth, Guzdial, Matthew

arXiv.org Artificial IntelligenceJun-28-2023

Game level blending via machine learning, the process of combining features of game levels to create unique and novel game levels using Procedural Content Generation via Machine Learning (PCGML) techniques, has gained increasing popularity in recent years. However, many existing techniques rely on human-annotated level representations, which limits game level blending to a limited number of annotated games. Even with annotated games, researchers often need to author an additional shared representation to make blending possible. In this paper, we present a novel approach to game level blending that employs Clustering-based Tile Embeddings (CTE), a learned level representation technique that can serve as a level representation for unannotated games and a unified level representation across games without the need for human annotation. CTE represents game level tiles as a continuous vector representation, unifying their visual, contextual, and behavioral information. We apply this approach to two classic Nintendo games, Lode Runner and The Legend of Zelda. We run an evaluation comparing the CTE representation to a common, human-annotated representation in the blending task and find that CTE has comparable or better performance without the need for human annotation.

artificial intelligence, machine learning, representation, (17 more...)

arXiv.org Artificial Intelligence

2306.16666

Country:

North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
North America > United States (0.04)

Genre: Research Report (0.84)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback