AITopics | Edmonton

Collaborating Authors

Edmonton

A Transformer-Based Framework for Payload Malware Detection and Classification

Stein, Kyle, Mahyari, Arash, Francia, Guillermo III, El-Sheikh, Eman

arXiv.org Artificial IntelligenceMar-26-2024

As malicious cyber threats become more sophisticated in breaching computer networks, the need for effective intrusion detection systems (IDSs) becomes crucial. Techniques such as Deep Packet Inspection (DPI) have been introduced to allow IDSs analyze the content of network packets, providing more context for identifying potential threats. IDSs traditionally rely on using anomaly-based and signature-based detection techniques to detect unrecognized and suspicious activity. Deep learning techniques have shown great potential in DPI for IDSs due to their efficiency in learning intricate patterns from the packet content being transmitted through the network. In this paper, we propose a revolutionary DPI algorithm based on transformers adapted for the purpose of detecting malicious traffic with a classifier head. Transformers learn the complex content of sequence data and generalize them well to similar scenarios thanks to their self-attention mechanism. Our proposed method uses the raw payload bytes that represent the packet contents and is deployed as man-in-the-middle. The payload bytes are used to detect malicious packets and classify their types. Experimental results on the UNSW-NB15 and CIC-IOT23 datasets demonstrate that our transformer-based model is effective in distinguishing malicious from benign traffic in the test dataset, attaining an average accuracy of 79\% using binary classification and 72\% on the multi-classification experiment, both using solely payload bytes.

dataset, packet, payload, (16 more...)

arXiv.org Artificial Intelligence

2403.18223

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Florida > Escambia County > Pensacola (0.05)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Few-shot Named Entity Recognition via Superposition Concept Discrimination

Chen, Jiawei, Lin, Hongyu, Han, Xianpei, Lu, Yaojie, Jiang, Shanshan, Dong, Bin, Sun, Le

arXiv.org Artificial IntelligenceMar-25-2024

Few-shot NER aims to identify entities of target types with only limited number of illustrative instances. Unfortunately, few-shot NER is severely challenged by the intrinsic precise generalization problem, i.e., it is hard to accurately determine the desired target type due to the ambiguity stemming from information deficiency. In this paper, we propose Superposition Concept Discriminator (SuperCD), which resolves the above challenge via an active learning paradigm. Specifically, a concept extractor is first introduced to identify superposition concepts from illustrative instances, with each concept corresponding to a possible generalization boundary. Then a superposition instance retriever is applied to retrieve corresponding instances of these superposition concepts from large-scale text corpus. Finally, annotators are asked to annotate the retrieved instances and these annotated instances together with original illustrative instances are used to learn FS-NER models. To this end, we learn a universal concept extractor and superposition instance retriever using a large-scale openly available knowledge bases. Experiments show that SuperCD can effectively identify superposition concepts from illustrative instances, retrieve superposition instances from large-scale corpus, and significantly improve the few-shot NER performance with minimal additional efforts.

computational linguistic, supercd, superposition concept, (13 more...)

arXiv.org Artificial Intelligence

2403.16463

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > China > Beijing > Beijing (0.05)
Europe > Ireland > Leinster > County Dublin > Dublin (0.05)
(10 more...)

Genre: Research Report (0.64)

Industry: Education > Curriculum > Subject-Specific Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Deep Learning Based Sphere Decoding

Mohammadkarimi, Mostafa, Mehrabi, Mehrtash, Ardakani, Masoud, Jing, Yindi

arXiv.org Artificial IntelligenceMar-25-2024

In this paper, a deep learning (DL)-based sphere decoding algorithm is proposed, where the radius of the decoding hypersphere is learned by a deep neural network (DNN). The performance achieved by the proposed algorithm is very close to the optimal maximum likelihood decoding (MLD) over a wide range of signal-to-noise ratios (SNRs), while the computational complexity, compared to existing sphere decoding variants, is significantly reduced. This improvement is attributed to DNN's ability of intelligently learning the radius of the hypersphere used in decoding. The expected complexity of the proposed DL-based algorithm is analytically derived and compared with existing ones. It is shown that the number of lattice points inside the decoding hypersphere drastically reduces in the DL-based algorithm in both the average and worst-case senses. The effectiveness of the proposed algorithm is shown through simulation for high-dimensional multiple-input multiple-output (MIMO) systems, using high-order modulations.

algorithm, complexity, lattice point, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TWC.2019.2924220

1807.03162

Country:

North America > United States > New York (0.04)
North America > Canada > Quebec > Montreal (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Distributed Robust Learning based Formation Control of Mobile Robots based on Bioinspired Neural Dynamics

Xu, Zhe, Yan, Tao, Yang, Simon X., Gadsden, S. Andrew, Biglarbegian, Mohammad

arXiv.org Artificial IntelligenceMar-23-2024

This paper addresses the challenges of distributed formation control in multiple mobile robots, introducing a novel approach that enhances real-world practicability. We first introduce a distributed estimator using a variable structure and cascaded design technique, eliminating the need for derivative information to improve the real time performance. Then, a kinematic tracking control method is developed utilizing a bioinspired neural dynamic-based approach aimed at providing smooth control inputs and effectively resolving the speed jump issue. Furthermore, to address the challenges for robots operating with completely unknown dynamics and disturbances, a learning-based robust dynamic controller is developed. This controller provides real time parameter estimates while maintaining its robustness against disturbances. The overall stability of the proposed method is proved with rigorous mathematical analysis. At last, multiple comprehensive simulation studies have shown the advantages and effectiveness of the proposed method.

controller, mobile robot, robot, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TIV.2024.3380000

2403.15716

Country:

North America > Canada > Ontario > Hamilton (0.14)
North America > Canada > Ontario > National Capital Region > Ottawa (0.14)
North America > Canada > Ontario > Wellington County > Guelph (0.14)
(9 more...)

Genre:

Research Report > Promising Solution (0.66)
Research Report > New Finding (0.46)

Industry: Government > Regional Government > North America Government > United States Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Robots > Locomotion (0.66)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.49)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.47)

Add feedback

BEND: Bagging Deep Learning Training Based on Efficient Neural Network Diffusion

Wei, Jia, Zhang, Xingjun, Pedrycz, Witold

arXiv.org Artificial IntelligenceMar-23-2024

Bagging has achieved great success in the field of machine learning by integrating multiple base classifiers to build a single strong classifier to reduce model variance. The performance improvement of bagging mainly relies on the number and diversity of base classifiers. However, traditional deep learning model training methods are expensive to train individually and difficult to train multiple models with low similarity in a restricted dataset. Recently, diffusion models, which have been tremendously successful in the fields of imaging and vision, have been found to be effective in generating neural network model weights and biases with diversity. We creatively propose a Bagging deep learning training algorithm based on Efficient Neural network Diffusion (BEND). The originality of BEND comes from the first use of a neural network diffusion model to efficiently build base classifiers for bagging. Our approach is simple but effective, first using multiple trained model weights and biases as inputs to train autoencoder and latent diffusion model to realize a diffusion model from noise to valid neural network parameters. Subsequently, we generate several base classifiers using the trained diffusion model. Finally, we integrate these ba se classifiers for various inference tasks using the Bagging method. Resulting experiments on multiple models and datasets show that our proposed BEND algorithm can consistently outperform the mean and median accuracies of both the original trained model and the diffused model. At the same time, new models diffused using the diffusion model have higher diversity and lower cost than multiple models trained using traditional methods. The BEND approach successfully introduces diffusion models into the new deep learning training domain and provides a new paradigm for future deep learning training and inference.

classifier, diffusion model, model parameter, (12 more...)

arXiv.org Artificial Intelligence

2403.15766

Country:

Asia > China > Shaanxi Province > Xi'an (0.05)
North America > United States > California > San Diego County > San Diego (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Research Re: search & Re-search

Plaat, Aske

arXiv.org Artificial IntelligenceMar-20-2024

Search algorithms are often categorized by their node expansion strategy. One option is the depth-first strategy, a simple backtracking strategy that traverses the search space in the order in which successor nodes are generated. An alternative is the best-first strategy, which was designed to make it possible to use domain-specific heuristic information. By exploring promising parts of the search space first, best-first algorithms are usually more efficient than depth-first algorithms. In programs that play minimax games such as chess and checkers, the efficiency of the search is of crucial importance. Given the success of best-first algorithms in other domains, one would expect them to be used for minimax games too. However, all high-performance game-playing programs are based on a depth-first algorithm. This study takes a closer look at a depth-first algorithm, AB, and a best-first algorithm, SSS. The prevailing opinion on these algorithms is that SSS offers the potential for a more efficient search, but that its complicated formulation and exponential memory requirements render it impractical. The theoretical part of this work shows that there is a surprisingly straightforward link between the two algorithms -- for all practical purposes, SSS is a special case of AB. Subsequent empirical evidence proves the prevailing opinion on SSS to be wrong: it is not a complicated algorithm, it does not need too much memory, and it is also not more efficient than depth-first search.

alpha-beta algorithm, computer science, total node relative, (16 more...)

arXiv.org Artificial Intelligence

2403.13705

Country:

North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.28)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.27)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
(32 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Leisure & Entertainment > Games > Chess (1.00)
Government (1.00)
Banking & Finance > Economy (1.00)
Transportation (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Add feedback

Weisfeiler and Leman Go Loopy: A New Hierarchy for Graph Representational Learning

Paolino, Raffaele, Maskey, Sohir, Welke, Pascal, Kutyniok, Gitta

arXiv.org Artificial IntelligenceMar-20-2024

For example, in organic chemistry or bioinformatics, different types of cycles can impact We introduce r-loopy Weisfeiler-Leman (r-lWL), various chemical properties of the underlying molecules a novel hierarchy of graph isomorphism tests and (Deshpande et al., 2002; Koyutürk et al., 2004). Therefore, a corresponding GNN framework, r-lMPNN, that it is crucial to investigate whether GNNs can count certain can count cycles up to length r + 2. Most notably, substructures and to design architectures that surpass the we show that r-lWL can count homomorphisms limited power of MPNNs. of cactus graphs. This strictly extends classical Many models have been proposed to match or surpass the 1-WL, which can only count homomorphisms of expressive power of WL. Several draw inspiration from trees and, in fact, is incomparable to k-WL for any higher-order variants of the WL algorithm (Morris et al., fixed k. We empirically validate the expressive 2019), enabling them to count a broader range of substructures.

graph, homomorphism, isomorphism, (14 more...)

arXiv.org Artificial Intelligence

2403.13749

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Austria > Vienna (0.14)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(6 more...)

Genre: Research Report (0.81)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Add feedback

Relaxed Clipping: A Global Training Method for Robust Regression and Classification

Neural Information Processing SystemsMar-15-2024, 15:24:19 GMT

Robust regression and classification are often thought to require non-convex loss functions that prevent scalable, global training. However, such a view neglects the possibility of reformulated training methods that can yield practically solvable alternatives. A natural way to make a loss function more robust to outliers is to truncate loss values that exceed a maximum threshold. We demonstrate that a relaxation of this form of "loss clipping" can be made globally solvable and applicable to any standard loss while guaranteeing robustness against outliers. We present a generic procedure that can be applied to standard loss functions and demonstrate improved robustness in regression and classification problems.

loss function, outlier, robustness, (14 more...)

Neural Information Processing Systems

Country: North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)

Genre: Research Report (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)

Add feedback

On Strategy Stitching in Large Extensive Form Multiplayer Games

Neural Information Processing SystemsMar-15-2024, 04:59:05 GMT

Computing a good strategy in a large extensive form game often demands an extraordinary amount of computer memory, necessitating the use of abstraction to reduce the game size. Typically, strategies from abstract games perform better in the real game as the granularity of abstraction is increased. This paper investigates two techniques for stitching a base strategy in a coarse abstraction of the full game tree, to expert strategies in fine abstractions of smaller subtrees. We provide a general framework for creating static experts, an approach that generalizes some previous strategy stitching efforts. In addition, we show that static experts can create strong agents for both 2-player and 3-player Leduc and Limit Texas Hold'em poker, and that a specific class of static experts can be preferred among a number of alternatives. Furthermore, we describe a poker agent that used static experts and won the 3-player events of the 2010 Annual Computer Poker Competition.

abstraction, information, static expert, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.25)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games > Poker (0.87)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Games > Poker (1.00)

Add feedback

Learning Patient-Specific Cancer Survival Distributions as a Sequence of Dependent Regressors

Neural Information Processing SystemsMar-14-2024, 22:42:20 GMT

An accurate model of patient survival time can help in the treatment and care of cancer patients. The common practice of providing survival time estimates based only on population averages for the site and stage of cancer ignores many important individual differences among patients. In this paper, we propose a local regression method for learning patient-specific survival time distribution based on patient attributes such as blood tests and clinical assessments. When tested on a cohort of more than 2000 cancer patients, our method gives survival time predictions that are much more accurate than popular survival analysis models such as the Cox and Aalen regression models. Our results also show that using patient-specific attributes can reduce the prediction error on survival time by as much as 20% when compared to using cancer site and stage only.

prediction, regression model, survival time, (14 more...)

Neural Information Processing Systems

Country:

North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
North America > United States > New York (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback