AITopics | Bennamoun, Mohammed

Collaborating Authors

Bennamoun, Mohammed

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

LCEval: Learned Composite Metric for Caption Evaluation

Sharif, Naeha, White, Lyndon, Bennamoun, Mohammed, Liu, Wei, Shah, Syed Afaq Ali

arXiv.org Artificial IntelligenceDec-24-2020

Automatic evaluation metrics hold a fundamental importance in the development and fine-grained analysis of captioning systems. While current evaluation metrics tend to achieve an acceptable correlation with human judgements at the system level, they fail to do so at the caption level. In this work, we propose a neural network-based learned metric to improve the caption-level caption evaluation. To get a deeper insight into the parameters which impact a learned metrics performance, this paper investigates the relationship between different linguistic features and the caption-level correlation of the learned metrics. We also compare metrics trained with different training examples to measure the variations in their evaluation. Moreover, we perform a robustness analysis, which highlights the sensitivity of learned and handcrafted metrics to various sentence perturbations. Our empirical analysis shows that our proposed metric not only outperforms the existing metrics in terms of caption-level correlation but it also shows a strong system-level correlation against human assessments.

caption, deep learning, neural network, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s11263-019-01206-z

2012.13136

Country: Oceania > Australia > Western Australia (0.14)

Genre: Research Report > New Finding (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

A Practical Guide to Graph Neural Networks

Ward, Isaac Ronald, Joyner, Jack, Lickfold, Casey, Rowe, Stash, Guo, Yulan, Bennamoun, Mohammed

arXiv.org Artificial IntelligenceNov-1-2020

NN variants have been designed to increase performance in certain problem domains; the convolutional neural network (CNN) excels in the context of image-based tasks, and the recurrent neural network (RNN) in the space of natural language processing and time series analysis. NNs have also been leveraged as components in composite DL frameworks -- they have been used as trainable generators and discriminators in generative adversarial networks (GANs), and as encoders and decoders in transformers [46]. Although they seem unrelated, the images used as inputs in computer vision, and the sentences used as inputs in natural language processing can both be represented by a single, general data structure: the graph (see Figure 1). Formally, a graph is a set of distinct vertices (representing items or entities) that are joined optionally to each other by edges (representing relationships). The learning architecture that has been designed to process said graphs is the titular graph neural network (GNN). Uniquely, the graphs fed into a GNN (during training and evaluation) do not have strict structural requirements per se; the number of vertices and edges between input graphs can change. In this way, GNNs can handle unstructured, non-Euclidean data [4], a property which makes them valuable in certain problem domains where graph data is abundant. Conversely, NN-based algorithms are typically required to operate on structured inputs with strictly defined dimensions.

deep learning, neural network, vertex, (20 more...)

arXiv.org Artificial Intelligence

2010.05234

Country: North America > United States > New York > New York County > New York City (0.14)

Genre:

Overview (0.67)
Research Report (0.51)
Instructional Material > Course Syllabus & Notes (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Hands-on Bayesian Neural Networks -- a Tutorial for Deep Learning Users

Jospin, Laurent Valentin, Buntine, Wray, Boussaid, Farid, Laga, Hamid, Bennamoun, Mohammed

arXiv.org Machine LearningJul-14-2020

Modern deep learning methods have equipped researchers and engineers with incredibly powerful tools to tackle problems that previously seemed impossible. However, since deep learning methods operate as black boxes, the uncertainty associated with their predictions is often challenging to quantify. Bayesian statistics offer a formalism to understand and quantify the uncertainty associated with deep neural networks predictions. This paper provides a tutorial for researchers and scientists who are using machine learning, especially deep learning, with an overview of the relevant literature and a complete toolset to design, implement, train, use and evaluate Bayesian neural networks.

deep learning user, hands-on bayesian neural network, tutorial

arXiv.org Machine Learning

2007.06823

Genre: Instructional Material > Course Syllabus & Notes (0.60)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Multi-Kernel Fusion for RBF Neural Networks

Atif, Syed Muhammad, Khan, Shujaat, Naseem, Imran, Togneri, Roberto, Bennamoun, Mohammed

arXiv.org Machine LearningJul-6-2020

A simple yet effective architectural design of radial basis function neural networks (RBFNN) makes them amongst the most popular conventional neural networks. The current generation of radial basis function neural network is equipped with multiple kernels which provide significant performance benefits compared to the previous generation using only a single kernel. In existing multi-kernel RBF algorithms, multi-kernel is formed by the convex combination of the base/primary kernels. In this paper, we propose a novel multi-kernel RBFNN in which every base kernel has its own (local) weight. This novel flexibility in the network provides better performance such as faster convergence rate, better local minima and resilience against stucking in poor local minima. These performance gains are achieved at a competitive computational complexity compared to the contemporary multi-kernel RBF algorithms. The proposed algorithm is thoroughly analysed for performance gain using mathematical and graphical illustrations and also evaluated on three different types of problems namely: (i) pattern classification, (ii) system identification and (iii) function approximation. Empirical results clearly show the superiority of the proposed algorithm compared to the existing state-of-the-art multi-kernel approaches.

algorithm, deep learning, neural network, (19 more...)

arXiv.org Machine Learning

2007.02592

Country: North America > Canada > Quebec (0.14)

Genre:

Overview (0.87)
Research Report > Promising Solution (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Orthogonal Deep Models As Defense Against Black-Box Attacks

Jalwana, Mohammad A. A. K., Akhtar, Naveed, Bennamoun, Mohammed, Mian, Ajmal

arXiv.org Machine LearningJun-26-2020

Deep learning has demonstrated state-of-the-art performance for a variety of challenging computer vision tasks. On one hand, this has enabled deep visual models to pave the way for a plethora of critical applications like disease prognostics and smart surveillance. On the other, deep learning has also been found vulnerable to adversarial attacks, which calls for new techniques to defend deep models against these attacks. Among the attack algorithms, the black-box schemes are of serious practical concern since they only need publicly available knowledge of the targeted model. We carefully analyze the inherent weakness of deep models in black-box settings where the attacker may develop the attack using a model similar to the targeted model. Based on our analysis, we introduce a novel gradient regularization scheme that encourages the internal representation of a deep model to be orthogonal to another, even if the architectures of the two models are similar. Our unique constraint allows a model to concomitantly endeavour for higher accuracy while maintaining near orthogonal alignment of gradients with respect to a reference model. Detailed empirical study verifies that controlled misalignment of gradients under our orthogonality objective significantly boosts a model's robustness against transferable black-box adversarial attacks. In comparison to regular models, the orthogonal models are significantly more robust to a range of $l_p$ norm bounded perturbations. We verify the effectiveness of our technique on a variety of large-scale models.

air transportation, deep learning, perturbation, (19 more...)

arXiv.org Machine Learning

2006.14856

Country:

Oceania > Australia (0.14)
Asia (0.14)

Genre: Research Report (1.00)

Industry:

Transportation > Air (1.00)
Information Technology > Security & Privacy (0.90)
Education > Educational Setting > Higher Education (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

GANgster: A Fraud Review Detector based on Regulated GAN with Data Augmentation

Shehnepoor, Saeedreza, Togneri, Roberto, Liu, Wei, Bennamoun, Mohammed

arXiv.org Machine LearningJun-11-2020

Financial implications of written reviews provide great incentives for businesses to pay fraudsters to write or use bots to generate fraud reviews. The promising performance of Deep Neural Networks (DNNs) in text classification, has attracted research to use them for fraud review detection. However, the lack of trusted labeled data has limited the performance of the current solutions in detecting fraud reviews. Unsupervised and semi-supervised methods are among the most applicable methods to deal with the data scarcity problem. Generative Adversarial Network (GAN) as a semi-supervised method has demonstrated to be effective for data augmentation purposes. The state-of-the-art solution utilizes GAN to overcome the data limitation problem. However, it fails to incorporate the behavioral clues in both fraud generation and detection. Besides, the state-of-the-art approach suffers from a common limitation in the training convergence of the GAN, slowing down the training procedure. In this work, we propose a regularised GAN for fraud review detection that makes use of both review text and review rating scores. Scores are incorporated through Information Gain Maximization in to the loss function for two reasons. One is to generate near-authentic and more human like score-correlated reviews. The other is to improve the stability of the GAN. Experimental results have shown better convergence of the regulated GAN. In addition, the scores are also used in combination with word embeddings of review text as input for the discriminators for better performance. Results show that the proposed framework relatively outperformed existing state-of-the-art framework; namely FakeGAN; in terms of AP by 7%, and 5% on the Yelp and TripAdvisor datasets, respectively.

deep learning, discriminator, law enforcement, (19 more...)

arXiv.org Machine Learning

2006.06561

Country:

North America > United States > California (0.46)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.14)

Genre:

Research Report > Promising Solution (0.68)
Research Report > New Finding (0.66)

Industry:

Law Enforcement & Public Safety > Fraud (0.48)
Media (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

A Novel Adaptive Kernel for the RBF Neural Networks

Khan, Shujaat, Naseem, Imran, Togneri, Roberto, Bennamoun, Mohammed

arXiv.org Machine LearningMay-9-2019

Abstract--In this paper, we propose a novel adaptive kernel for the radial basis function (RBF) neural networks. In [12] a novel RBF network with the multi-kernel is proposed to obtain an optimized and I. INTRODUCTION The unknown centres of the multikernels The RBF neural networks have shown excellent performance are determined by an improved k-means clustering in a number of problems of practical interest. An orthogonal least squares (OLS) algorithm is reservoirs of brine are analyzed for physicochemical properties used to determine the remaining parameters. The convergence of the ACA is analyzed by the [3] the RBF kernel is used to predict the pressure gradient Lyapunov criterion. In the context of nuclear physics, RBF Cognitive Radial Basis Function network (McRBFN) and its has been effectively used to model the stopping power data Projection based Learning (PBL) referred to as PBL-McRBFN of materials as in [4].

kernel, neural network, upstream oil & gas, (20 more...)

arXiv.org Machine Learning

doi: 10.1007/s00034-016-0375-7

1905.03546

Country:

North America > United States (0.28)
Asia (0.28)

Genre: Research Report (0.50)

Industry:

Health & Medicine (0.47)
Education (0.47)
Energy > Oil & Gas > Upstream (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.69)

Add feedback

Improving Image-Based Localization with Deep Learning: The Impact of the Loss Function

Ward, Isaac Ronald, Jalwana, M. A. Asim K., Bennamoun, Mohammed

arXiv.org Artificial IntelligenceApr-28-2019

This work formulates a novel loss term which can be appended to an RGB only image localization network's loss function to improve its performance. A common technique used when regressing a camera's pose from an image is to formulate the loss as a linear combination of positional and rotational error (using tuned hyperparameters as coefficients). In this work we observe that changes to rotation and position mutually affect the captured image, and in order to improve performance, a network's loss function should include a term which combines error in both position and rotation. To that end we design a geometric loss term which considers the similarity between the predicted and ground truth poses using both position and rotation, and use it to augment the existing image localization network PoseNet. The loss term is simply appended to the loss function of the already existing image localization network. We achieve improvements in the localization accuracy of the network for indoor scenes: with decreases of up to 9.64% and 2.99% in the median positional and rotational error when compared to similar pipelines.

deep learning, loss function, neural network, (20 more...)

arXiv.org Artificial Intelligence

1905.03692

Country: Oceania > Australia > Western Australia (0.14)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.84)

Add feedback

Attention in Convolutional LSTM for Gesture Recognition

Zhang, Liang, Zhu, Guangming, Mei, Lin, Shen, Peiyi, Shah, Syed Afaq Ali, Bennamoun, Mohammed

Neural Information Processing SystemsDec-31-2018

Convolutional long short-term memory (LSTM) networks have been widely used for action/gesture recognition, and different attention mechanisms have also been embedded into the LSTM or the convolutional LSTM (ConvLSTM) networks. Based on the previous gesture recognition architectures which combine the three-dimensional convolution neural network (3DCNN) and ConvLSTM, this paper explores the effects of attention mechanism in ConvLSTM. Several variants of ConvLSTM are evaluated: (a) Removing the convolutional structures of the three gates in ConvLSTM, (b) Applying the attention mechanism on the input of ConvLSTM, (c) Reconstructing the input and (d) output gates respectively with the modified channel-wise attention mechanism. The evaluation results demonstrate that the spatial convolutions in the three gates scarcely contribute to the spatiotemporal feature fusion, and the attention mechanisms embedded into the input and output gates cannot improve the feature fusion. In other words, ConvLSTM mainly contributes to the temporal fusion along with the recurrent steps to learn the long-term spatiotemporal features, when taking as input the spatial or spatiotemporal features. On this basis, a new variant of LSTM is derived, in which the convolutional structures are only embedded into the input-to-state transition of LSTM. The code of the LSTM variants is publicly available.

deep learning, neural network, variant, (19 more...)

Neural Information Processing Systems

Country: Oceania > Australia (0.28)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback