Barnes, Nick
A Comprehensive Overview of Large Language Models
Naveed, Humza, Khan, Asad Ullah, Qiu, Shi, Saqib, Muhammad, Anwar, Saeed, Usman, Muhammad, Akhtar, Naveed, Barnes, Nick, Mian, Ajmal
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success has led to a large influx of research contributions on the topic. These works encompass diverse topics such as architectural innovations, better training strategies, context length improvements, fine-tuning, multi-modal LLMs, robotics, datasets, benchmarking, efficiency, and more. With the rapid development of techniques and regular breakthroughs in LLM research, it has become considerably challenging to perceive the bigger picture of advances in the field. Considering the rapidly emerging plethora of literature on LLMs, it is imperative that the research community be able to benefit from a concise yet comprehensive overview of recent developments. This article provides an overview of the existing literature on a broad range of LLM-related concepts. Our self-contained, comprehensive overview of LLMs discusses the relevant background concepts and covers the advanced topics at the frontier of LLM research. This review is intended to provide not only a systematic survey but also a quick, comprehensive reference for researchers and practitioners to draw insights from extensive, informative summaries of existing works to advance LLM research.
Model Calibration in Dense Classification with Adaptive Label Perturbation
Liu, Jiawei, Ye, Changkun, Wang, Shan, Cui, Ruikai, Zhang, Jing, Zhang, Kaihao, Barnes, Nick
For safety-related applications, it is crucial to produce trustworthy deep neural networks whose predictions are associated with confidence values that represent the likelihood of correctness for subsequent decision-making. Existing dense binary classification models are prone to being over-confident. To improve model calibration, we propose Adaptive Stochastic Label Perturbation (ASLP), which learns a unique label perturbation level for each training image. ASLP employs our proposed Self-Calibrating Binary Cross Entropy (SC-BCE) loss, which unifies label perturbation processes, including stochastic approaches (like DisturbLabel) and label smoothing, to correct calibration while maintaining classification rates. ASLP follows the Maximum Entropy Inference of classic statistical mechanics to maximise prediction entropy with respect to missing information. It does so while either (1) preserving classification accuracy on known data as a conservative solution, or (2) specifically improving model calibration by minimising the gap between the prediction accuracy and the expected confidence of the target training label. Extensive results demonstrate that ASLP can significantly improve the calibration of dense binary classification models on both in-distribution and out-of-distribution data. The code is available at https://github.com/Carlisle-Liu/ASLP.
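As a rough illustration of the idea (not the authors' implementation), the sketch below softens a binary target by a perturbation level alpha inside a standard BCE loss: alpha = 0 recovers plain BCE, a fixed alpha behaves like label smoothing, and a per-image stochastic alpha mimics DisturbLabel-style perturbation. The interpolation form and the name sc_bce_loss are assumptions.

```python
import torch.nn.functional as F

def sc_bce_loss(logits, targets, alpha):
    """Hypothetical sketch of a self-calibrating BCE: perturb the binary
    target towards the opposite class by level alpha before applying BCE.
    alpha may be a scalar or a per-image tensor broadcastable to targets."""
    soft_targets = targets * (1.0 - alpha) + (1.0 - targets) * alpha
    return F.binary_cross_entropy_with_logits(logits, soft_targets)

# alpha = 0.0 -> standard BCE
# alpha = 0.1 -> label-smoothing-like softened targets
# alpha sampled per image (e.g., Bernoulli-gated) -> stochastic perturbation
```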
Robust normalizing flows using Bernstein-type polynomials
Ramasinghe, Sameera, Fernando, Kasun, Khan, Salman, Barnes, Nick
Normalizing flows (NFs) are a category of generative models that enable exact density computation and efficient sampling: a series of diffeomorphisms transforms a simple distribution into a more complex one, which in turn allows analytical density estimation of samples (Rezende & Mohamed, 2015; Kobyzev et al., 2020). We propose a framework to construct NFs based on increasing triangular maps and Bernstein-type polynomials. Compared to the existing (universal) NF frameworks, our method provides compelling advantages like theoretical upper bounds for the approximation error, robustness, higher interpretability, suitability for compactly supported densities, and the ability to employ higher-degree polynomials without training instability. Moreover, we provide a constructive universality proof, which gives analytic expressions of the approximations for known transformations.
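For intuition, here is a toy sketch of the core ingredient, under the assumption that each 1D component of the triangular map takes the standard Bernstein form: with non-decreasing coefficients, the polynomial is increasing on [0, 1] and hence invertible, and its derivative supplies the change-of-variables log-determinant. Function names are illustrative.

```python
import numpy as np
from scipy.stats import binom

def bernstein_map(x, theta):
    """T(x) = sum_k theta_k * b_{k,n}(x) with the Bernstein basis on [0, 1].
    If theta is non-decreasing, T is increasing, hence a valid flow component."""
    n = len(theta) - 1
    k = np.arange(n + 1)
    # The Bernstein basis b_{k,n}(x) equals the Binomial(n, x) pmf at k.
    basis = binom.pmf(k[None, :], n, x[:, None])
    return basis @ np.asarray(theta)

def bernstein_deriv(x, theta):
    """T'(x) = n * sum_k (theta_{k+1} - theta_k) * b_{k,n-1}(x); its log gives
    the change-of-variables term for the density."""
    d = np.diff(theta)
    n = len(theta) - 1
    k = np.arange(n)
    return n * (binom.pmf(k[None, :], n - 1, x[:, None]) @ d)
```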
Attention Guided Semantic Relationship Parsing for Visual Question Answering
Farazi, Moshiur, Khan, Salman, Barnes, Nick
Humans explain inter-object relationships with semantic labels that demonstrate the high-level understanding required to perform complex Vision-Language tasks such as Visual Question Answering (VQA). However, existing VQA models represent relationships as a combination of object-level visual features, which constrains a model to expressing interactions between objects in a single domain even while it is trying to solve a multi-modal task. In this paper, we propose a general-purpose semantic relationship parser, which generates a semantic feature vector for each subject-predicate-object triplet in an image, and a Mutual and Self Attention (MSA) mechanism that learns to identify the relationship triplets that are important for answering the given question. To motivate the significance of semantic relationships, we show an oracle setting with ground-truth relationship triplets, where our model achieves a ~25% accuracy gain over the closest state-of-the-art model on the challenging GQA dataset. Further, with our semantic parser, we show that our model outperforms other comparable approaches on the VQA and GQA datasets.
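The following PyTorch sketch conveys one plausible reading of the MSA block (the paper's exact architecture may differ): the question attends over triplet features (mutual attention) while triplets also attend over themselves (self attention). Head count, mean pooling, and additive fusion are guesses.

```python
import torch
import torch.nn as nn

class MutualSelfAttention(nn.Module):
    """Loose sketch of a Mutual-and-Self Attention block for VQA."""
    def __init__(self, dim, heads=4):
        super().__init__()
        self.mutual = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.self_attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, triplets, question):
        # triplets: (B, T, dim) features of subject-predicate-object triplets
        # question: (B, Q, dim) encoded question tokens
        mutual, _ = self.mutual(question, triplets, triplets)    # question attends to triplets
        selfed, _ = self.self_attn(triplets, triplets, triplets) # triplets contextualize each other
        return mutual.mean(dim=1) + selfed.mean(dim=1)           # (B, dim) fused feature
```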
Blended Convolution and Synthesis for Efficient Discrimination of 3D Shapes
Ramasinghe, Sameera, Khan, Salman, Barnes, Nick, Gould, Stephen
Existing networks directly learn feature representations on 3D point clouds for shape analysis. We argue that 3D point clouds are highly redundant and hold an irregular (permutation-invariant) structure, which makes it difficult to achieve inter-class discrimination efficiently. In this paper, we propose a two-faceted solution to this problem that is seamlessly integrated into a single `Blended Convolution and Synthesis' layer. This fully differentiable layer performs two critical tasks in succession. In the first step, it projects the input 3D point clouds into a latent 3D space to synthesize a highly compact and more inter-class discriminative point cloud representation. Since 3D point clouds do not follow a Euclidean topology, standard 2D/3D Convolutional Neural Networks offer limited representation capability. Therefore, in the second step, the layer uses a novel 3D convolution operator functioning inside the unit ball ($\mathbb{B}^3$) to extract useful volumetric features. We derive formulae to achieve both translation and rotation of our novel convolution kernels. Finally, using the proposed techniques, we present an extremely lightweight, end-to-end architecture that achieves compelling results on 3D shape recognition and retrieval.
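Purely as a hedged sketch of the first ('synthesis') step rather than the published layer, the module below pools an input cloud into a global feature and emits a compact latent cloud. Note that tanh bounds points in the cube (-1, 1)^3 rather than $\mathbb{B}^3$; a norm-based squashing would be needed for a strict ball constraint. All sizes are assumptions.

```python
import torch
import torch.nn as nn

class LatentSynthesis(nn.Module):
    """Hypothetical stand-in for the synthesis step: input cloud -> compact,
    bounded latent cloud via a permutation-invariant encoder."""
    def __init__(self, n_latent=64):
        super().__init__()
        self.encode = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, 128))
        self.decode = nn.Linear(128, n_latent * 3)
        self.n_latent = n_latent

    def forward(self, pts):                         # pts: (B, N, 3)
        feat = self.encode(pts).max(dim=1).values   # permutation-invariant max pooling
        latent = torch.tanh(self.decode(feat))      # bounded coordinates, (B, n_latent*3)
        return latent.view(-1, self.n_latent, 3)    # compact latent point cloud
```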
Volumetric Convolution: Automatic Representation Learning in Unit Ball
Ramasinghe, Sameera, Khan, Salman, Barnes, Nick
Convolution is an efficient technique to obtain abstract feature representations using hierarchical layers in deep networks. Although performing convolution in Euclidean geometries is fairly straightforward, its extension to other topological spaces---such as a sphere ($\mathbb{S}^2$) or a unit ball ($\mathbb{B}^3$)---entails unique challenges. In this work, we propose a novel `\emph{volumetric convolution}' operation that can effectively convolve arbitrary functions in $\mathbb{B}^3$. We develop a theoretical framework for \emph{volumetric convolution} based on Zernike polynomials and efficiently implement it as a differentiable, easily pluggable layer for deep networks. Furthermore, our formulation leads to the derivation of a novel formula to measure the symmetry of a function in $\mathbb{B}^3$ around an arbitrary axis, which is useful in 3D shape analysis tasks. We demonstrate the efficacy of the proposed volumetric convolution operation on a representative use case: the 3D object recognition task.
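For orientation, a function on the unit ball can be expanded in 3D Zernike functions $Z_{nlm} = R_{nl}(r) Y_{lm}(\theta, \phi)$, as written below; the closing comment states a convolution identity only by analogy with the spherical convolution theorem, as a hedged paraphrase rather than the paper's exact formula.

```latex
% Zernike expansion of f over the unit ball \mathbb{B}^3
f(r,\theta,\phi) = \sum_{n}\sum_{l}\sum_{m=-l}^{l} \hat{f}_{nlm}\, Z_{nlm}(r,\theta,\phi),
\qquad
\hat{f}_{nlm} = \int_{\mathbb{B}^3} f(r,\theta,\phi)\, \overline{Z_{nlm}(r,\theta,\phi)}\, dV.
% By analogy with the spherical convolution theorem, convolving f with an
% axially symmetric kernel g is expected to reduce to coefficient products
% of the form \hat{f}_{nlm}\,\hat{g}_{nl0}.
```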
AI@NICTA
Barnes, Nick (NICTA) | Baumgartner, Peter (NICTA) | Caetano, Tiberio (NICTA) | Durrant-Whyte, Hugh (NICTA) | Klein, Gerwin (NICTA) | Sanderson, Penelope (University of Queensland) | Sattar, Abdul (Griffith University) | Stuckey, Peter (The University of Melbourne) | Thiebaux, Sylvie (The Australian National University) | Van Hentenryck, Pascal (University of Melbourne) | Walsh, Toby (NICTA)
NICTA is Australia's Information and Communications Technology (ICT) Centre of Excellence. It is the largest organization in Australia dedicated to ICT research. While it has close links with local universities, it is in fact an independent, not-for-profit company in the business of doing research, commercializing that research, and training PhD students to do that research. Much of the work taking place at NICTA involves various topics in artificial intelligence. In this article, we survey some of the AI work being undertaken at NICTA.
Totally Corrective Boosting for Regularized Risk Minimization
Shen, Chunhua, Li, Hanxi, Barnes, Nick
Consideration of the primal and dual problems together leads to important new insights into the characteristics of boosting algorithms. In this work, we propose a general framework that can be used to design new boosting algorithms. A wide variety of machine learning problems essentially minimize a regularized risk functional. We show that the proposed boosting framework, termed CGBoost, can accommodate various loss functions and different regularizers in a totally corrective optimization fashion. We show that, by solving the primal rather than the dual, a large body of totally corrective boosting algorithms can be solved efficiently without sophisticated convex optimization solvers. We also demonstrate that some boosting algorithms, like AdaBoost, can be interpreted in our framework even though their optimization is not totally corrective. We empirically show that various boosting algorithms based on the proposed framework perform similarly on the UC Irvine machine learning datasets [1] used in our experiments.
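A toy sketch of the primal, totally corrective scheme described above: each round, a column-generation step picks the weak learner with the largest edge, then all weights are re-optimized against a regularized risk. The logistic loss, L2 regularizer, and a precomputed weak-learner matrix H are stand-ins for the paper's general setting.

```python
import numpy as np
from scipy.optimize import minimize

def cgboost(H, y, n_rounds=10, reg=1e-2):
    """Toy totally-corrective boosting via column generation.
    H : (n_samples, n_weak) outputs of candidate weak learners in {-1, +1}
        (a stand-in for a weak-learner oracle); y : labels in {-1, +1}."""
    chosen, w = [], np.zeros(0)
    score = np.zeros(len(y))
    for _ in range(n_rounds):
        # Column generation: weight samples by the loss gradient, then pick
        # the weak learner with the largest edge.
        u = 1.0 / (1.0 + np.exp(y * score))
        j = int(np.argmax((u * y) @ H))
        if j in chosen:          # no improving column left
            break
        chosen.append(j)
        Hs = H[:, chosen]
        # Totally corrective step: re-optimize ALL weights in the primal.
        def risk(v):
            margins = y * (Hs @ v)
            return np.mean(np.logaddexp(0.0, -margins)) + reg * (v @ v)
        w = minimize(risk, np.append(w, 0.0), method="L-BFGS-B").x
        score = Hs @ w
    return chosen, w
```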