AITopics | pre-trained neural network

Collaborating Authors

pre-trained neural network

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

DeiSAM: Segment Anything with Deictic Prompting

Neural Information Processing SystemsMar-20-2026, 20:18:43 GMT

Large-scale, pre-trained neural networks have demonstrated strong capabilities in various tasks, including zero-shot image segmentation. To identify concrete objects in complex scenes, humans instinctively rely on deictic descriptions in natural language, i.e., referring to something depending on the context such as The object that is on the desk and behind the cup.. However, deep learning approaches cannot reliably interpret such deictic representations due to their lack of reasoning capabilities in complex scenarios. To remedy this issue, we propose DeiSAM -- a combination of large pre-trained neural networks with differentiable logic reasoners -- for deictic promptable segmentation. Given a complex, textual segmentation description, DeiSAM leverages Large Language Models (LLMs) to generate first-order logic rules and performs differentiable forward reasoning on generated scene graphs. Subsequently, DeiSAM segments objects by matching them to the logically inferred image regions. As part of our evaluation, we propose the Deictic Visual Genome (DeiVG) dataset, containing paired visual input and complex, deictic textual prompts. Our empirical results demonstrate that DeiSAM is a substantial improvement over purely data-driven baselines for deictic promptable segmentation.

large language model, machine learning, natural language, (8 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)

Add feedback

DeiSAM: Segment Anything with Deictic Prompting

Neural Information Processing SystemsMay-27-2025, 02:41:37 GMT

deictic promptable segmentation, deictic prompting, deisam, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)

Add feedback

Post-Hoc Uncertainty Quantification in Pre-Trained Neural Networks via Activation-Level Gaussian Processes

Bergna, Richard, Depeweg, Stefan, Ordonez, Sergio Calvo, Plenk, Jonathan, Cartea, Alvaro, Hernandez-Lobato, Jose Miguel

arXiv.org Machine LearningFeb-28-2025

Uncertainty quantification in neural networks through methods such as Dropout, Bayesian neural networks and Laplace approximations is either prone to underfitting or computationally demanding, rendering these approaches impractical for large-scale datasets. In this work, we address these shortcomings by shifting the focus from uncertainty in the weight space to uncertainty at the activation level, via Gaussian processes. More specifically, we introduce the Gaussian Process Activation function (GAPA) to capture neuron-level uncertainties. Our approach operates in a post-hoc manner, preserving the original mean predictions of the pre-trained neural network and thereby avoiding the underfitting issues commonly encountered in previous methods. We propose two methods. The first, GAPA-Free, employs empirical kernel learning from the training data for the hyperparameters and is highly efficient during training. The second, GAPA-Variational, learns the hyperparameters via gradient descent on the kernels, thus affording greater flexibility. Empirical results demonstrate that GAPA-Variational outperforms the Laplace approximation on most datasets in at least one of the uncertainty quantification metrics.

activation, learning, neural network, (11 more...)

arXiv.org Machine Learning

2502.20966

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Sweden > Västmanland County > Västerås (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Transportation (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Predicting concentration levels of air pollutants by transfer learning and recurrent neural network

Fong, Iat Hang, Li, Tengyue, Fong, Simon, Wong, Raymond K., Tallón-Ballesteros, Antonio J.

arXiv.org Artificial IntelligenceJan-30-2025

Air pollution (AP) poses a great threat to human health, and people are paying more attention than ever to its prediction. Accurate prediction of AP helps people to plan for their outdoor activities and aids protecting human health. In this paper, long-short term memory (LSTM) recurrent neural networks (RNNs) have been used to predict the future concentration of air pollutants (APS) in Macau. Additionally, meteorological data and data on the concentration of APS have been utilized. Moreover, in Macau, some air quality monitoring stations (AQMSs) have less observed data in quantity, and, at the same time, some AQMSs recorded less observed data of certain types of APS. Therefore, the transfer learning and pre-trained neural networks have been employed to assist AQMSs with less observed data to build a neural network with high prediction accuracy. The experimental sample covers a period longer than 12-year and includes daily measurements from several APS as well as other more classical meteorological values. Records from five stations, four out of them are AQMSs and the remaining one is an automatic weather station, have been prepared from the aforesaid period and eventually underwent to computational intelligence techniques to build and extract a prediction knowledge-based system. As shown by experimentation, LSTM RNNs initialized with transfer learning methods have higher prediction accuracy; it incurred shorter training time than randomly initialized recurrent neural networks.

artificial intelligence, machine learning, neural network, (14 more...)

arXiv.org Artificial Intelligence

2502.01654

Country:

Asia > Macao (0.59)
Asia > China > Hong Kong (0.05)
Europe > Spain > Andalusia > Seville Province > Seville (0.04)
(6 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.34)
Health & Medicine > Therapeutic Area > Immunology (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Leveraging Pre-Trained Neural Networks to Enhance Machine Learning with Variational Quantum Circuits

Qi, Jun, Yang, Chao-Han, Chen, Samuel Yen-Chi, Chen, Pin-Yu, Zenil, Hector, Tegner, Jesper

arXiv.org Artificial IntelligenceNov-13-2024

Quantum Machine Learning (QML) offers tremendous potential but is currently limited by the availability of qubits. We introduce an innovative approach that utilizes pre-trained neural networks to enhance Variational Quantum Circuits (VQC). This technique effectively separates approximation error from qubit count and removes the need for restrictive conditions, making QML more viable for real-world applications. Our method significantly improves parameter optimization for VQC while delivering notable gains in representation and generalization capabilities, as evidenced by rigorous theoretical analysis and extensive empirical testing on quantum dot classification tasks. Moreover, our results extend to applications such as human genome analysis, demonstrating the broad applicability of our approach. By addressing the constraints of current quantum hardware, our work paves the way for a new era of advanced QML applications, unlocking the full potential of quantum computing in fields such as machine learning, materials science, medicine, mimetics, and various interdisciplinary areas.

machine learning, neural network, vqc, (13 more...)

arXiv.org Artificial Intelligence

2411.08552

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > California > Santa Clara County > Santa Clara (0.04)
Asia > Middle East > Saudi Arabia (0.04)

Genre: Research Report > New Finding (0.66)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Running Ekkono's Edge Machine Learning on a Commodore 64 - Ekkono Solutions AB

#artificialintelligenceAug-22-2022, 10:53:53 GMT

What does it really mean to run machine learning on the edge? Over the last five years, Ekkono's researchers, engineers, and developers have been working hard to bring smart functionality to small hardware platforms. In this blog post, I would like to give you a small glimpse of what's possible to achieve with our purpose-built machine learning software library, designed with portability and ease-of-use in mind from the very first line of code. The potential benefits of analyzing data close to the source, rather than uploading it to the cloud, are many: faster response times, increased security and privacy, and improved energy efficiency, to name a few. Back in June, one of our data scientists, Eva Garcia Martin, wrote about some of the challenges of designing and building machine learning software for edge devices, and how those challenges influence our R&D processes.

application, commodore 64, neural network, (16 more...)

#artificialintelligence

Country: Europe > Germany (0.05)

Industry:

Education (0.56)
Information Technology (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

IT Threat Detection using Neural Search

#artificialintelligenceJul-5-2022, 16:50:48 GMT

If you spend more on coffee than IT security, you will be hacked! Warned U.S. Cybersecurity Czar Richard Clarke, speaking at RSA Conference. This quote would make a great bumper sticker if it weren't for network attacks. According to research by IBM, it takes 280 days to find and contain the average cyberattack, while the average attack costs $3.86 million. But what are network attacks, and how can we leverage a next-gen search tool like Jina to mitigate our exposure to the threat?

feature extractor, network traffic, neural network, (12 more...)

#artificialintelligence

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.71)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

How Data-Centric AI Bolsters Deep Learning for the Small-Data Masses

#artificialintelligenceMar-25-2022, 11:10:23 GMT

It's no coincidence that deep learning became popular in the AI community following the rise of big data, since neural networks require huge amounts of data to train. But organizations with much smaller data sets can benefit from pre-trained neural networks, especially if they follow the premise of data-centric AI, Andrew Ng said this week at the Nvidia GPU Technology Conference. Ng, a prominent AI researcher and a Datanami 2022 Person to Watch, is at the forefront of the data-centric AI movement, which is aimed at helping millions of smaller organizations leverage the promise of AI. "We know that in consumer software companies, you may have a billion users [in] a giant data set. But when you go to other industries, the sizes are often much smaller," Ng said during his Nvidia GTC session, titled "The Data-centric AI Movement." "From where I'm sitting, I think AI–machine learning, deep learning–has transformed the consumer software Internet. But in many other industries, I think it's frankly not yet there."

andrew ng, data-centric ai bolster deep learning, neural network, (12 more...)

#artificialintelligence

Industry: Information Technology (0.96)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.82)

Add feedback

Using DeepProbLog to perform Complex Event Processing on an Audio Stream

Vilamala, Marc Roig, Xing, Tianwei, Taylor, Harrison, Garcia, Luis, Srivastava, Mani, Kaplan, Lance, Preece, Alun, Kimmig, Angelika, Cerutti, Federico

arXiv.org Artificial IntelligenceOct-15-2021

In this paper, we present an approach to Complex Event Processing (CEP) that is based on DeepProbLog. This approach has the following objectives: (i) allowing the use of subsymbolic data as an input, (ii) retaining the flexibility and modularity on the definitions of complex event rules, (iii) allowing the system to be trained in an end-to-end manner and (iv) being robust against noisily labelled data. Our approach makes use of DeepProbLog to create a neuro-symbolic architecture that combines a neural network to process the subsymbolic data with a probabilistic logic layer to allow the user to define the rules for the complex events. We demonstrate that our approach is capable of detecting complex events from an audio stream. We also demonstrate that our approach is capable of training even with a dataset that has a moderate proportion of noisy data.

complex event, dataset, neural network, (15 more...)

arXiv.org Artificial Intelligence

2110.0809

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (1.00)

Industry:

Media > Radio (0.61)
Leisure & Entertainment (0.61)
Government (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Accelerate Your First AI Deployment with Pre-Trained Models

#artificialintelligenceJul-8-2021, 15:31:01 GMT

Organizations are constantly looking to incorporate artificial intelligence (AI) in their daily operations. However, with the large amounts of time and money required for extensive AI integration, organizations must find smarter ways to implement AI, such as using pre-trained AI models. As you probably know, transforming an organization with AI and machine learning can be time-consuming. The efforts and finances required to complete the process depend on the level of automation and digitization being introduced in the various departments of the organization. Deep learning and other components of AI need thousands upon thousands of datasets to improve the competency of automated operations over a given period.

ai model, neural network, pre-trained ai model, (15 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.73)

Add feedback