A Primer on Large Language Models and their Limitations

Johnson, Sandra, Hyland-Wood, David

arXiv.org Artificial Intelligence

The world of artificial intelligence (AI) is increasingly penetrating all aspects of our personal and professional lives. This proliferation of AI tools and applications is being met with a mixture of excitement, scepticism and even dread [78]: excitement at the seemingly endless potential of AI applications such as LLMs, especially when they are integrated "within broader systems" [13]; scepticism as the realisation dawns that LLMs are in fact fallible, as evidenced by hallucinations, and hence not a silver bullet that can solve all problems [19, 21]; and dread among those who believe that LLMs and AI have the potential to detrimentally impact our lives and make people redundant [78]. The ability of some LLMs to pass Theory of Mind (ToM) [64][32] and Turing Tests [7][42] lends support to the Computational Theory of Mind (CTM), the view that cognition may be substrate independent. These findings challenge biological essentialism and open new avenues for creating sophisticated AI systems capable of human-like reasoning and interaction.


Predicting the activity of chemical compounds based on machine learning approaches

Tu, Do Hoang, Van Lang, Tran, Xuyen, Pham Cong, Long, Le Mau

arXiv.org Artificial Intelligence

ABSTRACT -- Exploring methods and techniques of machine learning (ML) to address specific challenges in various fields is essential. In this work, we tackle a problem in the domain of cheminformatics: providing a suitable solution to aid in predicting the activity of a chemical compound to the best extent possible. To address the problem at hand, this study conducts experiments on 100 different combinations of existing techniques. These solutions are then selected based on a set of criteria that includes the G-mean, F1-score, and AUC metrics. The results have been tested on a dataset of about 10,000 chemical compounds from PubChem that have been classified according to their activity.

I. INTRODUCTION -- Datasets used in biological experiments for measuring the activity of various compounds against different biological targets, often used in screening, usually show a significant imbalance between active and inactive compounds, with inactive data points far outnumbering active ones. Training therefore requires suitable machine learning models. Additionally, preprocessing before training is a crucial issue. The following issues are approached to address the problem of predicting the activity of chemical compounds using chemistry-related datasets: investigating the dependency of attributes or features in the dataset to potentially reduce the number of features. This can be done using methods such as the ANOVA F-test to assess the dependency of each feature on the target variable, or by using correlation coefficients.
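The ANOVA F-test feature-reduction step described above can be sketched with scikit-learn (an illustrative setup, not the authors' exact pipeline; the synthetic descriptor matrix, the imbalance ratio, and `k=5` are all assumptions):

```python
import numpy as np
from sklearn.feature_selection import SelectKBest, f_classif

rng = np.random.default_rng(0)

# Synthetic stand-in for a compound/descriptor matrix:
# 200 compounds, 20 molecular descriptors, imbalanced labels
# (far more inactive (0) than active (1), as in screening data).
X = rng.normal(size=(200, 20))
y = (rng.random(200) < 0.1).astype(int)

# Make the first two descriptors genuinely informative about activity.
X[:, 0] += 3 * y
X[:, 1] -= 2 * y

# ANOVA F-test: score each feature's dependence on the target,
# then keep only the 5 highest-scoring descriptors.
selector = SelectKBest(score_func=f_classif, k=5)
X_reduced = selector.fit_transform(X, y)

print(X_reduced.shape)  # (200, 5)
```

On real screening data, the scores from `selector.scores_` can also be inspected directly to decide how many descriptors are worth keeping.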


Monadic Deep Learning

Yang, Bo, Marisa, Zhihao Zhang Kirisame, Shi, Kai

arXiv.org Artificial Intelligence

The Java and Scala community has built a very successful big data ecosystem. However, most of the neural networks running on it are modeled in dynamically typed programming languages. These dynamically typed deep learning frameworks treat neural networks as differentiable expressions that contain many trainable variables, and perform automatic differentiation on those expressions when training them. Until 2019, none of the learning frameworks in statically typed languages provided the expressive power of traditional frameworks; their users could not use custom algorithms without writing plenty of boilerplate code for hard-coded back-propagation. We solved this problem in DeepLearning.scala 2. Our contributions are: 1. We discovered a novel approach to perform reverse-mode automatic differentiation for statically typed functions that contain multiple trainable variables, and that can interoperate freely with the metalanguage. 2. We designed a set of monads and monad transformers, which allow users to create monadic expressions that represent dynamic neural networks. 3. Along with these monads, we provide some applicative functors to perform multiple calculations in parallel. With these features, users of DeepLearning.scala were able to create complex neural networks in an intuitive and concise way, while still maintaining type safety.
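DeepLearning.scala's contribution is the monadic, statically typed packaging; the underlying reverse-mode automatic differentiation it performs can be sketched as a small toy in Python (purely illustrative, not the library's API):

```python
# Toy reverse-mode automatic differentiation: each operation records
# its inputs together with the local gradient with respect to each,
# and backward() propagates gradients through that recorded graph.
class Var:
    def __init__(self, value, parents=()):
        self.value = value
        self.parents = parents  # pairs of (input Var, local gradient)
        self.grad = 0.0

    def __add__(self, other):
        return Var(self.value + other.value, [(self, 1.0), (other, 1.0)])

    def __mul__(self, other):
        return Var(self.value * other.value,
                   [(self, other.value), (other, self.value)])

    def backward(self, seed=1.0):
        self.grad += seed
        for parent, local in self.parents:
            parent.backward(seed * local)

# For z = x*y + x: dz/dx = y + 1, dz/dy = x
x, y = Var(2.0), Var(3.0)
z = x * y + x
z.backward()
print(z.value, x.grad, y.grad)  # 8.0 4.0 2.0
```

A production system would traverse the graph iteratively in topological order rather than recursing; the paper's monads serve a similar role, sequencing these forward and backward passes in a type-safe way.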


#DeepLearning. Just published a new blog post on deep…

#artificialintelligence

Deep learning is a subset of machine learning that is based on artificial neural networks. It has revolutionized the field of artificial intelligence and is being used in a variety of applications, including image and speech recognition, natural language processing, and autonomous vehicles. In this blog post, we will provide an overview of what deep learning is, how it works, and its applications.


How to Build a Speech-to-Text System using ChatGPT and Python - Pyresearch - Medium

#artificialintelligence

Check out our latest tutorial on how to build a speech-to-text system using ChatGPT and Python! Learn how to leverage the power of natural language processing and deep learning to convert audio to text with impressive accuracy. Please share your feedback on the video in the comments, like and share it, and subscribe to the channel for more educational videos.


Use OpenAI with Google Spreadsheets

#artificialintelligence

This article explains how you can integrate OpenAI GPT-3 with Google Spreadsheets, allowing you to complete spreadsheet tasks with the help of AI. Tip: make sure to subscribe to the Gist above, since all future revisions and improvements will be made to that file; you can then refer back to it later and update your functions. Note: when the functions in that Gist are revised, it is also the place to pick up the new code.


Memory Complexity with Transformers - KDnuggets

#artificialintelligence

The key innovation in Transformers is the self-attention mechanism, which computes similarity scores for all pairs of positions in an input sequence. It can be evaluated in parallel for each token, avoiding the sequential dependency of recurrent neural networks and enabling Transformers to vastly outperform previous sequence models such as LSTMs. A limitation of existing Transformer models and their derivatives, however, is that full self-attention has computational and memory requirements that are quadratic in the input sequence length: if you try to run a large Transformer on a long sequence, you simply run out of memory. What can be a solution to this problem? There are plenty of deep explanations elsewhere, so here I'd like to share some example questions in an interview setting, along with some tips for readers' reference.
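The quadratic memory cost is easy to see by materialising the attention score matrix directly (a minimal single-head sketch with no batching; the model width `d_model=64` and float64 scores are assumptions for illustration):

```python
import numpy as np

def attention_scores(n_tokens, d_model=64, seed=0):
    """Materialise the full self-attention score matrix for a sequence.

    The (n_tokens, n_tokens) score matrix is what makes vanilla
    attention quadratic in memory: doubling the sequence length
    quadruples the matrix.
    """
    rng = np.random.default_rng(seed)
    q = rng.normal(size=(n_tokens, d_model))  # queries
    k = rng.normal(size=(n_tokens, d_model))  # keys
    return q @ k.T / np.sqrt(d_model)         # shape: (n_tokens, n_tokens)

for n in (1_000, 2_000, 4_000):
    mb = attention_scores(n).nbytes / 1e6
    print(f"{n:>5} tokens -> {mb:8.1f} MB of scores")
```

At 1,000 tokens the float64 score matrix is 8 MB; at 4,000 tokens it is already 128 MB per head, which is why long-sequence variants (sparse, windowed, or low-rank attention) avoid materialising it at all.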


How to Build Good AI Solutions When Data Is Scarce

#artificialintelligence

Conventional wisdom holds that you need large volumes of labeled training data to unlock value from powerful AI models. For the consumer internet companies where many of today's AI models originated, this hasn't been difficult to obtain. But for companies in other sectors -- such as industrial companies, manufacturers, health care organizations, and educational institutions -- curating labeled data in sufficient volume can be significantly more challenging. Over the past few years, AI practitioners and researchers have developed several techniques to significantly reduce the volume of labeled data needed to build accurate AI models. Using these approaches, it's often possible to build a good AI model with a fraction of the labeled data that might otherwise be needed.
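One of the label-efficiency techniques alluded to above is self-training: fit a model on the few labels you have, pseudo-label the unlabeled points it is confident about, and refit. A minimal sketch with scikit-learn, assuming a synthetic dataset and a 5% labeling rate chosen purely for illustration:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.semi_supervised import SelfTrainingClassifier

# Synthetic stand-in for a dataset where labels are expensive.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)

# Pretend only ~5% of the data is labeled; scikit-learn's convention
# is to mark unlabeled samples with -1.
rng = np.random.default_rng(0)
y_partial = y.copy()
y_partial[rng.random(len(y)) > 0.05] = -1

# Self-training: iteratively pseudo-label confident unlabeled points
# and refit the base classifier on the growing labeled set.
model = SelfTrainingClassifier(LogisticRegression())
model.fit(X, y_partial)

score = model.score(X, y)  # evaluated against the full ground truth
print(f"accuracy with ~5% labels: {score:.2f}")
```

Other approaches in the same spirit, such as transfer learning from pretrained models and data augmentation, attack the same bottleneck from different angles.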


"UnConference"🎙 with Jeremy Howard

#artificialintelligence

Today's post is slightly off-track. I was invited to the first Fast.AI unconference in Brisbane, Queensland this week. It was an honor to be part of the community, and I'm having a blast meeting so many brilliant AI researchers from around the globe! In short, UnConferences are "unconventional conferences": anyone can propose an agenda and organize a session on any topic they want.