AITopics

2512.06304

Country:

Asia > China (0.69)
North America > United States (0.67)
Europe (0.67)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

arXiv.org Artificial IntelligenceJan-5-2024

Understanding LLMs: A Comprehensive Overview from Training to Inference

Liu, Yiheng, He, Hao, Han, Tianle, Zhang, Xu, Liu, Mengyuan, Tian, Jiaming, Zhang, Yutong, Wang, Jiaqi, Gao, Xiaohui, Zhong, Tianyang, Pan, Yi, Xu, Shaochen, Wu, Zihao, Liu, Zhengliang, Zhang, Xin, Zhang, Shu, Hu, Xintao, Zhang, Tuo, Qiang, Ning, Liu, Tianming, Ge, Bao

The introduction of ChatGPT has led to a significant increase in the utilization of Large Language Models (LLMs) for addressing downstream tasks. There's an increasing focus on cost-efficient training and deployment within this context. Low-cost training and deployment of LLMs represent the future development trend. This paper reviews the evolution of large language model training techniques and inference deployment technologies aligned with this emerging trend. The discussion on training includes various aspects, including data preprocessing, training architecture, pre-training tasks, parallel training, and relevant content related to model fine-tuning. On the inference side, the paper covers topics such as model compression, parallel computation, memory scheduling, and structural optimization. It also explores LLMs' utilization and provides insights into their future development.

arxiv preprint arxiv, language model, llm, (13 more...)

2401.02038

Country:

Asia > China > Shaanxi Province > Xi'an (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(3 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.92)
Health & Medicine > Diagnostic Medicine > Imaging (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceNov-20-2023

Exploring Prompting Large Language Models as Explainable Metrics

Mahmoudi, Ghazaleh

This paper describes the IUST NLP Lab submission to the Prompting Large Language Models as Explainable Metrics Shared Task at the Eval4NLP 2023 Workshop on Evaluation & Comparison of NLP Systems. We have proposed a zero-shot prompt-based strategy for explainable evaluation of the summarization task using Large Language Models (LLMs). The conducted experiments demonstrate the promising potential of LLMs as evaluation metrics in Natural Language Processing (NLP), particularly in the field of summarization. Both few-shot and zero-shot approaches are employed in these experiments. The performance of our best provided prompts achieved a Kendall correlation of 0.477 with human evaluations in the text summarization task on the test data. Code and results are publicly available on GitHub.

evaluation, main document, summarization, (16 more...)

2311.11552

Country:

North America > The Bahamas (0.16)
North America > Jamaica (0.05)
Europe > United Kingdom > England > Leicestershire > Leicester (0.05)
(8 more...)

Genre: Research Report (0.64)

Industry:

Leisure & Entertainment > Sports > Soccer (1.00)
Leisure & Entertainment > Sports > Olympic Games (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

arXiv.org Artificial IntelligenceApr-24-2023

Optimization Methods in Deep Learning: A Comprehensive Overview

Shulman, David

In recent years, deep learning has achieved remarkable success in various fields such as image recognition, natural language processing, and speech recognition. The effectiveness of deep learning largely depends on the optimization methods used to train deep neural networks. In this paper, we provide an overview of first-order optimization methods such as Stochastic Gradient Descent, Adagrad, Adadelta, and RMSprop, as well as recent momentum-based and adaptive gradient methods such as Nesterov accelerated gradient, Adam, Nadam, AdaMax, and AMSGrad. We also discuss the challenges associated with optimization in deep learning and explore techniques for addressing these challenges, including weight initialization, batch normalization, and layer normalization. Finally, we provide recommendations for selecting optimization methods for different deep learning tasks and datasets. This paper serves as a comprehensive guide to optimization methods in deep learning and can be used as a reference for researchers and practitioners in the field.

comprehensive overview, deep learning, machine learning, (2 more...)

2302.09566

Genre:

Overview (0.89)
Research Report (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceJan-6-2023, 02:05:21 GMT

Maximizing Object Detection Accuracy with FPN: A Comprehensive Overview

FPN (Feature Pyramid Network) is a type of convolutional neural network architecture for object detection tasks. It is designed to improve the performance of object detection models by making use of both high-level and low-level features from the input image. The basic idea behind FPN is to build a pyramid of features, where each level in the pyramid represents a different scale or resolution of the input image. The top of the pyramid represents the high-level, semantically rich features, while the bottom of the pyramid represents the low-level, fine-grained features. By combining features from different levels in the pyramid, the model is able to make use of both the semantically rich high-level features and the fine-grained low-level features to improve the accuracy of object detection.

artificial intelligence, machine learning, pyramid, (15 more...)

Genre: Overview (0.42)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)

#artificialintelligenceDec-24-2022, 12:10:29 GMT

A Comprehensive Overview of Feature Selection Methods in Machine Learning

A critical phase of machine learning is feature selection. To enhance the model's effectiveness and generalizability, it includes choosing and creating a subset of features from a dataset that are most related to the target variable. There are many different techniques that can be used for feature selection, and the appropriate technique will depend on the specific problem and the type of data you are working with. It's important to note that feature selection is a trade-off between model complexity and predictive power. Adding more features to a model can potentially improve its performance, but it can also make the model more difficult to interpret and increase the risk of overfitting (performing well on the training data but poorly on unseen data). A hybrid method involves combining two or more of the above methods to find the optimal subset of features.

comprehensive overview, feature selection method, machine learning, (3 more...)

Genre: Overview (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceAug-7-2021, 21:45:22 GMT

Machine Learning Market Key Development, Trends and Major Players With Top Countries Data

The global Machine Learning Market research report is a compilation of the detailed study of each and every aspect related to the Machine Learning industry. The research report offers a thorough analysis of all the Market related data supported by reliable numerical data. The research report holds the crucial data regarding the Valuation of the Machine Learning industry in the past years. It also includes a prediction for numerical data regarding the future Market size and volume. The detailed study on the CAGR at which the Machine Learning Market is anticipated to expand in the future is provided in the study.

california, machine learning industry, research report, (8 more...)

Country:

North America > United States > New York (0.41)
Europe > Germany (0.41)
Europe > Switzerland > Zürich > Zürich (0.15)
(14 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.82)

Industry:

Information Technology (1.00)
Marketing (0.95)
Banking & Finance > Trading (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Machine LearningOct-26-2020

A Comprehensive Overview and Survey of Recent Advances in Meta-Learning

Peng, Huimin

This article reviews meta-learning also known as learning-to-learn which seeks rapid and accurate model adaptation to unseen tasks with applications in highly automated AI, few-shot learning, natural language processing and robotics. Unlike deep learning, meta-learning can be applied to few-shot high-dimensional datasets and considers further improving model generalization to unseen tasks. Deep learning is focused upon in-sample prediction and meta-learning concerns model adaptation for out-of-sample prediction. Meta-learning can continually perform self-improvement to achieve highly autonomous AI. Meta-learning may serve as an additional generalization block complementary for original deep learning model. Meta-learning seeks adaptation of machine learning models to unseen tasks which are vastly different from trained tasks. Meta-learning with coevolution between agent and environment provides solutions for complex tasks unsolvable by training from scratch. Meta-learning methodology covers a wide range of great minds and thoughts. We briefly introduce meta-learning methodologies in the following categories: black-box meta-learning, metric-based meta-learning, layered meta-learning and Bayesian meta-learning framework. Recent applications concentrate upon the integration of meta-learning with other machine learning framework to provide feasible integrated problem solutions. We briefly present recent meta-learning advances and discuss potential future research directions.

base learner, deep learning, upstream oil & gas, (20 more...)

arXiv.org Machine Learning

2004.11149

Country: North America > Canada (0.27)

Genre: Overview (1.00)

Industry:

Education (1.00)
Banking & Finance (0.68)
Information Technology > Security & Privacy (0.45)
Energy > Oil & Gas > Upstream (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

#artificialintelligenceSep-26-2020, 12:51:20 GMT

Artificial Intelligence (AI) In Fintech Market Growth by Top Companies, Region, Application, Driver, Trends and Forecasts by 2027 – Crypto Daily

The Artificial Intelligence (AI) In Fintech Market report predicts promising growth and development during the period 2020-2027. The Artificial Intelligence (AI) In Fintech Market survey report represents vital statistical data represented in an organized format such as graphs, charts, tables, and figures to provide a detailed understanding of the Artificial Intelligence (AI) In Fintech Market in a simple manner. The report covers an in-depth analysis of the Artificial Intelligence (AI) In Fintech market and offers key insights on current and emerging trends, market drivers, and market insights offered by industry experts. The report examines the impact of COVID-19 on market growth. The study provides comprehensive coverage of the impact of the COVID-19 pandemic on the Artificial Intelligence (AI) In Fintech market and its key segments.

artificial intelligence, fintech market, verifiedmarketresearch, (13 more...)

Country:

South America (0.15)
Europe (0.06)
Asia (0.06)
(2 more...)

Genre:

Overview (0.92)
Research Report > Experimental Study (0.35)

Industry:

Banking & Finance (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.77)
Health & Medicine > Therapeutic Area > Immunology (0.77)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Artificial Intelligence (1.00)

#artificialintelligenceFeb-15-2020, 21:21:29 GMT

The Most Influential Deep Learning Research of 2019

Deep learning has continued its forward movement during 2019 with advances in many exciting research areas like generative adversarial networks (GANs), auto-encoders, and reinforcement learning. In terms of deployments, deep learning is the darling of many contemporary application areas such as computer vision, image recognition, speech recognition, natural language processing, machine translation, autonomous vehicles, and many more. Earlier this year, we saw Google AI Language revolutionize the NLP segment of deep learning with the new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. The already seminal paper was released on arXiv on May 24. This has led to a storm of follow-on research results.

graph neural network, learning, neural network, (13 more...)

Country: North America > Canada > Ontario > Toronto (0.05)

Genre: Overview (0.96)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)