AITopics

This paper presents a systematic overview and comparison of parameter-efficient fine-tuning methods covering over 40 papers published between February 2019 and February 2023. These methods aim to resolve the infeasibility and impracticality of fine-tuning large language models by only training a small set of parameters. We provide a taxonomy that covers a broad range of methods and present a detailed method comparison with a specific focus on real-life efficiency and fine-tuning multibillion-scale language models.

large language model, machine learning, natural language, (19 more...)

2303.15647

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > San Diego County > San Diego (0.04)
North America > Dominican Republic (0.04)
(7 more...)

Genre:

Research Report (1.00)
Overview (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Boosting Reinforcement Learning and Planning with Demonstrations: A Survey

Mu, Tongzhou, Su, Hao

Although reinforcement learning has seen tremendous success recently, this kind of trial-and-error learning can be impractical or inefficient in complex environments. The use of demonstrations, on the other hand, enables agents to benefit from expert knowledge rather than having to discover the best action to take through exploration. In this survey, we discuss the advantages of using demonstrations in sequential decision making, various ways to apply demonstrations in learning-based decision making paradigms (for example, reinforcement learning and planning in the learned models), and how to collect the demonstrations in various scenarios. Additionally, we exemplify a practical pipeline for generating and utilizing demonstrations in the recently proposed ManiSkill robot learning benchmark.

demonstration, machine learning, reinforcement learning, (14 more...)

2303.13489

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > Germany > Berlin (0.04)

Genre:

Overview (0.68)
Research Report (0.64)

Industry:

Leisure & Entertainment > Games > Computer Games (0.93)
Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.68)

Fournier, Quentin, Caron, Gaétan Marceau, Aloise, Daniel

A Practical Survey on Faster and Lighter Transformers

Recurrent neural networks are effective models to process sequences. However, they are unable to learn long-term dependencies because of their inherent sequential nature. As a solution, Vaswani et al. introduced the Transformer, a model solely based on the attention mechanism that is able to relate any two positions of the input sequence, hence modelling arbitrary long dependencies. The Transformer has improved the state-of-the-art across numerous sequence modelling tasks. However, its effectiveness comes at the expense of a quadratic computational and memory complexity with respect to the sequence length, hindering its adoption. Fortunately, the deep learning community has always been interested in improving the models' efficiency, leading to a plethora of solutions such as parameter sharing, pruning, mixed-precision, and knowledge distillation. Recently, researchers have directly addressed the Transformer's limitation by designing lower-complexity alternatives such as the Longformer, Reformer, Linformer, and Performer. However, due to the wide range of solutions, it has become challenging for researchers and practitioners to determine which methods to apply in practice in order to meet the desired trade-off between capacity, computation, and memory. This survey addresses this issue by investigating popular approaches to make Transformers faster and lighter and by providing a comprehensive explanation of the methods' strengths, limitations, and underlying assumptions.

artificial intelligence, machine learning, natural language, (21 more...)

doi: 10.1145/3586074

2103.14636

Country:

North America > Canada > Quebec > Montreal (0.04)
Asia > Middle East > Jordan (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine (0.67)
Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Sejarah dan Perkembangan Teknik Natural Language Processing (NLP) Bahasa Indonesia: Tinjauan tentang sejarah, perkembangan teknologi, dan aplikasi NLP dalam bahasa Indonesia

Amien, Mukhlis

This study provides an overview of the history of the development of Natural Language Processing (NLP) in the context of the Indonesian language, with a focus on the basic technologies, methods, and practical applications that have been developed. This review covers developments in basic NLP technologies such as stemming, part-of-speech tagging, and related methods; practical applications in cross-language information retrieval systems, information extraction, and sentiment analysis; and methods and techniques used in Indonesian language NLP research, such as machine learning, statistics-based machine translation, and conflict-based approaches. This study also explores the application of NLP in Indonesian language industry and research and identifies challenges and opportunities in Indonesian language NLP research and development. Recommendations for future Indonesian language NLP research and development include developing more efficient methods and technologies, expanding NLP applications, increasing sustainability, further research into the potential of NLP, and promoting interdisciplinary collaboration. It is hoped that this review will help researchers, practitioners, and the government to understand the development of Indonesian language NLP and identify opportunities for further research and development. Designing an indonesian part of speech tagset and manually tagged indonesian corpus.

information retrieval, machine learning, natural language, (17 more...)

2304.02746

Country:

Asia > Indonesia (1.00)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Asia > Vietnam > Hanoi > Hanoi (0.04)

Genre:

Overview (1.00)
Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)

Guerra-Manzanares, Alejandro, Lopez, L. Julian Lechuga, Maniatakos, Michail, Shamout, Farah E.

Privacy-preserving machine learning for healthcare: open challenges and future perspectives

healthcare, open challenge and future perspective, privacy-preserving machine

Machine Learning (ML) has recently shown tremendous success in modeling various healthcare prediction tasks, ranging from disease diagnosis and prognosis to patient treatment. Due to the sensitive nature of medical data, privacy must be considered along the entire ML pipeline, from model training to inference. In this paper, we conduct a review of recent literature concerning Privacy-Preserving Machine Learning (PPML) for healthcare. We primarily focus on privacy-preserving training and inference-as-a-service, and perform a comprehensive review of existing trends, identify challenges, and discuss opportunities for future research directions. The aim of this review is to guide the development of private and efficient ML models in healthcare, with the prospects of translating research efforts into real-world settings.

doi: 10.1007/978-3-031-39539-0_3

2303.15563

Genre: Overview (0.87)

Industry:

Health & Medicine (1.00)
Information Technology > Security & Privacy (0.87)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Security & Privacy (0.87)
Information Technology > Data Science > Data Mining > Big Data (0.80)

Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A Survey

Xiao, Aoran, Huang, Jiaxing, Guan, Dayan, Zhang, Xiaoqin, Lu, Shijian, Shao, Ling

Point cloud data have been widely explored due to its superior accuracy and robustness under various adverse situations. Meanwhile, deep neural networks (DNNs) have achieved very impressive success in various applications such as surveillance and autonomous driving. The convergence of point cloud and DNNs has led to many deep point cloud models, largely trained under the supervision of large-scale and densely-labelled point cloud data. Unsupervised point cloud representation learning, which aims to learn general and useful point cloud representations from unlabelled point cloud data, has recently attracted increasing attention due to the constraint in large-scale point cloud labelling. This paper provides a comprehensive review of unsupervised point cloud representation learning using DNNs. It first describes the motivation, general pipelines as well as terminologies of the recent studies. Relevant background including widely adopted point cloud datasets and DNN architectures is then briefly presented. This is followed by an extensive discussion of existing unsupervised point cloud representation learning methods according to their technical approaches. We also quantitatively benchmark and discuss the reviewed methods over multiple widely adopted point cloud datasets. Finally, we share our humble opinion about several challenges and problems that could be pursued in future research in unsupervised point cloud representation learning. A project associated with this survey has been built at https://github.com/xiaoaoran/3d_url_survey.

artificial intelligence, machine learning, survey article, (13 more...)

doi: 10.1109/TPAMI.2023.3262786

2202.13589

Country: Asia > Middle East > UAE (0.28)

Genre: Overview (1.00)

Industry:

Energy > Oil & Gas (0.67)
Transportation > Ground > Road (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Choi, Joseph B., Nguyen, Phong C. H., Sen, Oishik, Udaykumar, H. S., Baek, Stephen

Artificial intelligence approaches for materials-by-design of energetic materials: state-of-the-art, challenges, and future directions

Energetic materials (EM) cover a wide spectrum of propellants, pyrotechnics, and explosives and are key components in military applications for propulsion and munition systems and in civilian applications such as construction and mining [1]. Heterogenous/composite EMs have complex microstructures which significantly influence--along with chemistry--the property and performance of these materials [2-8]. There is increasing research interest in controlling the microstructure of EM, to engineer their properties and performance for targeted functional specificity [9-10]. EMs are typically solid-solid composites of organic energetic crystals (commonly CHNO compounds), inclusions (i.e., metals, nanoparticles), and plastic binders. The CHNO materials are commonly categorized based on how sensitive they are to an external load/mechanical insult. They can range f rom'insensitive' (such as TATB - based EMs [11]) to'highly sensitive' (PETN-based EMs [12-13]) with others such as HMX, CL-20, and RDX ranging in between [14]. The sensitivity is closely connected with the molecular structure of these species of EMs within the CHNO family. However, when they are formed into propellants and explosives, the sensitivity is also impacted by the physical structure, composition, and formulation of the material mixtures, as reviewed by Handley et al. [1]. In other words, the design of a mixture and its microstructure can define the overall properties and performance characteristics of formed EM, thus opening the possibility of systematic methods to engineer materials by their design.

artificial intelligence, evolutionary algorithm, machine learning, (20 more...)

doi: 10.1002/prep.202200276

2211.08179

Country: North America > United States (1.00)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Materials (1.00)
Health & Medicine (1.00)
Energy > Oil & Gas > Upstream (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
(6 more...)

Guiding AI-Generated Digital Content with Wireless Perception

Wang, Jiacheng, Du, Hongyang, Niyato, Dusit, Xiong, Zehui, Kang, Jiawen, Mao, Shiwen, Xuemin, null, Shen, null

Recent advances in artificial intelligence (AI), coupled with a surge in training data, have led to the widespread use of AI for digital content generation, with ChatGPT serving as a representative example. Despite the increased efficiency and diversity, the inherent instability of AI models poses a persistent challenge in guiding these models to produce the desired content for users. In this paper, we introduce an integration of wireless perception (WP) with AI-generated content (AIGC) and propose a unified WP-AIGC framework to improve the quality of digital content production. The framework employs a novel multi-scale perception technology to read user's posture, which is difficult to describe accurately in words, and transmits it to the AIGC model as skeleton images. Based on these images and user's service requirements, the AIGC model generates corresponding digital content. Since the production process imposes the user's posture as a constraint on the AIGC model, it makes the generated content more aligned with the user's requirements. Additionally, WP-AIGC can also accept user's feedback, allowing adjustment of computing resources at edge server to improve service quality. Experiments results verify the effectiveness of the WP-AIGC framework, highlighting its potential as a novel approach for guiding AI models in the accurate generation of digital content.

artificial intelligence, machine learning, natural language, (19 more...)

2303.14624

Country:

Asia > Singapore (0.04)
North America > United States (0.04)
North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)
Asia > China (0.04)

Genre:

Overview (0.88)
Research Report > Promising Solution (0.34)

Industry: Media (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

A Survey on Dual-Quaternions

Kenwright, Benjamin

Over the past few years, the applications of dual-quaternions have not only developed in many different directions but has also evolved in exciting ways in several areas. As dual-quaternions offer an efficient and compact symbolic form with unique mathematical properties. While dual-quaternions are now common place in many aspects of research and implementation, such as, robotics and engineering through to computer graphics and animation, there are still a large number of avenues for exploration with huge potential benefits. This article is the first to provide a comprehensive review of the dual-quaternion landscape. In this survey, we present a review of dual-quaternion techniques and applications developed over the years while providing insights into current and future directions. The article starts with the definition of dual-quaternions, their mathematical formulation, while explaining key aspects of importance (e.g., compression and ambiguities). The literature review in this article is divided into categories to help manage and visualize the application of dual-quaternions for solving specific problems. A timeline illustrating key methods is presented, explaining how dual-quaternion approaches have progressed over the years. The most popular dual-quaternion methods are discussed with regard to their impact in the literature, performance, computational cost and their real-world results (compared to associated models). Finally, we indicate the limitations of dual-quaternion methodologies and propose future research directions.

artificial intelligence, machine learning, quaternion, (15 more...)

2303.14765

Country: Europe > Ireland > Leinster > County Dublin > Dublin (0.04)

Genre: Overview (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (0.70)
Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Artificial Intelligence > Vision (0.69)

Wen, Dacheng, Li, Yupeng, Lau, Francis C. M.

A Survey of Machine Learning-Based Ride-Hailing Planning

Ride-hailing is a sustainable transportation paradigm where riders access door-to-door traveling services through a mobile phone application, which has attracted a colossal amount of usage. There are two major planning tasks in a ride-hailing system: (1) matching, i.e., assigning available vehicles to pick up the riders, and (2) repositioning, i.e., proactively relocating vehicles to certain locations to balance the supply and demand of ride-hailing services. Recently, many studies of ride-hailing planning that leverage machine learning techniques have emerged. In this article, we present a comprehensive overview on latest developments of machine learning-based ride-hailing planning. To offer a clear and structured review, we introduce a taxonomy into which we carefully fit the different categories of related works according to the types of their planning tasks and solution schemes, which include collective matching, distributed matching, collective repositioning, distributed repositioning, and joint matching and repositioning. We further shed light on many real-world datasets and simulators that are indispensable for empirical studies on machine learning-based ride-hailing planning strategies. At last, we propose several promising research directions for this rapidly growing research and practical field.

machine learning, proc, reinforcement learning, (18 more...)

2303.14646

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > China > Hong Kong (0.05)
North America > United States > New York (0.04)
(17 more...)

Genre: Overview (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(3 more...)