AITopics

2308.04457

Country:

North America > United States > Texas (0.67)
Europe (0.45)
North America > United States > Louisiana (0.28)
(2 more...)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.67)

Industry: Energy > Oil & Gas > Upstream (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Klabunde, Max, Schumacher, Tobias, Strohmaier, Markus, Lemmerich, Florian

Similarity of Neural Network Models: A Survey of Functional and Representational Measures

arXiv.org Artificial Intelligence, Aug-6-2023

However, understanding and measuring similarity of neural networks is a complex problem, as there are multiple perspectives on how such models can be similar. In this work, we specifically focus on two key perspectives: representational and functional measures of similarity (see Figure 1). Representational similarity measures assess how activations of intermediate layers differ, whereas functional similarity measures specifically compare the outputs of neural networks with respect to their task. Both perspectives on their own are not sufficient to gain detailed insights into similarity of neural network models. Seemingly similar representations can still yield different outputs, and conversely, similar outputs can result from different representations. In that sense, combining these two complementary perspectives provides a more comprehensive approach to analyze similarity between neural networks at all layers. Given the broad range of research on neural network similarity, numerous similarity measures have been proposed and applied, often with lines of research being disconnected from each other. With this work, we provide a comprehensive overview of measures for representational similarity and functional similarity that gives a unified perspective on the existing literature and can inform and guide both researchers and practitioners interested in understanding and comparing neural network models.
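The contrast between the two perspectives can be illustrated with a minimal sketch. It is not taken from the survey: linear CKA stands in for a representational measure and the prediction disagreement rate for a functional one, and the function names, array shapes, and random data are all assumptions made for illustration.

```python
import numpy as np


def linear_cka(X, Y):
    """Linear CKA between two activation matrices of shape (n_inputs, n_features).

    Representational view: compares intermediate activations, ignoring outputs.
    """
    X = X - X.mean(axis=0)  # center each feature
    Y = Y - Y.mean(axis=0)
    cross = np.linalg.norm(Y.T @ X, "fro") ** 2
    return cross / (np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro"))


def disagreement(logits_a, logits_b):
    """Fraction of inputs on which two models predict different classes.

    Functional view: compares task outputs, ignoring intermediate activations.
    """
    return float(np.mean(logits_a.argmax(axis=1) != logits_b.argmax(axis=1)))


# Hypothetical data: model B's hidden layer is a rotated copy of model A's.
rng = np.random.default_rng(0)
acts_a = rng.normal(size=(100, 16))
rotation, _ = np.linalg.qr(rng.normal(size=(16, 16)))  # random orthogonal matrix
acts_b = acts_a @ rotation

# Each model has its own (unrelated) linear output head over 3 classes.
logits_a = acts_a @ rng.normal(size=(16, 3))
logits_b = acts_b @ rng.normal(size=(16, 3))

print("linear CKA:", linear_cka(acts_a, acts_b))
print("disagreement:", disagreement(logits_a, logits_b))
```

Because linear CKA is invariant to orthogonal transformations, the two hidden layers score as identical even though the output heads are unrelated, which is one way representations can look similar while outputs differ.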

artificial intelligence, machine learning, representation, (15 more...)

2305.06329

Country:

North America > United States (0.14)
Europe > France (0.04)
Europe > Austria > Vienna (0.04)

Genre: Overview (1.00)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

arXiv.org Artificial Intelligence, Aug-6-2023

Taxonomy of Abstractive Dialogue Summarization: Scenarios, Approaches and Future Directions

Jia, Qi, Liu, Yizhu, Ren, Siyu, Zhu, Kenny Q.

Abstractive dialogue summarization generates a concise and fluent summary covering the salient information in a dialogue among two or more interlocutors. It has attracted significant attention in recent years based on the massive emergence of social communication platforms and an urgent requirement for efficient dialogue information understanding and digestion. Different from news or articles in traditional document summarization, dialogues bring unique characteristics and additional challenges, including different language styles and formats, scattered information, flexible discourse structures, and unclear topic boundaries. This survey provides a comprehensive investigation of existing work on abstractive dialogue summarization, from scenarios and approaches to evaluations. It categorizes the task into two broad categories according to the type of input dialogues, i.e., open-domain and task-oriented, and presents a taxonomy of existing techniques in three directions, namely, injecting dialogue features, designing auxiliary training tasks and using additional data. A list of datasets under different scenarios and widely-accepted evaluation metrics are summarized for completeness. After that, the trends of scenarios and techniques are summarized, together with deep insights into correlations between extensively exploited features and different scenarios. Based on these analyses, we recommend future directions, including more controlled and complicated scenarios, technical innovations and comparisons, publicly available datasets in special domains, etc. CCS Concepts: • Computing methodologies → Natural language generation; Discourse, dialogue and pragmatics; • General and reference → Surveys and overviews.

large language model, machine learning, natural language, (18 more...)

2210.09894

Country:

Asia > China > Shanghai > Shanghai (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > Canada > Ontario > Toronto (0.04)
(7 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Media (1.00)
Health & Medicine (1.00)
Law (0.93)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Mojto, Martin, Lubušký, Karol, Fikar, Miroslav, Paulen, Radoslav

Data-Based Design of Multi-Model Inferential Sensors

The nonlinear character of industrial processes is usually the main limitation to designing simple linear inferential sensors with sufficient accuracy. In order to increase the inferential sensor predictive performance and yet to maintain its linear structure, multi-model inferential sensors represent a straightforward option. In this contribution, we propose two novel approaches for the design of multi-model inferential sensors aiming to mitigate some drawbacks of the state-of-the-art approaches. For a demonstration of the developed techniques, we design inferential sensors for a Vacuum Gasoil Hydrogenation unit, which is a real-world petrochemical refinery unit. The performance of the multi-model inferential sensor is compared against various single-model inferential sensors and the current (referential) inferential sensor used in the refinery. The results show substantial improvements over the state-of-the-art design techniques for single-/multi-model inferential sensors.

artificial intelligence, inferential sensor, machine learning, (17 more...)

2308.02872

Country:

Europe > Slovakia (0.28)
North America > Canada > Quebec (0.14)
North America > United States (0.14)
Asia > Taiwan (0.14)

Genre:

Research Report > New Finding (0.87)
Research Report > Promising Solution (0.54)
Overview > Innovation (0.54)

Industry:

Energy > Oil & Gas > Downstream (1.00)
Materials > Chemicals > Commodity Chemicals > Petrochemicals (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

SoK: Privacy-Preserving Data Synthesis

Hu, Yuzheng, Wu, Fan, Li, Qinbin, Long, Yunhui, Garrido, Gonzalo Munilla, Ge, Chang, Ding, Bolin, Forsyth, David, Li, Bo, Song, Dawn

As the prevalence of data analysis grows, safeguarding data privacy has become a paramount concern. Consequently, there has been an upsurge in the development of mechanisms aimed at privacy-preserving data analyses. However, these approaches are task-specific; designing algorithms for new tasks is a cumbersome process. As an alternative, one can create synthetic data that is (ideally) devoid of private information. This paper focuses on privacy-preserving data synthesis (PPDS) by providing a comprehensive overview, analysis, and discussion of the field. Specifically, we put forth a master recipe that unifies two prominent strands of research in PPDS: statistical methods and deep learning (DL)-based methods. Under the master recipe, we further dissect the statistical methods into choices of modeling and representation, and investigate the DL-based methods by different generative modeling principles. To consolidate our findings, we provide comprehensive reference tables, distill key takeaways, and identify open problems in the existing literature. In doing so, we aim to answer the following questions: What are the design principles behind different PPDS methods? How can we categorize these methods, and what are the advantages and disadvantages associated with each category? Can we provide guidelines for method selection in different real-world scenarios? We proceed to benchmark several prominent DL-based methods on the task of private image synthesis and conclude that DP-MERF is an all-purpose approach. Finally, upon systematizing the work over the past decade, we identify future directions and call for actions from researchers.

privacy, proceedings, synthesis, (17 more...)

2307.02106

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Minnesota (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(10 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.87)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
Government (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.91)
(3 more...)

Farshidi, Siamak, Rezaee, Kiyan, Mazaheri, Sara, Rahimi, Amir Hossein, Dadashzadeh, Ali, Ziabakhsh, Morteza, Eskandari, Sadegh, Jansen, Slinger

Understanding User Intent Modeling for Conversational Recommender Systems: A Systematic Literature Review

Context: User intent modeling is a crucial process in Natural Language Processing that aims to identify the underlying purpose behind a user's request, enabling personalized responses. With a vast array of approaches introduced in the literature (over 13,000 papers in the last decade), understanding the related concepts and commonly used models in AI-based systems is essential. Method: We conducted a systematic literature review to gather data on models typically employed in designing conversational recommender systems. From the collected data, we developed a decision model to assist researchers in selecting the most suitable models for their systems. Additionally, we performed two case studies to evaluate the effectiveness of our proposed decision model. Results: Our study analyzed 59 distinct models and identified 74 commonly used features. We provided insights into potential model combinations, trends in model selection, quality concerns, evaluation measures, and frequently used datasets for training and evaluating these models. Contribution: Our study contributes practical insights and a comprehensive understanding of user intent modeling, empowering the development of more effective and personalized conversational recommender systems. With the Conversational Recommender System, researchers can perform a more systematic and efficient assessment of fitting intent modeling frameworks.

machine learning, natural language, recommender system, (14 more...)

2308.08496

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > California > San Diego County > San Diego (0.04)
Europe > Austria (0.04)
(11 more...)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Research Report > Experimental Study (0.94)

Industry:

Health & Medicine (1.00)
Information Technology > Services (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(5 more...)

Rafiei, Alireza, Moore, Ronald, Jahromi, Sina, Hajati, Farshid, Kamaleswaran, Rishikesan

Meta-learning in healthcare: A survey

Fueled by the surge in the collection of diverse data, coupled with advancements in computational models and algorithms, artificial intelligence (AI) techniques have been striving to establish a strong foothold in healthcare over the past decade [1]-[3]. This burgeoning trend has fostered a growing interest in the deployment of innovative data analysis methods and machine learning (ML) techniques across a range of healthcare applications [4]-[7]. As a specialized area within ML, meta-learning, or learning-to-learn, has recently gained significant attention due to its impressive theoretical and practical advancements, making it a primary choice for numerous applications [8]-[10].

[...] models in the healthcare domain, they typically perform well on a single task [16], [17]. Meta-learning models, however, prove beneficial both in multi-task scenarios, where task-agnostic knowledge is garnered from a suite of tasks to enhance the learning of new tasks within that suite, and in single-task scenarios, where a single problem is continually solved and refined solutions for a single problem over numerous episodes [10], [18]. This multi-task learning capability can enable a more comprehensive understanding of the complex interrelations and dependencies between various healthcare [...]

evolutionary algorithm, machine learning, pattern recognition, (16 more...)

2308.02877

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.04)
Asia > Middle East > Iran > Tehran Province > Tehran (0.04)
Oceania > Australia (0.04)
(4 more...)

Genre:

Research Report > Experimental Study (1.00)
Overview (1.00)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
(10 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.67)

Multi-Agent Verification and Control with Probabilistic Model Checking

Parker, David

Probabilistic model checking is a technique for formal automated reasoning about software or hardware systems that operate in the context of uncertainty or stochasticity. It builds upon ideas and techniques from a diverse range of fields, from logic, automata and graph theory, to optimisation, numerical methods and control. In recent years, probabilistic model checking has also been extended to integrate ideas from game theory, notably using models such as stochastic games and solution concepts such as equilibria, to formally verify the interaction of multiple rational agents with distinct objectives. This provides a means to reason flexibly about agents acting in either an adversarial or a collaborative fashion, and opens up opportunities to tackle new problems within, for example, artificial intelligence, robotics and autonomous systems. In this paper, we summarise some of the advances in this area, and highlight applications for which they have already been used. We discuss how the strengths of probabilistic model checking apply, or have the potential to apply, to the multi-agent setting and outline some of the key challenges required to make further progress in this field.
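One of the basic computations behind probabilistic model checking, the probability of eventually reaching a target state in a discrete-time Markov chain, can be sketched as follows. This is a single-agent toy illustration and not the paper's method or any tool's API; the function name, the four-state retry chain, and its transition probabilities are assumptions.

```python
import numpy as np


def reach_probability(P, targets):
    """Probability of eventually reaching any target state from each state.

    P is a row-stochastic transition matrix of shape (n, n).
    """
    n = P.shape[0]
    targets = sorted(set(targets))

    # Graph search: find states that reach a target with positive probability.
    can_reach = set(targets)
    changed = True
    while changed:
        changed = False
        for s in range(n):
            if s not in can_reach and any(P[s, t] > 0 for t in can_reach):
                can_reach.add(s)
                changed = True

    x = np.zeros(n)       # states that cannot reach a target keep probability 0
    x[targets] = 1.0      # target states have probability 1

    # Remaining states satisfy x = A x + b, solved as (I - A) x = b.
    unknown = [s for s in range(n) if s in can_reach and s not in targets]
    if unknown:
        A = P[np.ix_(unknown, unknown)]
        b = P[np.ix_(unknown, targets)].sum(axis=1)
        x[unknown] = np.linalg.solve(np.eye(len(unknown)) - A, b)
    return x


# Hypothetical retry protocol: 0 = send, 1 = retry, 2 = success, 3 = failure.
P = np.array([
    [0.0, 0.2, 0.8, 0.0],  # send: succeeds w.p. 0.8, otherwise retry
    [0.5, 0.0, 0.0, 0.5],  # retry: resend w.p. 0.5, otherwise give up
    [0.0, 0.0, 1.0, 0.0],  # success is absorbing
    [0.0, 0.0, 0.0, 1.0],  # failure is absorbing
])
probs = reach_probability(P, targets=[2])
print("P(eventually success) from send:", probs[0])
```

For this chain the equations are x0 = 0.8 + 0.2 x1 and x1 = 0.5 x0, giving x0 = 8/9. The multi-agent setting discussed in the abstract replaces the Markov chain with stochastic games, where the probabilities additionally depend on the agents' strategies.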

artificial intelligence, logic & formal reasoning, stochastic game, (14 more...)

2308.02829

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California (0.04)

Genre:

Research Report (0.50)
Overview (0.34)

Industry:

Information Technology (0.47)
Leisure & Entertainment > Games (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Towards Ubiquitous Semantic Metaverse: Challenges, Approaches, and Opportunities

Li, Kai, Lau, Billy Pik Lik, Yuan, Xin, Ni, Wei, Guizani, Mohsen, Yuen, Chau

In recent years, ubiquitous semantic Metaverse has been studied to revolutionize immersive cyber-virtual experiences for augmented reality (AR) and virtual reality (VR) users, which leverages advanced semantic understanding and representation to enable seamless, context-aware interactions within mixed-reality environments. This survey focuses on the intelligence and spatio-temporal characteristics of four fundamental system components in ubiquitous semantic Metaverse, i.e., artificial intelligence (AI), spatio-temporal data representation (STDR), semantic Internet of Things (SIoT), and semantic-enhanced digital twin (SDT). We thoroughly survey the representative techniques of the four fundamental system components that enable intelligent, personalized, and context-aware interactions with typical use cases of the ubiquitous semantic Metaverse, such as remote education, work and collaboration, entertainment and socialization, healthcare, and e-commerce marketing. Furthermore, we outline the opportunities for constructing the future ubiquitous semantic Metaverse, including scalability and interoperability, privacy and security, performance measurement and standardization, as well as ethical considerations and responsible AI. Addressing those challenges is important for creating a robust, secure, and ethically sound system environment that offers engaging immersive experiences for the users and AR/VR applications.

artificial intelligence, machine learning, metaverse, (16 more...)

doi: 10.1109/JIOT.2023.3302159

2307.06687

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Asia > Singapore (0.04)
South America > Ecuador > Guayas Province > Guayaquil (0.04)
(9 more...)

Genre:

Overview (1.00)
Research Report (0.81)

Industry:

Transportation (1.00)
Information Technology > Security & Privacy (1.00)
Energy (1.00)
(2 more...)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Internet of Things (1.00)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
(2 more...)

Recommender Systems in the Era of Large Language Models (LLMs)

Fan, Wenqi, Zhao, Zihuai, Li, Jiatong, Liu, Yunqing, Mei, Xiaowei, Wang, Yiqi, Wen, Zhen, Wang, Fei, Zhao, Xiangyu, Tang, Jiliang, Li, Qing

With the prosperity of e-commerce and web applications, Recommender Systems (RecSys) have become an important component of our daily life, providing personalized suggestions that cater to user preferences. While Deep Neural Networks (DNNs) have made significant advancements in enhancing recommender systems by modeling user-item interactions and incorporating textual side information, DNN-based methods still face limitations, such as difficulties in understanding users' interests and capturing textual side information, inabilities in generalizing to various recommendation scenarios and reasoning on their predictions, etc. Meanwhile, the emergence of Large Language Models (LLMs), such as ChatGPT and GPT4, has revolutionized the fields of Natural Language Processing (NLP) and Artificial Intelligence (AI), due to their remarkable abilities in fundamental responsibilities of language understanding and generation, as well as impressive generalization and reasoning capabilities. As a result, recent studies have attempted to harness the power of LLMs to enhance recommender systems. Given the rapid evolution of this research direction in recommender systems, there is a pressing need for a systematic overview that summarizes existing LLM-empowered recommender systems, to provide researchers in relevant fields with an in-depth understanding. Therefore, in this paper, we conduct a comprehensive review of LLM-empowered recommender systems from various aspects including Pre-training, Fine-tuning, and Prompting. More specifically, we first introduce representative methods to harness the power of LLMs (as a feature encoder) for learning representations of users and items. Then, we review recent techniques of LLMs for enhancing recommender systems from three paradigms, namely pre-training, fine-tuning, and prompting. Finally, we comprehensively discuss future directions in this emerging field.

large language model, machine learning, natural language, (19 more...)

2307.02046

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > China > Hong Kong (0.04)
North America > United States > Michigan (0.04)
(7 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.45)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)