Overview
Dynamic technology impact analysis: A multi-task learning approach to patent citation prediction
Seol, Youngjin, Choi, Jaewoong, Lee, Seunghyun, Yoon, Janghyeok
Machine learning (ML) models are valuable tools for analyzing the impact of technology using patent citation information. However, existing ML-based methods often struggle to account for the dynamic nature of the technology impact over time and the interdependencies of these impacts across different periods. This study proposes a multi-task learning (MTL) approach to enhance the prediction of technology impact across various time frames by leveraging knowledge sharing and simultaneously monitoring the evolution of technology impact. First, we quantify the technology impacts and identify patterns through citation analysis over distinct time periods. Next, we develop MTL models to predict citation counts using multiple patent indicators over time. Finally, we examine the changes in key input indicators and their patterns over different periods using the SHapley Additive exPlanation method. We also offer guidelines for validating and interpreting the results by employing statistical methods and natural language processing techniques. A case study on battery technologies demonstrates that our approach not only deepens the understanding of 1 technology impact, but also improves prediction accuracy, yielding valuable insights for both academia and industry.
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Ma, Longxuan, Li, Mingda, Zhang, Weinan, Li, Jiapeng, Liu, Ting
Incorporating external knowledge into dialogue generation has been proven to benefit the performance of an open-domain Dialogue System (DS), such as generating informative or stylized responses, controlling conversation topics. In this article, we study the open-domain DS that uses unstructured text as external knowledge sources (\textbf{U}nstructured \textbf{T}ext \textbf{E}nhanced \textbf{D}ialogue \textbf{S}ystem, \textbf{UTEDS}). The existence of unstructured text entails distinctions between UTEDS and traditional data-driven DS and we aim to analyze these differences. We first give the definition of the UTEDS related concepts, then summarize the recently released datasets and models. We categorize UTEDS into Retrieval and Generative models and introduce them from the perspective of model components. The retrieval models consist of Fusion, Matching, and Ranking modules, while the generative models comprise Dialogue and Knowledge Encoding, Knowledge Selection, and Response Generation modules. We further summarize the evaluation methods utilized in UTEDS and analyze the current models' performance. At last, we discuss the future development trends of UTEDS, hoping to inspire new research in this field.
Language-Model Prior Overcomes Cold-Start Items
Wang, Shiyu, Ding, Hao, Gu, Yupeng, Aydore, Sergul, Kalantari, Kousha, Kveton, Branislav
The growth of recommender systems (RecSys) is driven by digitization and the need for personalized content in areas such as e-commerce and video streaming. The content in these systems often changes rapidly and therefore they constantly face the ongoing cold-start problem, where new items lack interaction data and are hard to value. Existing solutions for the cold-start problem, such as content-based recommenders and hybrid methods, leverage item metadata to determine item similarities. The main challenge with these methods is their reliance on structured and informative metadata to capture detailed item similarities, which may not always be available. This paper introduces a novel approach for cold-start item recommendation that utilizes the language model (LM) to estimate item similarities, which are further integrated as a Bayesian prior with classic recommender systems. This approach is generic and able to boost the performance of various recommenders. Specifically, our experiments integrate it with both sequential and collaborative filtering-based recommender and evaluate it on two real-world datasets, demonstrating the enhanced performance of the proposed approach.
Optimisation Strategies for Ensuring Fairness in Machine Learning: With and Without Demographics
Ensuring fairness has emerged as one of the primary concerns in AI and its related algorithms. Over time, the field of machine learning fairness has evolved to address these issues. This paper provides an extensive overview of this field and introduces two formal frameworks to tackle open questions in machine learning fairness. In one framework, operator-valued optimisation and min-max objectives are employed to address unfairness in time-series problems. This approach showcases state-of-the-art performance on the notorious COMPAS benchmark dataset, demonstrating its effectiveness in real-world scenarios. In the second framework, the challenge of lacking sensitive attributes, such as gender and race, in commonly used datasets is addressed. This issue is particularly pressing because existing algorithms in this field predominantly rely on the availability or estimations of such attributes to assess and mitigate unfairness. Here, a framework for a group-blind bias-repair is introduced, aiming to mitigate bias without relying on sensitive attributes. The efficacy of this approach is showcased through analyses conducted on the Adult Census Income dataset. Additionally, detailed algorithmic analyses for both frameworks are provided, accompanied by convergence guarantees, ensuring the robustness and reliability of the proposed methodologies.
The Systems Engineering Approach in Times of Large Language Models
Cabrera, Christian, Bastidas, Viviana, Schooling, Jennifer, Lawrence, Neil D.
Using Large Language Models (LLMs) to address critical societal problems requires adopting this novel technology into socio-technical systems. However, the complexity of such systems and the nature of LLMs challenge such a vision. It is unlikely that the solution to such challenges will come from the Artificial Intelligence (AI) community itself. Instead, the Systems Engineering approach is better equipped to facilitate the adoption of LLMs by prioritising the problems and their context before any other aspects. This paper introduces the challenges LLMs generate and surveys systems research efforts for engineering AI-based systems. We reveal how the systems engineering principles have supported addressing similar issues to the ones LLMs pose and discuss our findings to provide future directions for adopting LLMs.
Clustered Patch Embeddings for Permutation-Invariant Classification of Whole Slide Images
Gupta, Ravi Kant, Das, Shounak, Sethi, Amit
In the evolving field of digital pathology, Whole Slide Imaging (WSI) has emerged as a transformative technology, enabling the digitization of histopathological slides at gigapixel resolution. This advancement has not only facilitated remote diagnostics and educational opportunities but also opened new avenues for quantitative image analysis [1, 2]. Despite its potential, the sheer size and complexity of WSIs pose significant computational challenges, limiting the practicality of large-scale analysis and the application of advanced machine learning techniques [3, 4]. Whole slide imaging (WSI) represents a significant breakthrough in digital pathology, enabling the digitization of histological slides at high resolutions. This advancement allows for improved visualization, analysis, and management of tissue samples, essential for accurate disease diagnosis and research. However, the sheer size and complexity of WSIs pose unique challenges in image processing and analysis, necessitating innovative approaches for efficient and effective feature extraction and classification. Traditional methods for analyzing WSIs often rely on supervised learning techniques, which require extensive annotated datasets prepared by expert pathologists. This process is not only time-consuming but also prone to variability due to inter-observer differences.
Lo-MARVE: A Low Cost Autonomous Underwater Vehicle for Marine Exploration
This paper presents Low-cost Marine Autonomous Robotic Vehicle Explorer (Lo-MARVE), a novel autonomous underwater vehicle (AUV) designed to provide a low cost solution for underwater exploration and environmental monitoring in shallow water environments. Lo-MARVE offers a cost-effective alternative to existing AUVs, featuring a modular design, low-cost sensors, and wireless communication capabilities. The total cost of Lo-MARVE is approximately EUR 500. Lo-MARVE is developed using the Raspberry Pi 4B microprocessor, with control software written in Python. The proposed AUV was validated through field testing outside of a laboratory setting, in the freshwater environment of the River Corrib in Galway, Ireland. This demonstrates its ability to navigate autonomously, collect data, and communicate effectively outside of a controlled laboratory setting. The successful deployment of Lo-MARVE in a real-world environment validates its proof of concept.
Intelligent Algorithms For Signature Diagnostics Of Three-Phase Motors
Svirin, Stepan, Ryzhikov, Artem, Ali, Saraa, Derkach, Denis
Traditional diagnostic methods for these engines predominantly rely on signature analysis, a technique that examines the engine's operational patterns to detect anomalies [1]. While signature analysis has become a de-facto standard due to its effectiveness, it has some substantial limitations, and the growing complexity of modern engines and the vast amounts of data they generate require more advanced and precise diagnostic frameworks [2]. At the same time, machine learning (ML) and artificial intelligence (AI) have emerged as essential tools integrated into various aspects of modern life, from recommendation algorithms [3] to healthcare [4] applications. The potential for advancement and innovation in these fields is immense. Despite this, the application of ML in industrial settings remains underexplored, primarily due to the scarcity of publicly available labeled datasets, especially with malfunctioning engines This lack of data poses significant challenges when transitioning ML solutions from experimental phases to full-scale production, especially given the complexities and variability of real-world conditions [5].
Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding
Ji, Deyi, Zhu, Lanyun, Gao, Siqi, Xu, Peng, Lu, Hongtao, Ye, Jieping, Zhao, Feng
The ubiquity and value of tables as semi-structured data across various domains necessitate advanced methods for understanding their complexity and vast amounts of information. Despite the impressive capabilities of large language models (LLMs) in advancing the natural language understanding frontier, their application to large-scale tabular data presents significant challenges, specifically regarding table size and complex intricate relationships. Existing works have shown promise with small-scale tables but often flounder when tasked with the complex reasoning required by larger, interconnected tables found in real-world scenarios. To address this gap, we introduce "Tree-of-Table", a novel approach designed to enhance LLMs' reasoning capabilities over large and complex tables. Our method employs Table Condensation and Decomposition to distill and reorganize relevant data into a manageable format, followed by the construction of a hierarchical Table-Tree that facilitates tree-structured reasoning. Through a meticulous Table-Tree Execution process, we systematically unravel the tree-structured reasoning chain to derive the solutions. Experiments across diverse datasets, including WikiTQ, TableFact, FeTaQA, and BIRD, demonstrate that Tree-of-Table sets a new benchmark with superior performance, showcasing remarkable efficiency and generalization capabilities in large-scale table reasoning.
RLInspect: An Interactive Visual Approach to Assess Reinforcement Learning Algorithm
Kalra, Geetansh, Singh, Divye, Jose, Justin
Reinforcement Learning (RL) is a rapidly growing area of machine learning that finds its application in a broad range of domains, from finance and healthcare to robotics and gaming. Compared to other machine learning techniques, RL agents learn from their own experiences using trial and error, and improve their performance over time. However, assessing RL models can be challenging, which makes it difficult to interpret their behaviour. While reward is a widely used metric to evaluate RL models, it may not always provide an accurate measure of training performance. In some cases, the reward may seem increasing while the model's performance is actually decreasing, leading to misleading conclusions about the effectiveness of the training. To overcome this limitation, we have developed RLInspect - an interactive visual analytic tool, that takes into account different components of the RL model - state, action, agent architecture and reward, and provides a more comprehensive view of the RL training. By using RLInspect, users can gain insights into the model's behaviour, identify issues during training, and potentially correct them effectively, leading to a more robust and reliable RL system.