AITopics | Personal

Personal

AI for Handball: predicting and explaining the 2024 Olympic Games tournament with Deep Learning and Large Language Models

arXiv.org Artificial Intelligence, Jul-22-2024

Over summer 2024, the world will be looking at Paris to encourage their favorite athletes to win the Olympic gold medal. In handball, a few nations will fight hard to win the precious metal, with speculation favoring France or Denmark for men and France or Norway for women. However, no scientific method has so far been proposed to predict the final results of the competition. In this work, we leverage a deep learning model to predict the results of the handball tournament of the 2024 Olympic Games. This model, coupled with explainable AI (xAI) techniques, allows us to extract insightful information about the main factors influencing the outcome of each match. Notably, xAI helps sports experts understand how factors like match information or individual athlete performance contribute to the predictions. Furthermore, we integrate Large Language Models (LLMs) to generate human-friendly explanations that highlight the most important factors impacting the match results. By providing human-centric explanations, our approach offers a deeper understanding of the AI predictions, making them more actionable for coaches and analysts.

information, language model, tournament, (14 more...)

arXiv.org Artificial Intelligence

2407.15987

Country:

Europe > Denmark (0.26)
Europe > Norway (0.25)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
(14 more...)

Genre:

Research Report (0.41)
Personal > Honors (0.34)

Industry: Leisure & Entertainment > Sports > Olympic Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#RoboCup2024 – daily digest: 20 July

AIHub, Jul-20-2024, 18:34:07 GMT

This is the second of our daily digests from RoboCup2024 in Eindhoven, The Netherlands. If you missed the first digest, which gives some background to RoboCup, you can find it here. Competitions continued across all the leagues today, with participants vying for a place in Sunday's finals. The RoboCup@Work league focusses on robots in work-related scenarios, utilizing ideas and concepts from other RoboCup competitions to tackle open research challenges in industrial and service robotics. I arrived at the arena in time to catch the advanced navigation test.

daily digest, robocup, robot, (12 more...)

AIHub

Country:

Europe > Netherlands > North Brabant > Eindhoven (0.25)
Asia > Singapore (0.06)
Oceania > Australia > New South Wales (0.05)
Europe > Portugal (0.05)

Genre: Personal (0.31)

Industry: Leisure & Entertainment > Sports > Soccer (0.99)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

'Google says I'm a dead physicist': is the world's biggest search engine broken?

The Guardian, Jul-20-2024, 11:00:35 GMT

I didn't know I was dead until I saw it on Google. When I searched my name, there it was: a picture of my smiling face next to the text "Tom Faber was a physicist and publisher, and he was a university lecturer at Cambridge for 35 years". Apparently I died on 27 July 2004, aged 77. This was news to me. The problem was the picture. When you search the name of a notable person, Google may create what it calls a "knowledge panel", a little box with basic information taken from Wikipedia. Somewhere along the way, the algorithm had confused pictures of my face with the biography of another man who shared my name. According to his obituary, he was "a distinguished physicist with a literary hinterland". Google provides a feedback form to resolve this type of bug. I filled it in several times, but it made no difference.

google, information, search engine, (16 more...)

The Guardian

Country:

North America > United States (1.00)
Asia > India > Karnataka (0.04)

Genre: Personal (0.48)

Industry:

Law (1.00)
Information Technology > Services (1.00)
Government > Regional Government > North America Government > United States Government (0.94)
Education (0.86)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.45)

PersLLM: A Personified Training Approach for Large Language Models

Zeng, Zheni, Chen, Jiayi, Chen, Huimin, Yan, Yukun, Chen, Yuxuan, Liu, Zhiyuan, Sun, Maosong

arXiv.org Artificial Intelligence, Jul-18-2024

Large language models exhibit aspects of human-level intelligence that catalyze their application as human-like agents in domains such as social simulations, human-machine interactions, and collaborative multi-agent systems. However, the absence of distinct personalities, manifested as ingratiating behaviors, inconsistent opinions, and uniform response patterns, diminishes LLMs' utility in practical applications. Addressing this, the development of personality traits in LLMs emerges as a crucial area of research to unlock their latent potential. Existing methods to personify LLMs generally involve strategies like employing stylized training data for instruction tuning or using prompt engineering to simulate different personalities. These methods only capture superficial linguistic styles instead of the core of personalities and are therefore not stable. In this study, we propose PersLLM, integrating psychology-grounded principles of personality (social practice, consistency, and dynamic development) into a comprehensive training methodology. We incorporate personality traits directly into the model parameters, enhancing the model's resistance to induction, promoting consistency, and supporting the dynamic evolution of personality. Single-agent evaluation validates our method's superiority, as it produces responses more aligned with reference personalities compared to other approaches. Case studies for multi-agent communication highlight its benefits in enhancing opinion consistency within individual agents and fostering collaborative creativity among multiple agents in dialogue contexts, potentially benefiting human simulation and multi-agent cooperation. Additionally, human-agent interaction evaluations indicate that our personified models significantly enhance interactive experiences, underscoring the practical implications of our research.

architecture, knowledge, personality, (15 more...)

arXiv.org Artificial Intelligence

2407.12393

Country:

Asia > China > Hong Kong (0.04)
Asia > South Korea (0.04)
Asia > China > Beijing > Beijing (0.04)
(3 more...)

Genre:

Personal > Interview (0.93)
Research Report > New Finding (0.87)

Industry:

Education (1.00)
Banking & Finance (1.00)
Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II

Wu, Rixin, Wang, Ran, Hao, Jie, Wu, Qiang, Wang, Ping, Niyato, Dusit

arXiv.org Artificial Intelligence, Jul-17-2024

This paper proposes a weight-aware deep reinforcement learning (WADRL) approach designed to address the multiobjective vehicle routing problem with time windows (MOVRPTW), aiming to use a single deep reinforcement learning (DRL) model to solve the entire multiobjective optimization problem. The Non-dominated sorting genetic algorithm-II (NSGA-II) method is then employed to optimize the outcomes produced by the WADRL, thereby mitigating the limitations of both approaches. Firstly, we design an MOVRPTW model to balance the minimization of travel cost and the maximization of customer satisfaction. Subsequently, we present a novel DRL framework that incorporates a transformer-based policy network. This network is composed of an encoder module, a weight embedding module where the weights of the objective functions are incorporated, and a decoder module. NSGA-II is then utilized to optimize the solutions generated by WADRL. Finally, extensive experimental results demonstrate that our method outperforms the existing and traditional methods. Due to the numerous constraints in VRPTW, generating initial solutions of the NSGA-II algorithm can be time-consuming. However, using solutions generated by the WADRL as initial solutions for NSGA-II significantly reduces the time required for generating initial solutions. Meanwhile, the NSGA-II algorithm can enhance the quality of solutions generated by WADRL, resulting in solutions with better scalability. Notably, the weight-aware strategy significantly reduces the training time of DRL while achieving better results, enabling a single DRL model to solve the entire multiobjective optimization problem.

algorithm, time window, vehicle, (16 more...)

arXiv.org Artificial Intelligence

2407.13113

Country:

Asia > China > Jiangsu Province > Nanjing (0.05)
Asia > Singapore (0.05)
North America > Canada > Manitoba (0.04)
(4 more...)

Genre:

Research Report (1.00)
Personal > Honors (0.67)

Industry: Transportation > Freight & Logistics Services (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Building Intelligence Identification System via Large Language Model Watermarking: A Survey and Beyond

Wang, Xuhong, Jiang, Haoyu, Yu, Yi, Yu, Jingru, Lin, Yilun, Yi, Ping, Wang, Yingchun, Yu, Qiao, Li, Li, Wang, Fei-Yue

arXiv.org Artificial Intelligence, Jul-16-2024

Large Language Models (LLMs) are increasingly integrated into diverse industries, posing substantial security risks due to unauthorized replication and misuse. To mitigate these concerns, robust identification mechanisms are widely acknowledged as an effective strategy. Identification systems for LLMs now rely heavily on watermarking technology to manage and protect intellectual property and ensure data security. However, previous studies have primarily concentrated on the basic principles of algorithms and lacked a comprehensive analysis of watermarking theory and practice from the perspective of intelligent identification. To bridge this gap, firstly, we explore how a robust identity recognition system can be effectively implemented and managed within LLMs by various participants using watermarking technology. Secondly, we propose a mathematical framework based on mutual information theory, which systematizes the identification process to achieve more precise and customized watermarking. Additionally, we present a comprehensive evaluation of performance metrics for LLM watermarking, reflecting participant preferences and advancing discussions on its identification applications. Lastly, we outline the existing challenges in current watermarking technologies and theoretical frameworks, and provide directional guidance to address these challenges. Our systematic classification and detailed exposition aim to enhance the comparison and evaluation of various methods, fostering further research and development toward a transparent, secure, and equitable LLM ecosystem.

information, llm, watermark, (13 more...)

arXiv.org Artificial Intelligence

2407.111

Country:

Europe > Germany (0.04)
Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Beijing > Beijing (0.04)
(7 more...)

Genre:

Research Report (1.00)
Overview (1.00)
Personal > Honors (0.46)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (0.92)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Tackling Challenges in Implementing Large-Scale Graph Databases

Communications of the ACM, Jul-15-2024, 17:10:51 GMT

Graph databases (GDBs) [13, 30] have gained momentum with the rise of large unstructured repositories of information that emphasize relations between entities. Dozens of GDB management systems [8, 22, 25, 31], prototypes [1, 2, 15, 21], models and languages [3, 10, 12, 14], large knowledge graphs like Wikidata [33], and efforts from companies like Apache, Facebook, Google, Microsoft, Neo4j, and Oracle illustrate the growing interest in this technology. While the expressive power and flexibility of their data model and query languages are the key to their success, the efficiency challenges posed by their implementation are the main obstacle to the wider adoption of GDBs. Latin America has a long-standing tradition in fundamental research areas like database theory, string processing, information retrieval, and the design and analysis of algorithms and data structures, all of which are relevant for the development of GDBs. In the last few years, several researchers in Chile started collaborating on algorithms and systems for evaluating complex queries on large-scale GDBs.

algorithm, query, triple pattern, (15 more...)

Communications of the ACM

Country:

North America > Central America (0.61)
South America > Chile (0.25)

Genre: Personal > Honors (0.53)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.73)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.72)
Information Technology > Communications > Social Media (0.46)

SENTINEL: Securing Indoor Localization against Adversarial Attacks with Capsule Neural Networks

Gufran, Danish, Anandathirtha, Pooja, Pasricha, Sudeep

arXiv.org Artificial Intelligence, Jul-14-2024

With the increasing demand for edge device powered location-based services in indoor environments, Wi-Fi received signal strength (RSS) fingerprinting has become popular, given the unavailability of GPS indoors. However, achieving robust and efficient indoor localization faces several challenges, due to RSS fluctuations from dynamic changes in indoor environments and heterogeneity of edge devices, leading to diminished localization accuracy. While advances in machine learning (ML) have shown promise in mitigating these phenomena, it remains an open problem. Additionally, emerging threats from adversarial attacks on ML-enhanced indoor localization systems, especially those introduced by malicious or rogue access points (APs), can deceive ML models to further increase localization errors. To address these challenges, we present SENTINEL, a novel embedded ML framework utilizing modified capsule neural networks to bolster the resilience of indoor localization solutions against adversarial attacks, device heterogeneity, and dynamic RSS fluctuations. We also introduce RSSRogueLoc, a novel dataset capturing the effects of rogue APs from several real-world indoor environments. Experimental evaluations demonstrate that SENTINEL achieves significant improvements, with up to 3.5x reduction in mean error and 3.4x reduction in worst-case error compared to state-of-the-art frameworks using simulated adversarial attacks. SENTINEL also achieves improvements of up to 2.8x in mean error and 2.7x in worst-case error compared to state-of-the-art frameworks when evaluated with the real-world RSSRogueLoc dataset.

indoor localization, neural network, perturbation, (13 more...)

arXiv.org Artificial Intelligence

2407.11091

Country:

Asia > India (0.04)
North America > United States > Colorado > Larimer County > Fort Collins (0.04)
North America > United States > California > Orange County > Irvine (0.04)

Genre:

Research Report (0.64)
Personal (0.46)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Interview with Sherry Yang: Learning interactive real-world simulators

AIHub, Jul-11-2024, 09:30:45 GMT

Sherry Yang, Yilun Du, Kamyar Ghasemipour, Jonathan Tompson, Leslie Kaelbling, Dale Schuurmans and Pieter Abbeel won an outstanding paper award at ICLR2024 for their work Learning Interactive Real-World Simulators. In the paper, they introduce a universal simulator (called UniSim) which takes image and text input to train a robot simulator. We spoke to Sherry about this work, some of the challenges, and potential applications. There are two components – there is the universal component and then there is a simulator component. Looking at the simulator component first – typically when people build a simulator, they do this based on an understanding of the real world, using physics equations. Researchers will build a simulator to study how things work, such as how cars move, for example.

interaction, simulator, video, (13 more...)

AIHub

Genre: Personal > Honors (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.72)

On LLM Wizards: Identifying Large Language Models' Behaviors for Wizard of Oz Experiments

Fang, Jingchao, Arechiga, Nikos, Namaoshi, Keiichi, Bravo, Nayeli, Hogan, Candice, Shamma, David A.

arXiv.org Artificial Intelligence, Jul-10-2024

The Wizard of Oz (WoZ) method is a widely adopted research approach where a human Wizard "role-plays" a not readily available technology and interacts with participants to elicit user behaviors and probe the design space. With the growing ability for modern large language models (LLMs) to role-play, one can apply LLMs as Wizards in WoZ experiments with better scalability and lower cost than the traditional approach. However, methodological guidance on responsibly applying LLMs in WoZ experiments and a systematic evaluation of LLMs' role-playing ability are lacking. Through two LLM-powered WoZ studies, we take the first step towards identifying an experiment lifecycle for researchers to safely integrate LLMs into WoZ experiments and interpret data generated from settings that involve Wizards role-played by LLMs. We also contribute a heuristic-based evaluation framework that allows the estimation of LLMs' role-playing ability in WoZ experiments and reveals LLMs' behavior patterns at scale.

Figure 1: An overview of our proposed experiment lifecycle compared to traditional Wizard of Oz experiments. We ask GPT-4 empowered agents to play the role of "Wizards" in conversation-based Wizard of Oz experiments. The agents talk to either Simulacrums powered by GPT-4 (in Study 1) or ...

experiment, wizard, wol, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3652988.3673967

2407.08067

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > California > Santa Clara County > Los Altos (0.04)
(22 more...)

Genre:

Research Report > Experimental Study (0.93)
Personal > Interview (0.93)

Industry:

Transportation > Ground > Road (1.00)
Health & Medicine (1.00)
Energy > Renewable (1.00)
(5 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
