AITopics

2309.1446

Genre:

Instructional Material > Online (0.60)
Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Troxler, Andreas, Schelldorfer, Jürg

Actuarial Applications of Natural Language Processing Using Transformers: Case Studies for Using Text Features in an Actuarial Context

arXiv.org Artificial IntelligenceSep-25-2023

This tutorial demonstrates workflows to incorporate text data into actuarial classification and regression tasks. The main focus is on methods employing transformer-based models. A dataset of car accident descriptions with an average length of 400 words, available in English and German, and a dataset with short property insurance claims descriptions are used to demonstrate these techniques. The case studies tackle challenges related to a multi-lingual setting and long input sequences. They also show ways to interpret model output, to assess and improve model performance, by fine-tuning the models to the domain of application or to a specific prediction task. Finally, the tutorial provides practical approaches to handle classification tasks in situations with no or only few labeled data, including but not limited to ChatGPT. The results achieved by using the language-understanding skills of off-the-shelf natural language processing (NLP) models with only minimal pre-processing and fine-tuning clearly demonstrate the power of transfer learning for practical applications.

actuarial application, natural language processing, sequence, (15 more...)

2206.02014

Country:

North America > United States > Wisconsin (0.04)
North America > United States > Montana (0.04)
Europe > Switzerland (0.04)
(3 more...)

Genre:

Workflow (1.00)
Overview (0.93)
Instructional Material > Course Syllabus & Notes (0.66)

Industry:

Transportation > Ground > Road (1.00)
Health & Medicine (1.00)
Banking & Finance > Insurance (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(3 more...)

arXiv.org Artificial IntelligenceSep-24-2023

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

Zhang, Yue, Li, Yafu, Cui, Leyang, Cai, Deng, Liu, Lemao, Fu, Tingchen, Huang, Xinting, Zhao, Enbo, Zhang, Yu, Chen, Yulong, Wang, Longyue, Luu, Anh Tuan, Bi, Wei, Shi, Freda, Shi, Shuming

While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge to the reliability of LLMs in real-world scenarios. In this paper, we survey recent efforts on the detection, explanation, and mitigation of hallucination, with an emphasis on the unique challenges posed by LLMs. We present taxonomies of the LLM hallucination phenomena and evaluation benchmarks, analyze existing approaches aiming at mitigating LLM hallucination, and discuss potential directions for future research.

hallucination, llm, preprint arxiv, (16 more...)

2309.01219

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China (0.04)
(10 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Leisure & Entertainment (1.00)
Health & Medicine (1.00)
Media (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Communications of the ACMSep-23-2023, 15:55:25 GMT

Barbershop Computing

It takes a village: Supporting inquiry- and equity-oriented computer science pedagogy through a professional learning community.

barbershop computing, computer science education, participation, (10 more...)

Communications of the ACM

Country:

North America > United States > Michigan > Ingham County > Lansing (0.05)
North America > United States > Michigan > Ingham County > East Lansing (0.05)
North America > United States > South Carolina > Richland County > Columbia (0.05)
North America > United States > Pennsylvania (0.04)

Genre: Instructional Material (0.89)

Industry:

Education > Educational Setting > Higher Education (0.69)
Education > Curriculum > Subject-Specific Education (0.55)

Technology: Information Technology > Artificial Intelligence (0.71)

Soutif--Cormerais, Albin, Carta, Antonio, Cossu, Andrea, Hurtado, Julio, Hemati, Hamed, Lomonaco, Vincenzo, Van de Weijer, Joost

A Comprehensive Empirical Evaluation on Online Continual Learning

arXiv.org Artificial IntelligenceSep-23-2023

Online continual learning aims to get closer to a live learning experience by learning directly on a stream of data with temporally shifting distribution and by storing a minimum amount of data from that stream. In this empirical evaluation, we evaluate various methods from the literature that tackle online continual learning. More specifically, we focus on the class-incremental setting in the context of image classification, where the learner must learn new classes incrementally from a stream of data. We compare these methods on the Split-CIFAR100 and Split-TinyImagenet benchmarks, and measure their average accuracy, forgetting, stability, and quality of the representations, to evaluate various aspects of the algorithm at the end but also during the whole training period. We find that most methods suffer from stability and underfitting issues. However, the learned representations are comparable to i.i.d. training under the same computational budget. No clear winner emerges from the results and basic experience replay, when properly tuned and implemented, is a very strong baseline. We release our modular and extensible codebase at https://github.com/AlbinSou/ocl_survey based on the avalanche framework to reproduce our results and encourage future research.

accuracy, continual learning, learning, (12 more...)

2308.10328

Country:

Europe > Italy > Tuscany > Pisa Province > Pisa (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Europe > Switzerland (0.04)

Genre:

Research Report (1.00)
Instructional Material > Online (0.92)

Industry: Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.34)

Yfantidou, Sofia, Spathis, Dimitris, Constantinides, Marios, Xia, Tong, van Berkel, Niels

FairComp: Workshop on Fairness and Robustness in Machine Learning for Ubiquitous Computing

arXiv.org Artificial IntelligenceSep-22-2023

How can we ensure that Ubiquitous Computing (UbiComp) research outcomes are both ethical and fair? While fairness in machine learning (ML) has gained traction in recent years, fairness in UbiComp remains unexplored. This workshop aims to discuss fairness in UbiComp research and its social, technical, and legal implications. From a social perspective, we will examine the relationship between fairness and UbiComp research and identify pathways to ensure that ubiquitous technologies do not cause harm or infringe on individual rights. From a technical perspective, we will initiate a discussion on data practices to develop bias mitigation approaches tailored to UbiComp research. From a legal perspective, we will examine how new policies shape our community's work and future research. We aim to foster a vibrant community centered around the topic of responsible UbiComp, while also charting a clear path for future research endeavours in this field.

fairness, ubicomp, workshop, (15 more...)

doi: 10.1145/3594739.3605107

2309.12877

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.29)
North America > Mexico > Quintana Roo > Cancún (0.06)
North America > United States > New York > New York County > New York City (0.05)
(3 more...)

Genre:

Instructional Material > Course Syllabus & Notes (1.00)
Research Report (0.82)

Industry:

Law (1.00)
Health & Medicine (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.69)

Uncertainty-driven Exploration Strategies for Online Grasp Learning

Shi, Yitian, Schillinger, Philipp, Gabriel, Miroslav, Kuss, Alexander, Feldman, Zohar, Ziesche, Hanna, Vien, Ngo Anh

Existing grasp prediction approaches are mostly based on offline learning, while, ignored the exploratory grasp learning during online adaptation to new picking scenarios, i.e., unseen object portfolio, camera and bin settings etc. In this paper, we present a novel method for online learning of grasp predictions for robotic bin picking in a principled way. Existing grasp prediction approaches are mostly based on offline learning, while, ignored the exploratory grasp learning during online adaptation to new picking scenarios, i.e., unseen object portfolio, camera and bin settings etc. In this paper, we present a novel method for online learning of grasp predictions for robotic bin picking in a principled way. Specifically, the online learning algorithm with an effective exploration strategy can significantly improve its adaptation performance to unseen environment settings. To this end, we first propose to formulate online grasp learning as a RL problem that will allow to adapt both grasp reward prediction and grasp poses. We propose various uncertainty estimation schemes based on Bayesian Uncertainty Quantification and Distributional Ensembles. We carry out evaluations on real-world bin picking scenes of varying difficulty. The objects in the bin have various challenging physical and perceptual characteristics that can be characterized by semi- or total transparency, and irregular or curved surfaces. The results of our experiments demonstrate a notable improvement in the suggested approach compared to conventional online learning methods which incorporate only naive exploration strategies.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

2309.12038

Country:

Europe > Germany (0.14)
Asia > China (0.14)

Genre:

Research Report > New Finding (0.93)
Instructional Material > Online (0.62)

Industry:

Education > Educational Setting > Online (1.00)
Energy > Oil & Gas > Upstream (0.82)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Gupta, Deepak, Attal, Kush, Demner-Fushman, Dina

Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches

The increase in the availability of online videos has transformed the way we access information and knowledge. A growing number of individuals now prefer instructional videos as they offer a series of step-by-step procedures to accomplish particular tasks. The instructional videos from the medical domain may provide the best possible visual answers to first aid, medical emergency, and medical education questions. Toward this, this paper is focused on answering health-related questions asked by the public by providing visual answers from medical videos. The scarcity of large-scale datasets in the medical domain is a key challenge that hinders the development of applications that can help the public with their health-related questions. To address this issue, we first proposed a pipelined approach to create two large-scale datasets: HealthVidQA-CRF and HealthVidQA-Prompt. Later, we proposed monomodal and multimodal approaches that can effectively provide visual answers from medical videos to natural language questions. We conducted a comprehensive analysis of the results, focusing on the impact of the created datasets on model training and the significance of visual features in enhancing the performance of the monomodal and multi-modal approaches. Our findings suggest that these datasets have the potential to enhance the performance of medical visual answer localization tasks and provide a promising future direction to further enhance the performance by using pre-trained language-vision models.

dataset, proceedings, video, (12 more...)

2309.12224

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > New York > New York County > New York City (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(3 more...)

Genre:

Research Report > New Finding (0.86)
Instructional Material > Course Syllabus & Notes (0.57)

Industry:

Health & Medicine (1.00)
Education > Educational Technology > Audio & Video (0.55)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap

Kim, Daehee, Kim, Yoonsik, Kim, DongHyun, Lim, Yumin, Kim, Geewook, Kil, Taeho

bridging domain gap, character-wise supervised contrastive learning, online text rendering, (1 more...)

Inspired by the great success of language model (LM)-based pre-training, recent studies in visual document understanding have explored LM-based pre-training methods for modeling text within document images. Among them, pre-training that reads all text from an image has shown promise, but often exhibits instability and even fails when applied to broader domains, such as those involving both visual documents and scene text images. This is a substantial limitation for real-world scenarios, where the processing of text image inputs in diverse domains is essential. In this paper, we investigate effective pre-training tasks in the broader domains and also propose a novel pre-training method called SCOB that leverages character-wise supervised contrastive learning with online text rendering to effectively pre-train document and scene text domains by bridging the domain gap. Moreover, SCOB enables weakly supervised learning, significantly reducing annotation costs. Extensive benchmarks demonstrate that SCOB generally improves vanilla pre-training methods and achieves comparable performance to state-of-the-art methods. Our findings suggest that SCOB can be served generally and effectively for read-type pre-training methods. The code will be available at https://github.com/naver-ai/scob.

2309.12382

Genre:

Instructional Material > Online (0.60)
Instructional Material > Course Syllabus & Notes (0.60)
Research Report > New Finding (0.53)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Shaikh, Omar, Chai, Valentino, Gelfand, Michele J., Yang, Diyi, Bernstein, Michael S.

Rehearsal: Simulating Conflict to Teach Conflict Resolution

Interpersonal conflict is an uncomfortable but unavoidable fact of life. Navigating conflict successfully is a skill -- one that can be learned through deliberate practice -- but few have access to effective training or feedback. To expand this access, we introduce Rehearsal, a system that allows users to rehearse conflicts with a believable simulated interlocutor, explore counterfactual "what if?" scenarios to identify alternative conversational paths, and learn through feedback on how and when to apply specific conflict strategies. Users can utilize Rehearsal to practice handling a variety of predefined conflict scenarios, from office disputes to relationship issues, or they can choose to create their own. To enable Rehearsal, we develop IRP prompting, a method of conditioning output of a large language model on the influential Interest-Rights-Power (IRP) theory from conflict resolution. Rehearsal uses IRP to generate utterances grounded in conflict resolution theory, guiding users towards counterfactual conflict resolution strategies that help de-escalate difficult conversations. In a between-subjects evaluation, 40 participants engaged in an actual conflict with a confederate after training. Compared to a control group with lecture material covering the same IRP theory, participants with simulated training from Rehearsal significantly improved their performance in the unaided conflict: they reduced their use of escalating competitive strategies by an average of 67%, while doubling their use of cooperative strategies. Overall, Rehearsal highlights the potential effectiveness of language models as tools for learning and practicing interpersonal skills.

conflict, participant, rehearsal, (12 more...)

2309.12309

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > Canada > Ontario > Toronto (0.04)
(7 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Health & Medicine (0.93)
Education > Curriculum > Subject-Specific Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)