AITopics | Instructional Material

Instructional Material

LLM-Detector: Improving AI-Generated Chinese Text Detection with Open-Source LLM Instruction Tuning

Wang, Rongsheng, Chen, Haoming, Zhou, Ruizhe, Ma, Han, Duan, Yaofei, Kang, Yanlan, Yang, Songhua, Fan, Baoyu, Tan, Tao

arXiv.org Artificial Intelligence | Feb-2-2024

ChatGPT and other general large language models (LLMs) have achieved remarkable success, but they have also raised concerns about the misuse of AI-generated texts. Existing AI-generated text detection models, such as those based on BERT and RoBERTa, are prone to in-domain over-fitting, leading to poor out-of-domain (OOD) detection performance. In this paper, we first collected Chinese text responses generated by human experts and 9 types of LLMs to questions from multiple domains, and further created a dataset that mixed human-written sentences and sentences polished by LLMs. We then proposed LLM-Detector, a novel method for both document-level and sentence-level text detection through Instruction Tuning of LLMs. Our method leverages the wealth of knowledge LLMs acquire during pre-training, enabling them to detect the text they generate. Instruction tuning aligns the model's responses with the user's expected text detection tasks. Experimental results show that previous methods struggle with sentence-level AI-generated text detection and OOD detection. In contrast, our proposed method not only significantly outperforms baseline methods in both sentence-level and document-level text detection but also demonstrates strong generalization capabilities. Furthermore, since LLM-Detector is trained based on open-source LLMs, it is easy to customize for deployment.

dataset, detection, llm-detector, (9 more...)

arXiv.org Artificial Intelligence

2402.01158

Country:

North America > United States > California > Santa Clara County > Stanford (0.04)
Europe > United Kingdom > England > Greater Manchester > Salford (0.04)
Asia > Macao (0.04)
Asia > China > Hubei Province > Wuhan (0.04)

Genre:

Research Report > New Finding (0.66)
Instructional Material > Online (0.42)
Instructional Material > Course Syllabus & Notes (0.42)

Industry:

Information Technology > Security & Privacy (0.46)
Education > Educational Setting > Online (0.42)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)

Online Transfer Learning for RSV Case Detection

Sun, Yiming, Gao, Yuhe, Bao, Runxue, Cooper, Gregory F., Espino, Jessi, Hochheiser, Harry, Michaels, Marian G., Aronis, John M., Ye, Ye

arXiv.org Artificial Intelligence | Feb-2-2024

Machine learning has made substantial advancements in recent decades, with its applications spanning a wide range of fields such as image and speech recognition, natural language processing, and autonomous driving. Despite these achievements, machine learning in biomedicine faces significant challenges, particularly in data collection. The acquisition of labeled data can be very costly or even unfeasible due to factors like ethical considerations, patient privacy, and the scarcity of certain diseases. These challenges have led researchers to increasingly rely on utilizing data from related domains that have a more abundant supply of data. In such cases, transferring knowledge from the source domain becomes crucial, particularly because the limited initial data in the target domain may be insufficient for effective learning. The extensive and diverse information available from the source domains can significantly compensate for this shortfall, providing a foundational knowledge base that the model can build upon as more target domain data becomes available. Therefore, the efficiency and effectiveness of learning in the target domain are greatly enhanced by the transferred knowledge from the source domains. Online transfer learning entails leveraging knowledge from a static source domain and applying it to an ongoing, evolving ...

classifier, ensemble model, target domain, (17 more...)

arXiv.org Artificial Intelligence

2402.01987

Country:

North America > United States (0.29)
Asia > Singapore (0.04)
Europe > Greece (0.04)
Asia > Middle East > Israel > Haifa District > Haifa (0.04)

Genre:

Research Report > New Finding (0.95)
Instructional Material > Online (0.72)
Research Report > Experimental Study (0.69)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.66)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Nonlinear Filtering with Brenier Optimal Transport Maps

Al-Jarrah, Mohammad, Jin, Niyizhen, Hosseini, Bamdad, Taghvaei, Amirhossein

arXiv.org Artificial Intelligence | Feb-2-2024

This paper is concerned with the problem of nonlinear filtering, i.e., computing the conditional distribution of the state of a stochastic dynamical system given a history of noisy partial observations. Conventional sequential importance resampling (SIR) particle filters suffer from fundamental limitations, in scenarios involving degenerate likelihoods or high-dimensional states, due to the weight degeneracy issue. In this paper, we explore an alternative method, which is based on estimating the Brenier optimal transport (OT) map from the current prior distribution of the state to the posterior distribution at the next time step. Unlike SIR particle filters, the OT formulation does not require the analytical form of the likelihood. Moreover, it allows us to harness the approximation power of neural networks to model complex and multi-modal distributions and employ stochastic optimization algorithms to enhance scalability. Extensive numerical experiments are presented that compare the OT method to the SIR particle filter and the ensemble Kalman filter, evaluating the performance in terms of sample efficiency, high-dimensional scalability, and the ability to capture complex and multi-modal distributions.
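The weight degeneracy issue mentioned in the abstract can be illustrated with a minimal sketch of one SIR step. This is a generic textbook version, not the authors' code: the scalar random-walk dynamics, the Gaussian sensor model, and the helper names (`sir_step`, `random_walk`, `gaussian_likelihood`, `demo_ess`) are all assumptions made for illustration. The effective sample size (ESS) is used here as a common diagnostic of how many particles carry meaningful weight.

```python
import math
import random


def sir_step(particles, observation, transition, likelihood, rng):
    """One sequential importance resampling (SIR) step.

    particles  : list of state samples approximating the current posterior
    transition : function (state, rng) -> sample of the next state
    likelihood : function (observation, state) -> non-negative weight
    Returns (resampled particles, effective sample size before resampling).
    """
    # Propagate every particle through the dynamics (samples from the prior).
    predicted = [transition(x, rng) for x in particles]
    # Weight each predicted particle by the likelihood of the observation.
    weights = [likelihood(observation, x) for x in predicted]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("all weights are zero: complete weight degeneracy")
    weights = [w / total for w in weights]
    # ESS is close to N for near-uniform weights and close to 1 when a
    # single particle dominates.
    ess = 1.0 / sum(w * w for w in weights)
    # Multinomial resampling according to the normalized weights.
    resampled = rng.choices(predicted, weights=weights, k=len(predicted))
    return resampled, ess


def random_walk(x, rng):
    # Hypothetical dynamics: scalar random walk with unit-variance noise.
    return x + rng.gauss(0.0, 1.0)


def gaussian_likelihood(sigma):
    # Hypothetical sensor: observation = state + N(0, sigma^2) noise.
    # The density is left unnormalized; the constant cancels in sir_step.
    def density(y, x):
        return math.exp(-0.5 * ((y - x) / sigma) ** 2)
    return density


def demo_ess(sigma, n=500, seed=0):
    # Same seed for every call, so only the likelihood width changes.
    rng = random.Random(seed)
    particles = [rng.gauss(0.0, 1.0) for _ in range(n)]
    _, ess = sir_step(particles, 0.5, random_walk, gaussian_likelihood(sigma), rng)
    return ess


ess_broad = demo_ess(sigma=1.0)    # informative but broad likelihood
ess_sharp = demo_ess(sigma=0.01)   # nearly degenerate likelihood
print(ess_broad, ess_sharp)
```

With the sharper likelihood, far fewer particles receive non-negligible weight, so the ESS drops sharply. Note also that `sir_step` needs the likelihood in closed form, which is the requirement the abstract says the OT formulation avoids.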

algorithm, nonlinear filtering, particle, (13 more...)

arXiv.org Artificial Intelligence

2310.13886

Country:

North America > United States > Washington > King County > Seattle (0.04)
Europe > France (0.04)

Genre:

Research Report > New Finding (0.46)
Instructional Material > Course Syllabus & Notes (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Computing Education in the Era of Generative AI

Communications of the ACM | Feb-1-2024, 05:00:00 GMT

Challenges and opportunities faced by computing educators and students adapting to LLMs capable of generating accurate source code from natural-language problem descriptions.

computing machinery, explanation, student, (15 more...)

Communications of the ACM

Country:

Oceania > New Zealand > North Island > Auckland Region > Auckland (0.05)
North America > United States > Texas > Taylor County > Abilene (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Europe > Finland (0.04)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (0.94)

Industry: Education > Curriculum > Subject-Specific Education (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.52)

Design and consensus content validity of the questionnaire for b-learning education: A 2-Tuple Fuzzy Linguistic Delphi based Decision Support Tool

Montes, Rosana, Zuheros, Cristina, Morales, Jeovani M., Zermeño, Noe, Duran, Jerónimo, Herrera, Francisco

arXiv.org Artificial Intelligence | Feb-1-2024

Classic Delphi and Fuzzy Delphi methods are used to test the content validity of data collection tools such as questionnaires. Fuzzy Delphi takes the opinions issued by judges from a linguistic perspective, reducing ambiguity in opinions by using fuzzy numbers. We propose an extension named the 2-Tuple Fuzzy Linguistic Delphi method to deal with scenarios in which judges show different degrees of expertise, by using fuzzy multigranular semantics of the linguistic terms, and to obtain intermediate and final results expressed by 2-tuple linguistic values. The key idea of our proposal is to validate the full questionnaire by means of the evaluation of its parts, defining the validity of each item as a Decision Making problem. Taking the opinion of experts, we measure the degree of consensus, the degree of consistency, and the linguistic score of each item, in order to detect those items that affect, positively or negatively, the quality of the instrument. Considering the real need to evaluate a b-learning educational experience with a consensual questionnaire, we present a Decision Making model for questionnaire validation that addresses this need. Additionally, we contribute to this consensus reaching problem by developing an online tool under the GPL v3 license. The software visualizes the collective valuations for each iteration and helps determine which parts of the questionnaire should be modified to reach a consensual solution.
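To make the "2-tuple linguistic values" in the abstract concrete, here is a minimal sketch of the standard 2-tuple linguistic representation, in which a numeric value β on a scale of g+1 terms is written as a pair (s_i, α), with i the nearest term index and α = β − i the symbolic translation in [−0.5, 0.5). The five-term scale and the function names are hypothetical, and the sketch uses a single scale with a plain arithmetic mean; it does not reproduce the multigranular semantics or the consensus measures of the paper.

```python
import math

# Hypothetical five-term linguistic scale s_0 .. s_4.
TERMS = ["very low", "low", "medium", "high", "very high"]


def to_two_tuple(beta, terms=TERMS):
    """Map beta in [0, g] to (label, alpha) with alpha in [-0.5, 0.5)."""
    g = len(terms) - 1
    if not 0 <= beta <= g:
        raise ValueError("beta must lie in [0, g]")
    # floor(beta + 0.5) rounds halves upward, which keeps alpha in [-0.5, 0.5).
    index = min(int(math.floor(beta + 0.5)), g)
    return terms[index], beta - index


def from_two_tuple(label, alpha, terms=TERMS):
    """Inverse mapping: recover the numeric value beta from (label, alpha)."""
    return terms.index(label) + alpha


def mean_two_tuple(opinions, terms=TERMS):
    """Aggregate 2-tuples by averaging their numeric equivalents."""
    beta = sum(from_two_tuple(lab, a, terms) for lab, a in opinions) / len(opinions)
    return to_two_tuple(beta, terms)


# Three judges rate one questionnaire item.
collective = mean_two_tuple([("low", 0.0), ("medium", 0.0), ("medium", 0.0)])
print(collective)

# Two judges who disagree by one term.
split = mean_two_tuple([("high", 0.0), ("medium", 0.0)])
print(split)
```

The point of the representation is that the aggregated opinion stays on the original linguistic scale: the label names the closest term and α records how far the collective value sits from it, so no information is lost to rounding.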

2-tuple fuzzy linguistic delphi method, delphi method, questionnaire, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.asoc.2023.110755

2402.01775

Country:

South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)
Europe > Spain > Andalusia > Granada Province > Granada (0.04)
Asia > Pakistan (0.04)
(2 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Health & Medicine (1.00)
Energy (0.67)
Education > Educational Setting (0.46)

Technology:

Information Technology > Decision Support Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications > Collaboration (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)

Extending Interactive Science Exhibits into the Classroom using Anthropomorphized Chatbots and Bloom's Taxonomy

Golding, Yousuf

arXiv.org Artificial Intelligence | Feb-1-2024

This study explores the use of Generative AI chatbots for transforming public science exhibits into virtual experiences that can extend the engagement of exhibits into the classroom. The broader goal is to increase accessibility of science exhibits, especially for those marginalized in STEM due to various factors, including cultural barriers. We hypothesize that turning exhibits into first-person anthropomorphized chatbots with a personality, like quirky-talking asteroids or comets, can increase engagement and learning. The paper mainly explores if such techniques are possible using Generative AI (e.g. GPT) via prompt engineering alone. The research includes an investigation into the possibility of integrating interactive assessment via question-generation using Bloom's Taxonomy. Initial results indicate that it is possible to combine these techniques. As such, it lays a foundation for future classroom evaluations of such chatbots to gauge their overall efficacy in extending the reach of science exhibitions. The paper concludes by discussing extensions of the research to fully evaluate effectiveness in virtual field-trips. We also include a brief examination of additional ways to enhance student motivation towards learning via chatbots.

bloom, chatbot, student, (14 more...)

arXiv.org Artificial Intelligence

2402.0177

Country:

Asia > Singapore (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Instructional Material (1.00)
Research Report > New Finding (0.69)

Industry:

Education > Curriculum > Subject-Specific Education (0.47)
Education > Educational Setting > K-12 Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.70)

Distilling Conditional Diffusion Models for Offline Reinforcement Learning through Trajectory Stitching

Li, Shangzhe, Zhang, Xinhua

arXiv.org Artificial Intelligence | Feb-1-2024

Deep generative models have recently emerged as an effective approach to offline reinforcement learning. However, their large model size poses challenges in computation. We address this issue by proposing a knowledge distillation method based on data augmentation. In particular, high-return trajectories are generated from a conditional diffusion model, and they are blended with the original trajectories through a novel stitching algorithm that leverages a new reward generator. Applying the resulting dataset to behavioral cloning, the learned shallow policy whose size is much smaller outperforms or nearly matches deep generative planners on several D4RL benchmarks.

distilling conditional diffusion model, offline reinforcement learning, trajectory, (10 more...)

arXiv.org Artificial Intelligence

2402.00807

Country:

North America > United States > Montana (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(2 more...)

Genre:

Research Report (0.50)
Instructional Material (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Online Distribution Learning with Local Private Constraints

Sima, Jin, Wu, Changlong, Milenkovic, Olgica, Szpankowski, Wojciech

arXiv.org Artificial Intelligence | Jan-31-2024

We study the problem of online conditional distribution estimation with \emph{unbounded} label sets under local differential privacy. Let $\mathcal{F}$ be a distribution-valued function class with unbounded label set. We aim at estimating an \emph{unknown} function $f\in \mathcal{F}$ in an online fashion so that at time $t$ when the context $\boldsymbol{x}_t$ is provided we can generate an estimate of $f(\boldsymbol{x}_t)$ under KL-divergence knowing only a privatized version of the true labels sampled from $f(\boldsymbol{x}_t)$. The ultimate objective is to minimize the cumulative KL-risk over a finite horizon $T$. We show that under $(\epsilon,0)$-local differential privacy of the privatized labels, the KL-risk grows as $\tilde{\Theta}(\frac{1}{\epsilon}\sqrt{KT})$ up to poly-logarithmic factors where $K=|\mathcal{F}|$. This is in stark contrast to the $\tilde{\Theta}(\sqrt{T\log K})$ bound demonstrated by Wu et al. (2023a) for bounded label sets. As a byproduct, our results recover a nearly tight upper bound for the hypothesis selection problem of Gopi et al. (2020) established only for the batch setting.
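As a reminder of what "privatized labels" under $(\epsilon,0)$-local differential privacy can look like, here is a minimal sketch of k-ary randomized response, a classical mechanism for a finite label set. It is deliberately a simpler setting than the abstract's: the paper treats unbounded label sets, and this sketch is not the authors' mechanism or estimator. The function names and the three-label example are assumptions for illustration.

```python
import math
import random


def k_rr_probs(epsilon, k):
    """Return (p, q): probability of reporting the true label, and of
    reporting any one specific other label, under k-ary randomized response."""
    denom = math.exp(epsilon) + k - 1
    return math.exp(epsilon) / denom, 1.0 / denom


def privatize(label, labels, epsilon, rng):
    """Release a randomized version of `label`; requires len(labels) >= 2."""
    p, _ = k_rr_probs(epsilon, len(labels))
    if rng.random() < p:
        return label
    # Otherwise report one of the other labels uniformly at random.
    return rng.choice([v for v in labels if v != label])


def estimate_frequencies(reports, labels, epsilon):
    """Debias observed report frequencies into estimates of the true ones."""
    p, q = k_rr_probs(epsilon, len(labels))
    n = len(reports)
    return {v: (reports.count(v) / n - q) / (p - q) for v in labels}


# Hypothetical example: three labels with true frequencies 0.7, 0.2, 0.1.
labels = ["a", "b", "c"]
epsilon = 1.0
rng = random.Random(0)
true_labels = rng.choices(labels, weights=[0.7, 0.2, 0.1], k=20000)
reports = [privatize(y, labels, epsilon, rng) for y in true_labels]
estimates = estimate_frequencies(reports, labels, epsilon)
print(estimates)
```

The ratio p/q equals $e^{\epsilon}$, which is exactly the bound that $(\epsilon,0)$-local differential privacy places on how much any single report can favor one true label over another. The debiasing step divides by p − q, which shrinks as ε decreases, so the estimate becomes noisier under stronger privacy.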

algorithm, online distribution learning, privacy, (14 more...)

arXiv.org Artificial Intelligence

2402.00315

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain (0.04)

Genre:

Instructional Material > Online (0.41)
Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends

Lan, Yunshi, Li, Xinyuan, Du, Hanyue, Lu, Xuesong, Gao, Ming, Qian, Weining, Zhou, Aoying

arXiv.org Artificial Intelligence | Jan-31-2024

Natural Language Processing (NLP) aims to analyze text using techniques from computer science. It serves applications in the healthcare, commerce, and education domains. In particular, NLP has been applied to the education domain to support teaching and learning. In this survey, we review recent advances in NLP with a focus on solving problems related to the education domain. In detail, we begin by introducing the relevant background. Then, we present the taxonomy of NLP in the education domain. Next, we illustrate the task definitions, challenges, and corresponding techniques based on the above taxonomy. After that, we showcase some off-the-shelf demonstrations in this domain and conclude with future directions.

arxiv preprint arxiv, dataset, question generation, (13 more...)

arXiv.org Artificial Intelligence

2401.07518

Country: Asia > China > Shanghai > Shanghai (0.04)

Genre:

Research Report (1.00)
Overview (1.00)
Instructional Material (1.00)

Industry:

Education > Educational Setting > Online (1.00)
Education > Curriculum > Subject-Specific Education (1.00)
Education > Assessment & Standards > Student Performance (0.95)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)
(2 more...)

AIhub monthly digest: January 2024 – closed-loop robot planning, crowdsourced clustering, and trustworthiness in GPT models

AIHub | Jan-30-2024, 11:02:05 GMT

We start 2024 with a packed monthly digest, where you can catch up with any AIhub stories you may have missed, peruse the latest news, recap recent events, and more. This month, we continue our coverage of NeurIPS, meet the first interviewee in our AAAI Doctoral Consortium series, and find out how to build AI openly. The AAAI/SIGAI Doctoral Consortium provides an opportunity for a group of PhD students to discuss and explore their research interests and career objectives in an interdisciplinary workshop together with a panel of established researchers. Over the course of the next few months, we'll be meeting the participants and finding out more about their work, PhD life, and their future research plans. In the first interview of the series, Changhoon Kim told us about his research on enhancing the reliability of image generative AI.

machine learning, monthly digest, natural language, (10 more...)

AIHub

Genre: Instructional Material > Course Syllabus & Notes (0.53)

Industry: Energy > Renewable > Geothermal > Geothermal Energy Systems and Facilities > Geothermal System for Power Generation > Advanced Geothermal System (AGS) (0.41)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
