AITopics | Kelowna

Collaborating Authors

Kelowna

StackRAG Agent: Improving Developer Answers with Retrieval-Augmented Generation

arXiv.org Artificial IntelligenceJun-19-2024

Developers spend much time finding information that is relevant to their questions. Stack Overflow has been the leading resource, and with the advent of Large Language Models (LLMs), generative models such as ChatGPT are used frequently. However, there is a catch in using each one separately. Searching for answers is time-consuming and tedious, as shown by the many tools developed by researchers to address this issue. On the other, using LLMs is not reliable, as they might produce irrelevant or unreliable answers (i.e., hallucination). In this work, we present StackRAG, a retrieval-augmented Multiagent generation tool based on LLMs that combines the two worlds: aggregating the knowledge from SO to enhance the reliability of the generated answers. Initial evaluations show that the generated answers are correct, accurate, relevant, and useful.

agent, stack overflow, stackrag, (15 more...)

arXiv.org Artificial Intelligence

2406.1384

Country:

North America > Canada > British Columbia > Regional District of Central Okanagan > Kelowna (0.14)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Building Artificial Intelligence with Creative Agency and Self-hood

Gabora, Liane, Bach, Joscha

arXiv.org Artificial IntelligenceJun-9-2024

This paper is an invited layperson summary for The Academic of the paper referenced on the last page. We summarize how the formal framework of autocatalytic networks offers a means of modeling the origins of self-organizing, self-sustaining structures that are sufficiently complex to reproduce and evolve, be they organisms undergoing biological evolution, novelty-generating minds driving cultural evolution, or artificial intelligence networks such as large language models. The approach can be used to analyze and detect phase transitions in vastly complex networks that have proven intractable with other approaches, and suggests a promising avenue to building an autonomous, agentic AI self. It seems reasonable to expect that such an autocatalytic AI would possess creative agency akin to that of humans, and undergo psychologically healing -- i.e., therapeutic -- internal transformation through engagement in creative tasks. Moreover, creative tasks would be expected to help such an AI solidify its self-identity.

artificial intelligence, evolution, reaction, (13 more...)

arXiv.org Artificial Intelligence

2407.10978

Country:

North America > Canada > British Columbia > Regional District of Central Okanagan > Kelowna (0.05)
Europe > Portugal > Azores (0.05)
Asia > India (0.05)

Genre: Research Report (0.40)

Industry: Health & Medicine > Therapeutic Area (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.73)
Information Technology > Artificial Intelligence > Natural Language (0.72)
Information Technology > Artificial Intelligence > Cognitive Science (0.48)

Add feedback

Ordinal Mixed-Effects Random Forest

Bergonzoli, Giulia, Rossi, Lidia, Masci, Chiara

arXiv.org Machine LearningJun-5-2024

We propose an innovative statistical method, called Ordinal Mixed-Effect Random Forest (OMERF), that extends the use of random forest to the analysis of hierarchical data and ordinal responses. The model preserves the flexibility and ability of modeling complex patterns of both categorical and continuous variables, typical of tree-based ensemble methods, and, at the same time, takes into account the structure of hierarchical data, modeling the dependence structure induced by the grouping and allowing statistical inference at all data levels. A simulation study is conducted to validate the performance of the proposed method and to compare it to the one of other state-of-the art models. The application of OMERF is exemplified in a case study focusing on predicting students performances using data from the Programme for International Student Assessment (PISA) 2022. The model identifies discriminating student characteristics and estimates the school-effect.

numeric mean, omerf, random forest, (15 more...)

arXiv.org Machine Learning

2406.0313

Country:

Europe > Austria > Vienna (0.14)
North America > United States (0.04)
Europe > Italy > Lombardy > Milan (0.04)
North America > Canada > British Columbia > Regional District of Central Okanagan > Kelowna (0.04)

Genre: Research Report > Promising Solution (0.48)

Industry: Education > Assessment & Standards > Student Performance (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.85)

Add feedback

Understanding the Rare Inflammatory Disease Using Large Language Models and Social Media Data

Xi, Nan Miles, Ji, Hong-Long, Wang, Lin

arXiv.org Artificial IntelligenceMay-12-2024

Sarcoidosis is a rare inflammatory disease characterized by the formation of granulomas in various organs. The disease presents diagnostic and treatment challenges due to its diverse manifestations and unpredictable nature. In this study, we employed a Large Language Model (LLM) to analyze sarcoidosis-related discussions on the social media platform Reddit. Our findings underscore the efficacy of LLMs in accurately identifying sarcoidosis-related content. We discovered a wide array of symptoms reported by patients, with fatigue, swollen lymph nodes, and shortness of breath as the most prevalent. Prednisone was the most prescribed medication, while infliximab showed the highest effectiveness in improving prognoses. Notably, our analysis revealed disparities in prognosis based on age and gender, with women and younger patients experiencing good and polarized outcomes, respectively. Furthermore, unsupervised clustering identified three distinct patient subgroups (phenotypes) with unique symptom profiles, prognostic outcomes, and demographic distributions. Finally, sentiment analysis revealed a moderate negative impact on patients' mental health post-diagnosis, particularly among women and younger individuals. Our study represents the first application of LLMs to understand sarcoidosis through social media data. It contributes to understanding the disease by providing data-driven insights into its manifestations, treatments, prognoses, and impact on patients' lives. Our findings have direct implications for improving personalized treatment strategies and enhancing the quality of care for individuals living with sarcoidosis.

prognosis, sarcoidosis, symptom, (15 more...)

arXiv.org Artificial Intelligence

2405.13005

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Illinois > Cook County > Maywood (0.04)
North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Rheumatology (1.00)
Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)

Add feedback

Empirical Studies of Parameter Efficient Methods for Large Language Models of Code and Knowledge Transfer to R

Esmaeili, Amirreza, Saberi, Iman, Fard, Fatemeh H.

arXiv.org Artificial IntelligenceMar-15-2024

Recently, Large Langauge Models (LLMs) have gained a lot of attention in the Software Engineering (SE) community. LLMs or their variants pre-trained on code are used for many SE tasks. A main approach for adapting LLMs to the downstream task is to fine-tune the models. However, with having billions-parameters-LLMs, fine-tuning the models is not practical. An alternative approach is using Parameter Efficient Fine Tuning (PEFT), in which the model parameters are frozen and only a few added parameters are trained. Though the LLMs are used for programming languages such as Python and Java widely, their capability for low-resource languages is limited. In this work, we empirically study PEFT methods, LoRA and Compacter, on CodeT5 and CodeLlama. We will assess their performance compared to fully fine-tuned models, whether they can be used for knowledge transfer from natural language models to code (using T5 and Llama models), and their ability to adapt the learned knowledge to an unseen language. For the unseen language, we aim to study R, as it has a wide community. The adaptability with less computational costs makes LLMs accessible in scenarios where heavy computational resources are not available. Moreover, studying R opens new opportunities for using LLMs for other languages. We anticipate our findings to showcase the capabilities of PEFT for code LLMs for R and reveal the improvement areas.

dataset, language model, peft method, (12 more...)

arXiv.org Artificial Intelligence

2405.01553

Country:

North America > Canada > British Columbia > Regional District of Central Okanagan > Kelowna (0.14)
Europe > Portugal > Lisbon > Lisbon (0.05)
North America > Dominican Republic (0.04)
(10 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Studying Vulnerable Code Entities in R

Zhao, Zixiao, Das, Millon Madhur, Fard, Fatemeh H.

arXiv.org Artificial IntelligenceFeb-6-2024

Pre-trained Code Language Models (Code-PLMs) have shown many advancements and achieved state-of-the-art results for many software engineering tasks in the past few years. These models are mainly targeted for popular programming languages such as Java and Python, leaving out many other ones like R. Though R has a wide community of developers and users, there is little known about the applicability of Code-PLMs for R. In this preliminary study, we aim to investigate the vulnerability of Code-PLMs for code entities in R. For this purpose, we use an R dataset of code and comment pairs and then apply CodeAttack, a black-box attack model that uses the structure of code to generate adversarial code samples. We investigate how the model can attack different entities in R. This is the first step towards understanding the importance of R token types, compared to popular programming languages (e.g., Java). We limit our study to code summarization. Our results show that the most vulnerable code entity is the identifier, followed by some syntax tokens specific to R. The results can shed light on the importance of token types and help in developing models for code summarization and method name prediction for the R language.

codeattack, dataset, token type, (12 more...)

arXiv.org Artificial Intelligence

2402.04421

Country:

North America > United States > District of Columbia > Washington (0.05)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(9 more...)

Genre: Research Report > New Finding (0.87)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Software > Programming Languages (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Investigating the Efficacy of Large Language Models for Code Clone Detection

Khajezade, Mohamad, Wu, Jie JW, Fard, Fatemeh Hendijani, Rodríguez-Pérez, Gema, Shehata, Mohamed Sami

arXiv.org Artificial IntelligenceJan-30-2024

Large Language Models (LLMs) have demonstrated remarkable success in various natural language processing and software engineering tasks, such as code generation. The LLMs are mainly utilized in the prompt-based zero/few-shot paradigm to guide the model in accomplishing the task. GPT-based models are one of the popular ones studied for tasks such as code comment generation or test generation. These tasks are `generative' tasks. However, there is limited research on the usage of LLMs for `non-generative' tasks such as classification using the prompt-based paradigm. In this preliminary exploratory study, we investigated the applicability of LLMs for Code Clone Detection (CCD), a non-generative task. By building a mono-lingual and cross-lingual CCD dataset derived from CodeNet, we first investigated two different prompts using ChatGPT to detect Type-4 code clones in Java-Java and Java-Ruby pairs in a zero-shot setting. We then conducted an analysis to understand the strengths and weaknesses of ChatGPT in CCD. ChatGPT surpasses the baselines in cross-language CCD attaining an F1-score of 0.877 and achieves comparable performance to fully fine-tuned models for mono-lingual CCD, with an F1-score of 0.878. Also, the prompt and the difficulty level of the problems has an impact on the performance of ChatGPT. Finally we provide insights and future directions based on our initial analysis

chatgpt, code clone detection, dataset, (9 more...)

arXiv.org Artificial Intelligence

2401.13802

Country:

North America > Canada > British Columbia > Regional District of Central Okanagan > Kelowna (0.15)
Europe > Portugal > Lisbon > Lisbon (0.05)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ChatEd: A Chatbot Leveraging ChatGPT for an Enhanced Learning Experience in Higher Education

Wang, Kevin, Ramos, Jason, Lawrence, Ramon

arXiv.org Artificial IntelligenceDec-29-2023

With the rapid evolution of Natural Language Processing (NLP), Large Language Models (LLMs) like ChatGPT have emerged as powerful tools capable of transforming various sectors. Their vast knowledge base and dynamic interaction capabilities represent significant potential in improving education by operating as a personalized assistant. However, the possibility of generating incorrect, biased, or unhelpful answers are a key challenge to resolve when deploying LLMs in an education context. This work introduces an innovative architecture that combines the strengths of ChatGPT with a traditional information retrieval based chatbot framework to offer enhanced student support in higher education. Our empirical evaluations underscore the high promise of this approach.

chatbot, chated, information, (16 more...)

arXiv.org Artificial Intelligence

2401.00052

Country:

North America > Canada > British Columbia > Regional District of Central Okanagan > Kelowna (0.05)
Oceania > Australia (0.04)

Genre:

Instructional Material > Course Syllabus & Notes (1.00)
Research Report (0.64)

Industry:

Education > Educational Setting > Higher Education (0.71)
Education > Educational Technology > Educational Software (0.47)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Random Models for Fuzzy Clustering Similarity Measures

DeWolfe, Ryan, Andrews, Jeffery L.

arXiv.org Machine LearningDec-15-2023

The Adjusted Rand Index (ARI) is a widely used method for comparing hard clusterings, but requires a choice of random model that is often left implicit. Several recent works have extended the Rand Index to fuzzy clusterings, but the assumptions of the most common random model is difficult to justify in fuzzy settings. We propose a single framework for computing the ARI with three random models that are intuitive and explainable for both hard and fuzzy clusterings, along with the benefit of lower computational complexity. The theory and assumptions of the proposed models are contrasted with the existing permutation model. Computations on synthetic and benchmark data show that each model has distinct behaviour, meaning that accurate model selection is important for the reliability of results.

allocation, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

2312.1027

Country:

North America > Canada > British Columbia > Regional District of Central Okanagan > Kelowna (0.04)
Europe > France > Bourgogne-Franche-Comté > Doubs > Besançon (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback

Translating Natural Language Queries to SQL Using the T5 Model

Wong, Albert, Pham, Lien, Lee, Young, Chan, Shek, Sadaya, Razel, Khmelevsky, Youry, Clement, Mathias, Cheng, Florence Wing Yau, Mahony, Joe, Ferri, Michael

arXiv.org Artificial IntelligenceDec-12-2023

This paper presents the development process of a natural language to SQL model using the T5 model as the basis. The models, developed in August 2022 for an online transaction processing system and a data warehouse, have a 73\% and 84\% exact match accuracy respectively. These models, in conjunction with other work completed in the research project, were implemented for several companies and used successfully on a daily basis. The approach used in the model development could be implemented in a similar fashion for other database environments and with a more powerful pre-trained language model.

arxiv preprint arxiv, dataset, query, (12 more...)

arXiv.org Artificial Intelligence

2312.12414

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.15)
North America > Canada > British Columbia > Regional District of Central Okanagan > Kelowna (0.14)
North America > Canada > Ontario > National Capital Region > Ottawa (0.14)
Asia > India > Goa > Panaji (0.04)

Genre:

Overview (1.00)
Research Report (0.87)

Industry: Energy > Power Industry (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback