TaskComplexity: A Dataset for Task Complexity Classification with In-Context Learning, FLAN-T5 and GPT-4o Benchmarks

Rasheed, Areeg Fahad, Zarkoosh, M., Abbas, Safa F., Al-Azzawi, Sana Sabah

Sep-30-2024–arXiv.org Artificial Intelligence

This paper addresses the challenge of classifying and assigning programming tasks to experts, a process that typically requires significant effort, time, and cost. To tackle this issue, a novel dataset containing a total of 4,112 programming tasks was created by extracting tasks from various websites. Web scraping techniques were employed to collect this dataset of programming problems systematically. Specific HTML tags were tracked to extract key elements of each issue, including the title, problem description, input-output, examples, problem class, and complexity score. Examples from the dataset are provided in the appendix to illustrate the variety and complexity of tasks included. The dataset's effectiveness has been evaluated and benchmarked using two approaches; the first approach involved fine-tuning the FLAN-T5 small model on the dataset, while the second approach used in-context learning (ICL) with the GPT-4o mini. The performance was assessed using standard metrics: accuracy, recall, precision, and F1-score. The results indicated that in-context learning with GPT-4o-mini outperformed the FLAN-T5 model.

classification, dataset, sabah procedia computer science 00, (13 more...)

arXiv.org Artificial Intelligence

Sep-30-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - Mexico > Mexico City
    - Mexico City (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Norway (0.04)
  - Denmark (0.04)
  - Sweden > Norrbotten County
    - Luleå (0.04)
- Asia
  - Middle East > Iraq
    - Baghdad Governorate > Baghdad (0.05)
    - Karbala Governorate > Karbala (0.04)
  - India > West Bengal
    - Kolkata (0.04)

Genre:
- Research Report (0.50)

Industry:
- Health & Medicine
  - Therapeutic Area (0.47)
  - Diagnostic Medicine (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found