length
- North America > United States (0.04)
- Europe > Finland > Uusimaa > Helsinki (0.04)
CoDocBench: A Dataset for Code-Documentation Alignment in Software Maintenance
Pai, Kunal, Devanbu, Premkumar, Ahmed, Toufique
One of the central tasks in software maintenance is being able to understand and develop code changes. Thus, given a natural language description of the desired new operation of a function, an agent (human or AI) might be asked to generate the set of edits to that function to implement the desired new operation; likewise, given a set of edits to a function, an agent might be asked to generate a changed description, of that function's new workings. Thus, there is an incentive to train a neural model for change-related tasks. Motivated by this, we offer a new, "natural", large dataset of coupled changes to code and documentation mined from actual high-quality GitHub projects, where each sample represents a single commit where the code and the associated docstring were changed together. We present the methodology for gathering the dataset, and some sample, challenging (but realistic) tasks where our dataset provides opportunities for both learning and evaluation. We find that current models (specifically Llama-3.1 405B, Mixtral 8$\times$22B) do find these maintenance-related tasks challenging.
A Hybrid Data-Driven Approach For Analyzing And Predicting Inpatient Length Of Stay In Health Centre
Chowdhury, Tasfia Noor, Mou, Sanjida Afrin, Rahman, Kazi Naimur
Patient length of stay (LoS) is a critical metric for evaluating the efficacy of hospital management. The primary objectives encompass to improve efficiency and reduce costs while enhancing patient outcomes and hospital capacity within the patient journey. By seamlessly merging data-driven techniques with simulation methodologies, the study proposes an all-encompassing framework for the optimization of patient flow. Using a comprehensive dataset of 2.3 million de-identified patient records, we analyzed demographics, diagnoses, treatments, services, costs, and charges with machine learning models (Decision Tree, Logistic Regression, Random Forest, Adaboost, LightGBM) and Python tools (Spark, AWS clusters, dimensionality reduction). Our model predicts patient length of stay (LoS) upon admission using supervised learning algorithms. This hybrid approach enables the identification of key factors influencing LoS, offering a robust framework for hospitals to streamline patient flow and resource utilization. The research focuses on patient flow, corroborating the efficacy of the approach, illustrating decreased patient length of stay within a real healthcare environment. The findings underscore the potential of hybrid data-driven models in transforming hospital management practices. This innovative methodology provides generally flexible decision-making, training, and patient flow enhancement; such a system could have huge implications for healthcare administration and overall satisfaction with healthcare.
- North America > United States > New York (0.04)
- Asia > India > Rajasthan > Jaipur (0.04)
- Asia > China > Hubei Province > Wuhan (0.04)
- Asia > Bangladesh (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.88)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Exploratory Data Analysis
Exploratory Data Analysis (EDA) is an approach used by data scientists to analyze datasets and summarize their main characteristics, with the help of data visualization methods. It helps data scientists to discover patterns, and economic trends, test a hypothesis or check assumptions. The main purpose of EDA is to help look at data before making any assumptions. It can help identify the trends, patterns, and relationships within the data. Data scientists can use exploratory analysis to ensure the results they produce are valid and applicable to any desired business outcomes and goals.
The best co-op games for PC, Nintendo Switch, PS5 and more
Online multiplayer has become part and parcel with many video games these days, but finding something you can play on the couch with a loved one has gotten tougher. If you're looking for some cooperative fun, though, we can help. Below are 25 of the best couch co-op games we've played across the Nintendo Switch, PlayStation, Xbox and PC. Note that we're focusing on genuine co-op experiences, not games that have local multiplayer but aren't truly cooperative in practice. Even still, our list encompasses everything from platformers and puzzlers to RPGs and arcade shooters. You know the broad strokes of any Super Mario game by now. Within the series, though, 3D World stands out for using a largely fixed camera and levels that are more semi-3D than the totally open spaces of games like Super Mario Odyssey or Super Mario Galaxy.
Discrete Convolution and Сross-Correlation. Theory and Implementation in Python
What are convolution and cross-correlation? How can we implement and apply them? Often we need to express the interaction between two processes represented by some functions. For example, let us find out how the amount of snow is changed over time in a given area. But snow does melt and it depends not only on the current total amount, but also it depends on the time when this snow fell.
i-Sim2Real: Reinforcement Learning of Robotic Policies in Tight Human-Robot Interaction Loops
Abeyruwan, Saminda, Graesser, Laura, D'Ambrosio, David B., Singh, Avi, Shankar, Anish, Bewley, Alex, Jain, Deepali, Choromanski, Krzysztof, Sanketi, Pannag R.
Sim-to-real transfer is a powerful paradigm for robotic reinforcement learning. The ability to train policies in simulation enables safe exploration and large-scale data collection quickly at low cost. However, prior works in sim-to-real transfer of robotic policies typically do not involve any human-robot interaction because accurately simulating human behavior is an open problem. In this work, our goal is to leverage the power of simulation to train robotic policies that are proficient at interacting with humans upon deployment. But there is a chicken and egg problem -- how to gather examples of a human interacting with a physical robot so as to model human behavior in simulation without already having a robot that is able to interact with a human? Our proposed method, Iterative-Sim-to-Real (i-S2R), attempts to address this. i-S2R bootstraps from a simple model of human behavior and alternates between training in simulation and deploying in the real world. In each iteration, both the human behavior model and the policy are refined. For all training we apply a new evolutionary search algorithm called Blackbox Gradient Sensing (BGS). We evaluate our method on a real world robotic table tennis setting, where the objective for the robot is to play cooperatively with a human player for as long as possible. Table tennis is a high-speed, dynamic task that requires the two players to react quickly to each other's moves, making for a challenging test bed for research on human-robot interaction. We present results on an industrial robotic arm that is able to cooperatively play table tennis with human players, achieving rallies of 22 successive hits on average and 150 at best. Further, for 80% of players, rally lengths are 70% to 175% longer compared to the sim-to-real plus fine-tuning (S2R+FT) baseline. For videos of our system in action, please see https://sites.google.com/view/is2r.
- Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
- Oceania > Australia > Queensland > Brisbane (0.04)
- North America > United States > Oregon (0.04)
- (8 more...)
- Research Report > Experimental Study (0.67)
- Research Report > New Finding (0.46)
Teaching Algorithmic Reasoning via In-context Learning
Zhou, Hattie, Nova, Azade, Larochelle, Hugo, Courville, Aaron, Neyshabur, Behnam, Sedghi, Hanie
Large language models (LLMs) have shown increasing in-context learning capabilities through scaling up model and data size. Despite this progress, LLMs are still unable to solve algorithmic reasoning problems. While providing a rationale with the final answer has led to further improvements in multi-step reasoning problems, Anil et al. 2022 showed that even simple algorithmic reasoning tasks such as parity are far from solved. In this work, we identify and study four key stages for successfully teaching algorithmic reasoning to LLMs: (1) formulating algorithms as skills, (2) teaching multiple skills simultaneously (skill accumulation), (3) teaching how to combine skills (skill composition) and (4) teaching how to use skills as tools. We show that it is possible to teach algorithmic reasoning to LLMs via in-context learning, which we refer to as algorithmic prompting. We evaluate our approach on a variety of arithmetic and quantitative reasoning tasks, and demonstrate significant boosts in performance over existing prompting techniques. In particular, for long parity, addition, multiplication and subtraction, we achieve an error reduction of approximately 10x, 9x, 5x and 2x respectively compared to the best available baselines.
- Workflow (0.93)
- Research Report (0.81)
Geometric Mean Classifier for IRIS Dataset
Before you try the algorithm, do your pre-processing steps. Randomize the dataset and then split it into training and test dataset. You can do a 2/3 and 1/3rd split for training and test respectively. If the geometric mean of a row in the test data falls within the range, select that class as the label. The next part will show tuning.