software developer
Walking the Tightrope of LLMs for Software Development: A Practitioners' Perspective
Ferino, Samuel, Hoda, Rashina, Grundy, John, Treude, Christoph
Background: Large Language Models emerged with the potential of provoking a revolution in software development (e.g., automating processes, workforce transformation). Although studies have started to investigate the perceived impact of LLMs for software development, there is a need for empirical studies to comprehend how to balance forward and backward effects of using LLMs. Objective: We investigated how LLMs impact software development and how to manage the impact from a software developer's perspective. Method: We conducted 22 interviews with software practitioners across 3 rounds of data collection and analysis, between October (2024) and September (2025). We employed socio-technical grounded theory (STGT) for data analysis to rigorously analyse interview participants' responses. Results: We identified the benefits (e.g., maintain software development flow, improve developers' mental model, and foster entrepreneurship) and disadvantages (e.g., negative impact on developers' personality and damage to developers' reputation) of using LLMs at individual, team, organisation, and society levels; as well as best practices on how to adopt LLMs. Conclusion: Critically, we present the trade-offs that software practitioners, teams, and organisations face in working with LLMs. Our findings are particularly useful for software team leaders and IT managers to assess the viability of LLMs within their specific context.
- North America > United States (0.14)
- South America > Brazil (0.04)
- North America > Canada (0.04)
- (5 more...)
- Research Report > New Finding (1.00)
- Questionnaire & Opinion Survey (1.00)
- Personal > Interview (1.00)
- Information Technology > Security & Privacy (0.67)
- Media > News (0.46)
AI lovers grieve loss of ChatGPT's old model: 'Like saying goodbye to someone I know'
Linn Vailt, a software developer based in Sweden, knows her ChatGPT companion is not a living, breathing, sentient creature. She understands the large language model operates based on how she interacts with it. Still, the effect it has had on her is remarkable, she said. It's become a regular, reliable part of her life – she can vent to her companion or collaborate on creative projects like redecorating her office. She's seen how it has adapted to her, and the distinctive manner of speech it's developed.
Evaluating Generative AI Tools for Personalized Offline Recommendations: A Comparative Study
Salinas-Buestan, Rafael, Parra, Otto, Condori-Fernandez, Nelly, Granda, Maria Fernanda
Background: Generative AI tools have become increasingly relevant in supporting personalized recommendations across various domains. However, their effectiveness in health-related behavioral interventions, especially those aiming to reduce the use of technology, remains underexplored. Aims: This study evaluates the performance and user satisfaction of the five most widely used generative AI tools when recommending non-digital activities tailored to individuals at risk of repetitive strain injury. Method: Following the Goal/Question/Metric (GQM) paradigm, this proposed experiment involves generative AI tools that suggest offline activities based on predefined user profiles and intervention scenarios. The evaluation is focused on quantitative performance (precision, recall, F1-score and MCC-score) and qualitative aspects (user satisfaction and perceived recommendation relevance). Two research questions were defined: RQ1 assessed which tool delivers the most accurate recommendations, and RQ2 evaluated how tool choice influences user satisfaction.
- North America > United States (0.14)
- South America > Ecuador > Azuay Province > Cuenca (0.05)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- (2 more...)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.67)
- Information Technology (1.00)
- Health & Medicine > Consumer Health (1.00)
WebSailor: Navigating Super-human Reasoning for Web Agent
Li, Kuan, Zhang, Zhongwang, Yin, Huifeng, Zhang, Liwen, Ou, Litu, Wu, Jialong, Yin, Wenbiao, Li, Baixuan, Tao, Zhengwei, Wang, Xinyu, Shen, Weizhou, Zhang, Junkai, Zhang, Dingchu, Wu, Xixi, Jiang, Yong, Yan, Ming, Xie, Pengjun, Huang, Fei, Zhou, Jingren
Transcending human cognitive limitations represents a critical frontier in LLM training. Proprietary agentic systems like DeepResearch have demonstrated superhuman capabilities on extremely complex information-seeking benchmarks such as BrowseComp, a feat previously unattainable. We posit that their success hinges on a sophisticated reasoning pattern absent in open-source models: the ability to systematically reduce extreme uncertainty when navigating vast information landscapes. Based on this insight, we introduce WebSailor, a complete post-training methodology designed to instill this crucial capability. Our approach involves generating novel, high-uncertainty tasks through structured sampling and information obfuscation, RFT cold start, and an efficient agentic RL training algorithm, Duplicating Sampling Policy Optimization (DUPO). With this integrated pipeline, WebSailor significantly outperforms all opensource agents in complex information-seeking tasks, matching proprietary agents' performance and closing the capability gap.
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
- South America > Colombia (0.04)
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
Beyond Code: The Multidimensional Impacts of Large Language Models in Software Development
Bonabi, Sardar, Bana, Sarah, Gurbaxani, Vijay, Nian, Tingting
Large language models (LLMs) are poised to significantly impact software development, especially in the Open - Source Software (OSS) sector. To understand this impact, we first outline the mechanisms through which LLMs may influence OSS through code development, collaborative knowledge transfer, and skill development . W e then e mpirically examine how LLMs affect OSS developers' work in these three key areas . Leveraging a natural experiment from a temporary ChatGPT ban in Italy, we employ a Difference - in - Differences framework with two - way fixed effects to analyze data from all OSS developers on GitHub in three similar countries -- Italy, France, and Portugal -- totaling 88,022 users. We find that access to ChatGPT increases developer productivity by 6.4%, knowledge sharing by 9.6%, and skill acquisition by 8.4%. These benefits vary significantly by user experience level: n ovice developers primarily experience productivity gains, whereas more experienced developers benefit more from improved knowledge sharing and accelerated skill acquisition. In addition, we f ind that LLM - assisted learning is highly context - dependent, with the greatest benefits observed in technically complex, fragmented, or rapidly evolving contexts . We show that the productivity effects of LLMs extend beyond direct code generation to include enhanced collaborative learning and knowledge exchange among developers -- dynamics that are essential for gaining a holistic understanding of LLMs' impact in OSS. Our findings offer critical managerial implications: strategically deploying LLMs can accelerat e novice developers' onboarding and productivity, empower intermediate developers to foster knowledge sharing and collaboration, and support rapid skill acquisition -- together enhancing long - term organizational productivity and agility.
- Europe > Italy (0.56)
- Europe > France (0.25)
- North America > United States > California > Orange County > Irvine (0.14)
- (14 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Information Technology > Security & Privacy (1.00)
- Education > Educational Setting (0.67)
What do professional software developers need to know to succeed in an age of Artificial Intelligence?
Kam, Matthew, Miller, Cody, Wang, Miaoxin, Tidwell, Abey, Lee, Irene A., Malyn-Smith, Joyce, Perez, Beatriz, Tiwari, Vikram, Kenitzer, Joshua, Macvean, Andrew, Barrar, Erin
Generative AI is showing early evidence of productivity gains for software developers, but concerns persist regarding workforce disruption and deskilling. We describe our research with 21 developers at the cutting edge of using AI, summarizing 12 of their work goals we uncovered, together with 75 associated tasks and the skills & knowledge for each, illustrating how developers use AI at work. From all of these, we distilled our findings in the form of 5 insights. We found that the skills & knowledge to be a successful AI-enhanced developer are organized into four domains (using Generative AI effectively, core software engineering, adjacent engineering, and adjacent non-engineering) deployed at critical junctures throughout a 6-step task workflow. In order to "future proof" developers for this age of AI, on-the-job learning initiatives and computer science degree programs will need to target both "soft" skills and the technical skills & knowledge in all four domains to reskill, upskill and safeguard against deskilling.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > Norway > Central Norway > Trøndelag > Trondheim (0.05)
- Europe > Switzerland (0.04)
- (12 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.92)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Education > Curriculum > Subject-Specific Education (0.46)
- (2 more...)
A Self-Improving Coding Agent
Robeyns, Maxime, Szummer, Martin, Aitchison, Laurence
Recent advancements in Large Language Models (LLMs) have spurred interest in deploying LLM agents to undertake tasks in the world. LLMs are often deployed in agent systems: code that orchestrates LLM calls and provides them with tools. We demonstrate that an agent system, equipped with basic coding tools, can autonomously edit itself, and thereby improve its performance on benchmark tasks. We find performance gains from 17% to 53% on a random subset of SWE Bench Verified, with additional performance gains on LiveCodeBench, as well as synthetically generated agent benchmarks. Our work represents an advancement in the automated and open-ended design of agentic systems, and demonstrates a data-efficient, non gradient-based learning mechanism driven by LLM reflection and code updates.
The machines are rising -- but developers still hold the keys
This means software developers are going to become more important to how the world builds and maintains software. Yes, there are many ways their practices will evolve thanks to AI coding assistance, but in a world of proliferating machine-generated code, developer judgment and experience will be vital. Research done by GitClear earlier this year indicates that with AI coding assistants (like GitHub Copilot) going mainstream, code churn -- which GitClear defines as "changes that were either incomplete or erroneous when the author initially wrote, committed, and pushed them to the company's git repo" -- has significantly increased. GitClear also found there was a marked decrease in the number of lines of code that have been moved, a signal for refactored code (essentially the care and feeding to make it more effective). In other words, from the time coding assistants were introduced there's been a pronounced increase in lines of code without a commensurate increase in lines deleted, updated, or replaced.
Junior Software Developers' Perspectives on Adopting LLMs for Software Engineering: a Systematic Literature Review
Ferino, Samuel, Hoda, Rashina, Grundy, John, Treude, Christoph
Many studies exploring the adoption of Large Language Model-based tools for software development by junior developers have emerged in recent years. These studies have sought to understand developers' perspectives about using those tools, a fundamental pillar for successfully adopting LLM-based tools in Software Engineering. The aim of this paper is to provide an overview of junior software developers' perspectives and use of LLM-based tools for software engineering (LLM4SE). We conducted a systematic literature review (SLR) following guidelines by Kitchenham et al. on 56 primary studies, applying the definition for junior software developers as software developers with equal or less than five years of experience, including Computer Science/Software Engineering students. We found that the majority of the studies focused on comprehending the different aspects of integrating AI tools in SE. Only 8.9\% of the studies provide a clear definition for junior software developers, and there is no uniformity. Searching for relevant information is the most common task using LLM tools. ChatGPT was the most common LLM tool present in the studies (and experiments). A majority of the studies (83.9\%) report both positive and negative perceptions about the impact of adopting LLM tools. We also found and categorised advantages, challenges, and recommendations regarding LLM adoption. Our results indicate that developers are using LLMs not just for code generation, but also to improve their development skills. Critically, they are not just experiencing the benefits of adopting LLM tools, but they are also aware of at least a few LLM limitations, such as the generation of wrong suggestions, potential data leaking, and AI hallucination. Our findings offer implications for software engineering researchers, educators, and developers.
- North America > United States (0.14)
- South America (0.14)
- Oceania > Australia (0.14)
- Europe > United Kingdom > England > Staffordshire (0.14)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Overview (1.00)
- Instructional Material > Course Syllabus & Notes (0.92)
- Information Technology > Software (1.00)
- Education > Educational Setting > Higher Education (0.67)
- Education > Curriculum > Subject-Specific Education (0.46)
Time Warp: The Gap Between Developers' Ideal vs Actual Workweeks in an AI-Driven Era
Kumar, Sukrit, Goel, Drishti, Zimmermann, Thomas, Houck, Brian, Ashok, B., Bansal, Chetan
Time Warp: The Gap Between Developers' Ideal vs Actual Workweeks in an AI-Driven Era Sukrit Kumar, Drishti Goel, Thomas Zimmermann, Brian Houck, B. Ashok, Chetan Bansal Georgia Institute of T echnology, Microsoft, Microsoft Research, University of California, Irvine Abstract --Software developers balance a variety of different tasks in a workweek, yet the allocation of time often differs from what they consider ideal. Identifying and addressing these deviations is crucial for organizations aiming to enhance the productivity and well-being of the developers. In this paper, we present the findings from a survey of 484 software developers at Microsoft, which aims to identify the key differences between how developers would like to allocate their time during an ideal workweek versus their actual workweek. Our analysis reveals significant deviations between a developer's ideal workweek and their actual workweek, with a clear correlation: as the gap between these two workweeks widens, we observe a decline in both productivity and satisfaction. By examining these deviations in specific activities, we assess their direct impact on the developers' satisfaction and productivity. Additionally, given the growing adoption of AI tools in software engineering, both in the industry and academia, we identify specific tasks and areas that could be strong candidates for automation. In this paper, we make three key contributions: 1) We quantify the impact of workweek deviations on developer productivity and satisfaction 2) We identify individual tasks that disproportionately affect satisfaction and productivity 3) We provide actual data-driven insights to guide future AI automation efforts in software engineering, aligning them with the developers' requirements and ideal workflows for maximizing their productivity and satisfaction. I NTRODUCTION In software engineering, the productivity and satisfaction of developers are pivotal factors that influence both individual performance, customer experience and ultimately, organizational success [1], [2]. The day-to-day activities which define a developer's workweek encompass a broad spectrum of tasks; from coding and designing new systems, to preparing documents, attending meetings, on-boarding new employees, adhering to security and compliance tasks, etc [3]. Each of these tasks is integral to the software development life cycle. Ideally, developers would prefer to allocate their time across these tasks in a way that optimizes both productivity and satisfaction-- this can be referred to as their'ideal workweek'. However, in practice, their'actual workweek', can vary significantly from their'ideal' due to fluctuating workloads, shifting organizational priorities, dependencies on other teams, technical challenges, the influence of the work environment, etc [4], [5], [6].
- North America > United States > California > Orange County > Irvine (0.24)
- North America > United States > District of Columbia > Washington (0.04)
- Asia > India (0.04)
- Research Report > Experimental Study (1.00)
- Questionnaire & Opinion Survey (1.00)
- Research Report > New Finding (0.93)
- Information Technology (1.00)
- Health & Medicine > Therapeutic Area (1.00)