AITopics | software application

2511.06417

Country: Asia > Singapore (0.04)

Genre: Research Report (0.64)

Industry: Energy (0.34)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.88)

arXiv.org Artificial IntelligenceAug-13-2025

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Sun, Zeyi, Liu, Ziyu, Zang, Yuhang, Cao, Yuhang, Dong, Xiaoyi, Wu, Tong, Lin, Dahua, Wang, Jiaqi

Repurposing large vision-language models (LVLMs) as computer use agents (CUAs) has led to substantial breakthroughs, primarily driven by human-labeled data. However, these models often struggle with novel and specialized software, particularly in scenarios lacking human annotations. To address this challenge, we propose SEAgent, an agentic self-evolving framework enabling CUAs to autonomously evolve through interactions with unfamiliar software. Specifically, SEAgent empowers computer-use agents to autonomously master novel software environments via experiential learning, where agents explore new software, learn through iterative trial-and-error, and progressively tackle auto-generated tasks organized from simple to complex. To achieve this goal, we design a World State Model for step-wise trajectory assessment, along with a Curriculum Generator that generates increasingly diverse and challenging tasks. The agent's policy is updated through experiential learning, comprised of adversarial imitation of failure actions and Group Relative Policy Optimization (GRPO) on successful ones. Furthermore, we introduce a specialist-to-generalist training strategy that integrates individual experiential insights from specialist agents, facilitating the development of a stronger generalist CUA capable of continuous autonomous evolution. This unified agent ultimately achieves performance surpassing ensembles of individual specialist agents on their specialized software. We validate the effectiveness of SEAgent across five novel software environments within OS-World. Our approach achieves a significant improvement of 23.2% in success rate, from 11.3% to 34.5%, over a competitive open-source CUA, i.e., UI-TARS.

arxiv preprint arxiv, large language model, machine learning, (18 more...)

2508.047

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Hong Kong (0.04)

Genre:

Workflow (0.67)
Instructional Material (0.67)
Research Report (0.64)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Daily Mail - Science & techNov-4-2024, 19:59:31 GMT

Microsoft warns of dreaded 'blue screen of death' bug plaguing update - how to know if YOU'RE at risk

Microsoft has confirmed that its new Windows update is causing the blue screen of death for users attempting to install the software. The company issued a warning Friday, saying that its Windows Server 2025 is experiencing several bugs that cause the program to fail or at least three hours to restart. However, the bugs have caused the blue screen of death but Microsoft has said it's working on a fix that should roll out in the coming month. In the meantime, users should take precautions when downloading Windows Server 2025 by checking if your computer would be at risk. To determine if the update will cause the blue screen of death (BSOD), Microsoft encourages users to use the CTRL SHIFT ESC keys to open Windows Task Manager.

artificial intelligence, blue screen, microsoft, (11 more...)

Daily Mail - Science & tech

Technology:

Information Technology > Artificial Intelligence (0.36)
Information Technology > Software (0.33)

Khurana, Anjali, Subramonyam, Hari, Chilana, Parmit K

Why and When LLM-Based Assistants Can Go Wrong: Investigating the Effectiveness of Prompt-Based Interactions for Software Help-Seeking

arXiv.org Artificial IntelligenceFeb-12-2024

Large Language Model (LLM) assistants, such as ChatGPT, have emerged as potential alternatives to search methods for helping users navigate complex, feature-rich software. LLMs use vast training data from domain-specific texts, software manuals, and code repositories to mimic human-like interactions, offering tailored assistance, including step-by-step instructions. In this work, we investigated LLM-generated software guidance through a within-subject experiment with 16 participants and follow-up interviews. We compared a baseline LLM assistant with an LLM optimized for particular software contexts, SoftAIBot, which also offered guidelines for constructing appropriate prompts. We assessed task completion, perceived accuracy, relevance, and trust. Surprisingly, although SoftAIBot outperformed the baseline LLM, our results revealed no significant difference in LLM usage and user perceptions with or without prompt guidelines and the integration of domain context. Most users struggled to understand how the prompt's text related to the LLM's responses and often followed the LLM's suggestions verbatim, even if they were incorrect. This resulted in difficulties when using the LLM's advice for software tasks, leading to low task completion rates. Our detailed analysis also revealed that users remained unaware of inaccuracies in the LLM's responses, indicating a gap between their lack of software expertise and their ability to evaluate the LLM's assistance. With the growing push for designing domain-specific LLM assistants, we emphasize the importance of incorporating explainable, context-aware cues into LLMs to help users understand prompt-based interactions, identify biases, and maximize the utility of LLM assistants.

large language model, machine learning, natural language, (20 more...)

doi: 10.1145/3640543.3645200

2402.0803

Country:

North America > United States > New York > New York County > New York City (0.06)
North America > United States > South Carolina > Greenville County > Greenville (0.05)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(16 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study > Negative Result (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Junior, Roberto Francisco de Lima, Presta, Luiz Fernando Paes de Barros, Borborema, Lucca Santos, da Silva, Vanderson Nogueira, Dahia, Marcio Leal de Melo, Santos, Anderson Carlos Sousa e

A Case Study on Test Case Construction with Large Language Models: Unveiling Practical Insights and Challenges

arXiv.org Artificial IntelligenceDec-21-2023

This paper presents a detailed case study examining the application of Large Language Models (LLMs) in the construction of test cases within the context of software engineering. LLMs, characterized by their advanced natural language processing capabilities, are increasingly garnering attention as tools to automate and enhance various aspects of the software development life cycle. Leveraging a case study methodology, we systematically explore the integration of LLMs in the test case construction process, aiming to shed light on their practical efficacy, challenges encountered, and implications for software quality assurance. The study encompasses the selection of a representative software application, the formulation of test case construction methodologies employing LLMs, and the subsequent evaluation of outcomes. Through a blend of qualitative and quantitative analyses, this study assesses the impact of LLMs on test case comprehensiveness, accuracy, and efficiency. Additionally, delves into challenges such as model interpretability and adaptation to diverse software contexts. The findings from this case study contributes with nuanced insights into the practical utility of LLMs in the domain of test case construction, elucidating their potential benefits and limitations. By addressing real-world scenarios and complexities, this research aims to inform software practitioners and researchers alike about the tangible implications of incorporating LLMs into the software testing landscape, fostering a more comprehensive understanding of their role in optimizing the software development process.

large language model, machine learning, natural language, (18 more...)

2312.12598

Country: North America > United States (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

#artificialintelligenceApr-13-2023, 08:30:02 GMT

Crypto Ai price today, CAI to USD live, marketcap and chart

A detailed Description Of the Project Crypto AI ($CAI), an AI-powered NFT (non-fungible token) generator is a software application that uses artificial intelligence and machine learning algorithms to create unique digital assets that can be sold as NFTs. NFTs are blockchain-based tokens that represent ownership of digital assets, such as artwork, music, videos, or any other type of creative content. The AI-powered NFT generator creates original digital assets by analyzing existing content and patterns in the data to generate new, unique creations. For example, an AI-powered NFT generator could analyze a database of images and use machine learning algorithms to create a new image based on the patterns and styles found in the original data. What is the project about?

artificial intelligence, digital asset, machine learning, (17 more...)

Industry: Banking & Finance > Trading (0.72)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

#artificialintelligenceJan-5-2023, 20:53:55 GMT

How to Become the Data Whisperer

The data whisperer is the function sitting between the business and the technologists. She, or he, are experts in using data analysis to help organizations better understand their customers and make more informed decisions. They have the ability to interpret large amounts of data and transform it into actionable insights that can inform business decisions. They are also skilled at visualizing data in ways that are easy to understand and interpret. They often work closely with marketing and sales teams to help them identify trends in customer behaviors, develop targeted campaigns, and optimize their overall performance.

artificial intelligence, data mining, program manager, (9 more...)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Enterprise Applications > Customer Relationship Management (0.57)
Information Technology > Data Science > Data Mining (0.37)

#artificialintelligenceDec-22-2022, 05:10:50 GMT

RPA In Banking: Use-Cases, Benefits, And Steps To Deploy RPA Solution 2022

RPA-Robotic Process Automation is the most familiar technology in the banking sector. To deliver the best experiences to customers and automate routine tasks, the banking and financial service providers are increasingly deploying RPA applications. Artificial Intelligence (AI) and RPA-enabled virtual banking solutions help banks and financial organizations to optimize the service quality and alter the customer-to-brand interaction ways. In this evolving digital era, RPA-powered mobile apps and enterprise-level software applications are the best assets for banking and financial companies. They help service providers to overcome the challenges that persist in the traditional banking methods, boosting operational efficiency and productivity and remaining on top of the emerging digital world.

application, artificial intelligence, rpa application, (12 more...)

Country: North America > United States (0.05)

Industry: Banking & Finance > Financial Services (0.91)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

#artificialintelligenceNov-14-2022, 14:22:27 GMT

Heard on the Street – 11/14/2022 - insideBIGDATA

Welcome to insideBIGDATA's "Heard on the Street" round-up column! In this regular feature, we highlight thought-leadership commentaries from members of the big data ecosystem. Each edition covers the trends of the day with compelling perspectives that can provide important insights to give you a competitive advantage in the marketplace. We invite submissions with a focus on our favored technology topics areas: big data, data science, machine learning, AI and deep learning. Data is the new oil.

artificial intelligence, data mining, machine learning, (21 more...)

Country: North America > United States (0.04)

Industry:

Information Technology > Security & Privacy (1.00)
Government (1.00)
Banking & Finance > Trading (0.70)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.75)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.51)

#artificialintelligenceNov-10-2022, 08:14:27 GMT

How AI is Improving Cloud Computing for Enterprises - ONLINE LIKE

The first two decades of the 21st century have been marked by exponential advances in technology that were once considered elements of a science fiction movie script. Technologies like Artificial intelligence (AI) and Cloud Computing--have stood the test of time and have become mainstream. In this article, we'll look at what these technologies are and how their combination has been a landscape-changing force in the world of modern technology. Simply put, artificial intelligence is the simulation of human intelligence by machines. The integration of artificial intelligence into business allows it to perceive and observe the environment and generate optimal results accordingly--very similar to how people operate, although much faster.

artificial intelligence, cloud computing, computing, (12 more...)

Industry: Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence (1.00)