generation tool
ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory
Xiao, Yunzhong, Li, Yangmin, Wang, Hewei, Tang, Yunlong, Wang, Zora Zhiruo
Agents utilizing tools powered by large language models (LLMs) or vision-language models (VLMs) have demonstrated remarkable progress in diverse tasks across text and visual modalities. Unlike traditional tools such as calculators, which give deterministic outputs, neural tools perform uncertainly across task scenarios. While different tools for a task may excel in varied scenarios, existing agents typically rely on fixed tools, thus limiting the flexibility in selecting the most suitable tool for specific tasks. In contrast, humans snowball their understanding of the capabilities of different tools by interacting with them, and apply this knowledge to select the optimal tool when solving a future task. To build agents that similarly benefit from this process, we propose ToolMem that enables agents to develop memories of tool capabilities from previous interactions, by summarizing their strengths and weaknesses and storing them in memory; at inference, the agent can retrieve relevant entries from ToolMem, and select the best tool to solve individual tasks more accurately. We evaluate ToolMem on learning varied text generation and text-to-image generation neural tools. Compared to no-memory, generic agents, we find ToolMem-augmented agents predict tool performance 14.8% and 28.7% more accurately across text and multimodal generation scenarios. Moreover, ToolMem facilitates optimal tool selection among multiple choices by 21% and 24% absolute increases in respective scenarios.
Now you can generate images directly from ChatGPT and Sora
OpenAI just announced that all users will soon be able to generate images directly inside of ChatGPT. This will be the default image generation tool in 4o, so there will be no need to open Dall-E whenever you want to whip up a picture of a cat in space eating lasagna or whatever. The company says that the platform will "generate high-quality images based on your prompt, conversation and uploaded files." To the latter point, it'll be able to transform pre-existing images based on prompts. OpenAI is also boasting about significant improvements in text rendering and contextual understanding.
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.92)
Design choices made by LLM-based test generators prevent them from finding bugs
Mathews, Noble Saji, Nagappan, Meiyappan
There is an increasing amount of research and commercial tools for automated test case generation using Large Language Models (LLMs). This paper critically examines whether recent LLM-based test generation tools, such as Codium CoverAgent and CoverUp, can effectively find bugs or unintentionally validate faulty code. Considering bugs are only exposed by failing test cases, we explore the question: can these tools truly achieve the intended objectives of software testing when their test oracles are designed to pass? Using real human-written buggy code as input, we evaluate these tools, showing how LLM-generated tests can fail to detect bugs and, more alarmingly, how their design can worsen the situation by validating bugs in the generated test suite and rejecting bug-revealing tests. These findings raise important questions about the validity of the design behind LLM-based test generation tools and their impact on software quality and test suite reliability.
- North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)
- Asia > Singapore (0.04)
There's no need to hire a creative team when you have this AI generator tool
TL;DR: Have your content creation needs met with AI Magicx's generation tools, now under 100 for life. Are you wondering why your brand is growing as fast as you'd like? It's probably because you don't have a recognizable logo. You may think you need to start hiring a team of creatives, but what if we told you that all you need is AI? You may think AI will only generate bad images or awkward content, but AI Magicx is specifically designed to develop high-quality media.
TextureMeDefect: LLM-based Defect Texture Generation for Railway Components on Mobile Devices
Ferdousi, Rahatara, Hossain, M. Anwar, Saddik, Abdulmotaleb El
Texture image generation has been studied for various applications, including gaming and entertainment. However, context-specific realistic texture generation for industrial applications, such as generating defect textures on railway components, remains unexplored. A mobile-friendly, LLM-based tool that generates fine-grained defect characteristics offers a solution to the challenge of understanding the impact of defects from actual occurrences. We introduce TextureMeDefect, an innovative tool leveraging an LLM-based AI-Inferencing engine. The tool allows users to create realistic defect textures interactively on images of railway components taken with smartphones or tablets. We conducted a multifaceted evaluation to assess the relevance of the generated texture, time, and cost in using this tool on iOS and Android platforms. We also analyzed the software usability score (SUS) across three scenarios. TextureMeDefect outperformed traditional image generation tools by generating meaningful textures faster, showcasing the potential of AI-driven mobile applications on consumer-grade devices.
- North America > Canada > Ontario > National Capital Region > Ottawa (0.04)
- North America > Canada > Ontario > Kingston (0.04)
Sex offender banned from using AI tools in landmark UK case
A sex offender convicted of making more than 1,000 indecent images of children has been banned from using any "AI creating tools" for the next five years in the first known case of its kind. Anthony Dover, 48, was ordered by a UK court "not to use, visit or access" artificial intelligence generation tools without the prior permission of police as a condition of a sexual harm prevention order imposed in February. The ban prohibits him from using tools such as text-to-image generators, which can make lifelike pictures based on a written command, and "nudifying" websites used to make explicit "deepfakes". Dover, who was given a community order and 200 fine, has also been explicitly ordered not to use Stable Diffusion software, which has reportedly been exploited by paedophiles to create hyper-realistic child sexual abuse material, according to records from a sentencing hearing at Poole magistrates court. The case is the latest in a string of prosecutions where AI generation has emerged as an issue and follows months of warnings from charities over the proliferation of AI-generated sexual abuse imagery.
Windows 11 will throttle 'excessive' users of AI as Copilot rolls out
Microsoft has one of the largest and most powerful collections of web servers on the planet. But even it might balk at the thought of a billion or so Windows users hitting data- and processor-intensive generative AI services 24-7. So perhaps it's not surprising that some new language in its online services user license agreement says that Microsoft will employ "temporary throttling of Customer's access to the Microsoft Generative AI service" for excessive use. Exactly what constitutes excessive use of generative AI (which allows a user to create text and images based on specific input, as seen with ChatGPT and DALL-E) is not specified. But as anyone who's tried out these tools knows, it's not an instant process and complex strings of text generation or intricate formatting might take several minutes for a remote server to complete.
A Large-Scale Survey on the Usability of AI Programming Assistants: Successes and Challenges
Liang, Jenny T., Yang, Chenyang, Myers, Brad A.
The software engineering community recently has witnessed widespread deployment of AI programming assistants, such as GitHub Copilot. However, in practice, developers do not accept AI programming assistants' initial suggestions at a high frequency. This leaves a number of open questions related to the usability of these tools. To understand developers' practices while using these tools and the important usability challenges they face, we administered a survey to a large population of developers and received responses from a diverse set of 410 developers. Through a mix of qualitative and quantitative analyses, we found that developers are most motivated to use AI programming assistants because they help developers reduce key-strokes, finish programming tasks quickly, and recall syntax, but resonate less with using them to help brainstorm potential solutions. We also found the most important reasons why developers do not use these tools are because these tools do not output code that addresses certain functional or non-functional requirements and because developers have trouble controlling the tool to generate the desired output. Our findings have implications for both creators and users of AI programming assistants, such as designing minimal cognitive effort interactions with these tools to reduce distractions for users while they are programming.
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
- Europe > Portugal > Lisbon > Lisbon (0.05)
- South America (0.04)
- (5 more...)
Computing Education in the Era of Generative AI
Denny, Paul, Prather, James, Becker, Brett A., Finnie-Ansley, James, Hellas, Arto, Leinonen, Juho, Luxton-Reilly, Andrew, Reeves, Brent N., Santos, Eddie Antonio, Sarsa, Sami
The computing education community has a rich history of pedagogical innovation designed to support students in introductory courses, and to support teachers in facilitating student learning. Very recent advances in artificial intelligence have resulted in code generation models that can produce source code from natural language problem descriptions -- with impressive accuracy in many cases. The wide availability of these models and their ease of use has raised concerns about potential impacts on many aspects of society, including the future of computing education. In this paper, we discuss the challenges and opportunities such models present to computing educators, with a focus on introductory programming classrooms. We summarize the results of two recent articles, the first evaluating the performance of code generation models on typical introductory-level programming problems, and the second exploring the quality and novelty of learning resources generated by these models. We consider likely impacts of such models upon pedagogical practice in the context of the most recent advances at the time of writing.
- North America > United States > New York > New York County > New York City (0.05)
- Oceania > New Zealand > North Island > Auckland Region > Auckland (0.05)
- Europe > Finland (0.04)
- (3 more...)
- Research Report (0.64)
- Instructional Material > Course Syllabus & Notes (0.48)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.95)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.51)
Using Text-to-Image Generation for Architectural Design Ideation
Paananen, Ville, Oppenlaender, Jonas, Visuri, Aku
The recent progress of text-to-image generation has been recognized in architectural design. Our study is the first to investigate the potential of text-to-image generators in supporting creativity during the early stages of the architectural design process. We conducted a laboratory study with 17 architecture students, who developed a concept for a culture center using three popular text-to-image generators: Midjourney, Stable Diffusion, and DALL-E. Through standardized questionnaires and group interviews, we found that image generation could be a meaningful part of the design process when design constraints are carefully considered. Generative tools support serendipitous discovery of ideas and an imaginative mindset, enriching the design process. We identified several challenges of image generators and provided considerations for software development and educators to support creativity and emphasize designers' imaginative mindset. By understanding the limitations and potential of text-to-image generators, architects and designers can leverage this technology in their design process and education, facilitating innovation and effective communication of concepts.
- Europe > Finland > Northern Ostrobothnia > Oulu (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > Finland > Central Finland > Jyväskylä (0.04)
- Oceania > Australia > New South Wales > Sydney (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.68)