Goto

Collaborating Authors

 Generative AI


OpenAI takes on Meta and DeepSeek with free and customisable AI models

The Guardian

OpenAI is taking on Mark Zuckerberg's Meta and Chinese rival DeepSeek by launching its own freely available artificial intelligence models. The ChatGPT developer has announced two "open weight" large language models, which are free to download and can be customised by developers. Meta's Llama models are available on a similar basis, and OpenAI's move marks a departure from ChatGPT, which is based on a "closed" model that cannot be customised. Sam Altman, OpenAI's chief executive, said the company was excited to add to a stack of freely available AI models "based on democratic values โ€ฆ and for wide benefit". He added: "We're excited to make this model, the result of billions of dollars of research, available to the world to get AI into the hands of the most people possible." OpenAI said the models could underpin an AI agent that operates autonomously, and that they were "designed to be used within agentic workflows".


OpenAI Just Released Its First Open-Weight Models Since GPT-2

WIRED

OpenAI just dropped its first open-weight models in over five years. The two language models, gpt-oss-120b and gpt-oss-20b, can run locally on consumer devices and be fine-tuned for specific purposes. For OpenAI, they represent a shift away from its recent strategy of focusing on proprietary releases, as the company moves towards a wider, and more open, group of AI models that are available for users. "We're excited to make this model, the result of billions of dollars of research, available to the world to get AI into the hands of the most people possible," said OpenAI CEO Sam Altman in an emailed statement. Both gpt-oss-120b and gpt-oss-20b are officially available to download for free on Hugging Face, a popular hosting platform for AI tools.


OpenAI has finally released open-weight language models

MIT Technology Review

"The vast majority of our [enterprise and startup] customers are already using a lot of open models," said Casey Dvorak, a research program manager at OpenAI, in a media briefing about the model release. "Because there is no [competitive] open model from OpenAI, we wanted to plug that gap and actually allow them to use our technology across the board." The new models come in two different sizes, the smaller of which can theoretically run on 16 GB of RAM--the minimum amount that Apple currently offers on its computers. The larger model requires a high-end laptop or specialized hardware. Open models have a few key use cases.


OpenAI stops ChatGPT from telling people to break up with partners

The Guardian

ChatGPT will not tell people to break up with their partner and will encourage users to take breaks from long chatbot sessions, under new changes to the artificial intelligence tool. OpenAI, ChatGPT's developer, said the chatbot would stop giving definitive answers to personal challenges and would instead help people to mull over problems such as potential breakups. "When you ask something like: 'Should I break up with my boyfriend?' ChatGPT shouldn't give you an answer. It should help you think it through โ€“ asking questions, weighing pros and cons," said OpenAI.


ChatGPT now reminds you when it's time for a break

PCWorld

OpenAI has introduced a new feature in ChatGPT that prompts users to take breaks during longer conversations. The reminders appear as pop-up messages like: "You've been chatting for a while -- is it time for a break?" Users can still dismiss the reminder and continue the conversation. OpenAI says that in the future, ChatGPT will also become better at handling sensitive topics and avoid giving direct answers to big personal decisions. Instead, the AI will help users reflect on different options.


The Download: AI agent infrastructure, and OpenAI's ambitions

MIT Technology Review

Anthropic and Google are among the companies and groups working to fix that. Over the past year, they have both introduced protocols that try to define how AI agents should interact with each other and the world around them. If they work as planned, they could give us a crucial part of the infrastructure we need for agents to be useful. Read our story to learn more. OpenAI has given itself a dual mandate: on the one hand, it's a tech giant rooted in products, including of course ChatGPT, which people around the world reportedly send 2.5 billion messages to each day. But its original mission is as a research lab that will not only create "artificial general intelligence" but ensure that it benefits all of humanity.


A glimpse into OpenAI's largest ambitions

MIT Technology Review

As Will points out, there were two recent wins for OpenAI in its efforts to build AI that outcompetes humans. Its models took second place at a top-level coding competition and--alongside those from Google DeepMind--achieved gold-medal-level results in the 2025 International Math Olympiad. People who believe that AI doesn't pose genuine competition to human-level intelligence might actually take some comfort in that. AI is good at the mathematical and analytical, which are on full display in olympiads and coding competitions. That doesn't mean it's any good at grappling with the messiness of human emotions, making hard decisions, or creating art that resonates with anyone. But that distinction--between machine-like reasoning and the ability to think creatively--is not one OpenAI's heads of research are inclined to make.


Interview with Shaghayegh (Shirley) Shajarian: Applying generative AI to computer networks

AIHub

In this interview series, we're meeting some of the AAAI/SIGAI Doctoral Consortium participants to find out more about their research. This time, we hear from Shaghayegh (Shirley) Shajarian and learn about her research applying generative AI to computer networks. I am a third-year PhD student in the Computer Science department at North Carolina A&T State University, working under Dr Sajad Khorsandroo and Dr Mahmoud Abdelsalam. I am part of the Autonomous Cybersecurity and Resilience Lab, where my research focuses on applying generative AI to computer networks. I am developing AI-driven agents that assist with some network operations, such as log analysis, troubleshooting, and documentation.


Enhancing Math Reasoning in Small-sized LLMs via Preview Difficulty-Aware Intervention

arXiv.org Artificial Intelligence

Reinforcement learning scaling enhances the reasoning capabilities of large language models, with reinforcement learning serving as the key technique to draw out complex reasoning. However, key technical details of state-of-the-art reasoning LLMs, such as those in the OpenAI O series, Claude 3 series, DeepMind's Gemini 2.5 series, and Grok 3 series, remain undisclosed, making it difficult for the research community to replicate their reinforcement learning training results. Therefore, we start our study from an Early Preview Reinforcement Learning (EPRLI) algorithm built on the open-source GRPO framework, incorporating difficulty-aware intervention for math problems. Applied to a 1.5B-parameter LLM, our method achieves 50.0% on AIME24, 89.2% on Math500, 77.1% on AMC, 35.3% on Minerva, and 51.9% on OBench, superpass O1-Preview and is comparable to O1-mini within standard school-lab settings.


Diffusion Models for Future Networks and Communications: A Comprehensive Survey

arXiv.org Artificial Intelligence

The rise of Generative AI (GenAI) in recent years has catalyzed transformative advances in wireless communications and networks. Among the members of the GenAI family, Diffusion Models (DMs) have risen to prominence as a powerful option, capable of handling complex, high-dimensional data distribution, as well as consistent, noise-robust performance. In this survey, we aim to provide a comprehensive overview of the theoretical foundations and practical applications of DMs across future communication systems. We first provide an extensive tutorial of DMs and demonstrate how they can be applied to enhance optimizers, reinforcement learning and incentive mechanisms, which are popular approaches for problems in wireless networks. Then, we review and discuss the DM-based methods proposed for emerging issues in future networks and communications, including channel modeling and estimation, signal detection and data reconstruction, integrated sensing and communication, resource management in edge computing networks, semantic communications and other notable issues. We conclude the survey with highlighting technical limitations of DMs and their applications, as well as discussing future research directions.