Goto

Collaborating Authors

 Generative AI


Generative AI for the Optimization of Next-Generation Wireless Networks: Basics, State-of-the-Art, and Open Challenges

arXiv.org Artificial Intelligence

Next-generation (xG) wireless networks, with their complex and dynamic nature, present significant challenges to using traditional optimization techniques. Generative AI (GAI) emerges as a powerful tool due to its unique strengths. Unlike traditional optimization techniques and other machine learning methods, GAI excels at learning from real-world network data, capturing its intricacies. This enables safe, offline exploration of various configurations and generation of diverse, unseen scenarios, empowering proactive, data-driven exploration and optimization for xG networks. Additionally, GAI's scalability makes it ideal for large-scale xG networks. This paper surveys how GAI-based models unlock optimization opportunities in xG wireless networks. We begin by providing a review of GAI models and some of the major communication paradigms of xG (e.g., 6G) wireless networks. We then delve into exploring how GAI can be used to improve resource allocation and enhance overall network performance. Additionally, we briefly review the networking requirements for supporting GAI applications in xG wireless networks. The paper further discusses the key challenges and future research directions in leveraging GAI for network optimization. Finally, a case study demonstrates the application of a diffusion-based GAI model for load balancing, carrier aggregation, and backhauling optimization in non-terrestrial networks, a core technology of xG networks. This case study serves as a practical example of how the combination of reinforcement learning and GAI can be implemented to address real-world network optimization problems.


Generative AI: The power of the new education

arXiv.org Artificial Intelligence

The effective integration of generative artificial intelligence in education is a fundamental aspect to prepare future generations. This study proposes an accelerated learning methodology in artificial intelligence, focused on its generative capacity, as a way to achieve this goal. It recognizes the challenge of getting teachers to engage with new technologies and adapt their methods in all subjects, not just those related to AI. This methodology not only promotes interest in science, technology, engineering and mathematics, but also facilitates student understanding of the ethical uses and risks associated with AI. Students' perceptions of generative AI are examined, addressing their emotions towards its evolution, evaluation of its ethical implications, and everyday use of AI tools. In addition, AI applications commonly used by students and their integration into other disciplines are investigated. The study aims to provide educators with a deeper understanding of students' perceptions of AI and its relevance in society and in their future career paths.


Text-to-Model: Text-Conditioned Neural Network Diffusion for Train-Once-for-All Personalization

arXiv.org Artificial Intelligence

Generative artificial intelligence (GenAI) has made significant progress in understanding world knowledge and generating content from human languages across various modalities, like text-to-text large language models, text-to-image stable diffusion, and text-to-video Sora. While in this paper, we investigate the capability of GenAI for text-to-model generation, to see whether GenAI can comprehend hyper-level knowledge embedded within AI itself parameters. Specifically, we study a practical scenario termed train-once-for-all personalization, aiming to generate personalized models for diverse end-users and tasks using text prompts. Inspired by the recent emergence of neural network diffusion, we present Tina, a text-conditioned neural network diffusion for train-once-for-all personalization. Tina leverages a diffusion transformer model conditioned on task descriptions embedded using a CLIP model. Despite the astronomical number of potential personalized tasks (e.g., $1.73\times10^{13}$), by our design, Tina demonstrates remarkable in-distribution and out-of-distribution generalization even trained on small datasets ($\sim 1000$). We further verify whether and how \Tina understands world knowledge by analyzing its capabilities under zero-shot/few-shot image prompts, different numbers of personalized classes, prompts of natural language descriptions, and predicting unseen entities.


Meanings and Feelings of Large Language Models: Observability of Latent States in Generative AI

arXiv.org Artificial Intelligence

We tackle the question of whether Large Language Models (LLMs), viewed as dynamical systems with state evolving in the embedding space of symbolic tokens, are observable. That is, whether there exist multiple 'mental' state trajectories that yield the same sequence of generated tokens, or sequences that belong to the same Nerode equivalence class ('meaning'). If not observable, mental state trajectories ('experiences') evoked by an input ('perception') or by feedback from the model's own state ('thoughts') could remain self-contained and evolve unbeknown to the user while being potentially accessible to the model provider. Such "self-contained experiences evoked by perception or thought" are akin to what the American Psychological Association (APA) defines as 'feelings'. Beyond the lexical curiosity, we show that current LLMs implemented by autoregressive Transformers cannot have 'feelings' according to this definition: The set of state trajectories indistinguishable from the tokenized output is a singleton. But if there are 'system prompts' not visible to the user, then the set of indistinguishable trajectories becomes non-trivial, and there can be multiple state trajectories that yield the same verbalized output. We prove these claims analytically, and show examples of modifications to standard LLMs that engender such 'feelings.' Our analysis sheds light on possible designs that would enable a model to perform non-trivial computation that is not visible to the user, as well as on controls that the provider of services using the model could take to prevent unintended behavior.


OpenAI Just Gave Away the Entire Game

The Atlantic - Technology

If you're looking to understand the philosophy that underpins Silicon Valley's latest gold rush, look no further than OpenAI's Scarlett Johansson debacle. The story, according to Johansson's lawyers, goes like this: Nine months ago, OpenAI CEO Sam Altman approached the actor with a request to license her voice for a new digital assistant; Johansson declined. She alleges that just two days before the company's keynote event last week, in which that assistant was revealed as part of a new system called GPT-4o, Altman reached out to Johansson's team, urging the actor to reconsider. Johansson and Altman allegedly never spoke, and Johansson allegedly never granted OpenAI permission to use her voice. Nevertheless, the company debuted Sky two days later--a program with a voice many believed was alarmingly similar to Johansson's.


Sam Altman Is Showing Us Who He Really Is

Slate

Yet this month, it antagonized someone much more powerful, and is already retreating just a touch. On Monday evening, Scarlett Johansson issued a statement to NPR's Bobby Allyn about OpenAI's GPT-4o announcement, which the company showcased in a live demonstration just last week. Specifically, the multimodal computer interaction model centered around a voice assistant named Sky, whose timbre really, really resembled ScarJo's. "Last September, I received an offer from Sam Altman, who wanted to hire me to voice the current ChatGPT 4.0 system," Johansson wrote. "After much consideration and for personal reasons, I declined the offer. Nine months later, my friends, family and the general public all noted how much the newest system named'Sky' sounded like me. When I heard the released demo, I was shocked, angered and in disbelief that Mr. Altman would pursue a voice that sounded so eerily similar to mine that my closest friends and news outlets could not tell the difference."


The Scarlett Johansson Dispute Erodes Public Trust In OpenAI

TIME - Tech

Scarlett Johannson has gone to war with OpenAI, and in the battle for public opinion, OpenAI is losing--badly. Last week, OpenAI released an update of its AI chatbot called ChatGPT-4o, which featured a female voice talking to its users. Many people pointed out that the voice, which sometimes seemed to veer into flirtation, was eerily similar to Scarlett Johannson's in the 2013 dystopian sci-fi film Her. OpenAI CEO Sam Altman has long talked about how much the movie inspired the company's products, and even made the connection clear last week by tweeting the title of the movie. But on Monday, Johannson released a statement saying OpenAI had asked her to be the voice of the chatbot, and when she refused, they found a soundalike.


Exactly how stupid was what OpenAI did to Scarlett Johansson?

Washington Post - Technology News

The company said so again last week when it unveiled a chattier ChatGPT that featured the Johansson sound-alike. The same day, OpenAI CEO Sam Altman posted on X a one-word reference to the 2013 movie "Her," in which Johansson was the voice of an emotional companion AI.


Google Taps AI to Show Shoppers How Clothes Fit Different Bodies

WIRED

One of the worst parts of online shopping is trying to figure out whether an item of clothing will actually fit. While some brands have begun hiring models with more diverse body types, the process still often requires a leap of faith--or making a lot of returns. Google announced Tuesday that it's rolling out a new way to tackle the fit guessing problem using generative artificial intelligence. Brands that run ads for women's or men's shirts will now have the ability to show shoppers how the products look on dozens of different real models, without taking additional photos. The new feature means that typing in "eyelet crop top" on Google's search engine, for example, could return an ad with a clickable gallery that shows what an item looks like on women with different skin tones and body types.


AI Is a Black Box. Anthropic Figured Out a Way to Look Inside

WIRED

For the past decade, AI researcher Chris Olah has been obsessed with artificial neural networks. One question in particular engaged him, and has been the center of his work, first at Google Brain, then OpenAI, and today at AI startup Anthropic, where he is a cofounder. "What's going on inside of them?" he says. "We have these systems, we don't know what's going on. That question has become a core concern now that generative AI has become ubiquitous. Large language models like ChatGPT, Gemini, and Anthropic's own Claude have dazzled people with their language prowess and infuriated people with their tendency to make things up. Their potential to solve previously intractable problems enchants techno-optimists. But LLMs are strangers in our midst. Even the people who build them don't know exactly how they work, and massive effort is required to create guardrails to prevent them from churning out bias, misinformation, and even blueprints for deadly chemical weapons. If the people building the models knew what happened inside these "black boxes,'' it would be easier to make them safer.