Celebrating Diversity in Shared Multi-Agent Reinforcement Learning
Recently, deep multi-agent reinforcement learning (MARL) has shown the promise to solve complex cooperative tasks. Its success is partly because of parameter sharing among agents. However, such sharing may lead agents to behave similarly and limit their coordination capacity. In this paper, we aim to introduce diversity in both optimization and representation of shared multi-agent reinforcement learning. Specifically, we propose an information-theoretical regularization to maximize the mutual information between agents' identities and their trajectories, encouraging extensive exploration and diverse individualized behaviors. In representation, we incorporate agent-specific modules in the shared neural network architecture, which are regularized by L1-norm to promote learning sharing among agents while keeping necessary diversity.
Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment
Text-conditioned image generation models often generate incorrect associations between entities and their visual attributes. This reflects an impaired mapping between linguistic binding of entities and modifiers in the prompt and visual binding of the corresponding elements in the generated image. As one example, a query like "a pink sunflower and a yellow flamingo" may incorrectly produce an image of a yellow sunflower and a pink flamingo. To remedy this issue, we propose SynGen, an approach which first syntactically analyses the prompt to identify entities and their modifiers, and then uses a novel loss function that encourages the cross-attention maps to agree with the linguistic binding reflected by the syntax. Specifically, we encourage large overlap between attention maps of entities and their modifiers, and small overlap with other entities and modifier words. The loss is optimized during inference, without retraining or fine-tuning the model. Human evaluation on three datasets, including one new and challenging set, demonstrate significant improvements of SynGen compared with current state of the art methods. This work highlights how making use of sentence structure during inference can efficiently and substantially improve the faithfulness of text-to-image generation.
Waymo voluntarily recalled 1,200 robotaxis
Waymo recently recalled 1,212 of its self-driving taxis, according to the Alphabet-owned company. The recalled cars, which comprised the entirety of the company's fleet at the time, received a software update in November designed to significantly decrease the likelihood that Waymos would collide with stationary objects. Last May, the Department of Transportation's National Highway Traffic Safety Administration (NHTSA) opened an investigation into Waymo for 22 reported incidents in which its AVs collided with objects like gates, chains, and parked vehicles. The cars also appeared to disobey traffic safety control devices. The accidents occurred at low speeds and didn't result in injuries.
Event-3DGS: Event-based 3D Reconstruction Using 3D Gaussian Splatting
Event cameras, offering high temporal resolution and high dynamic range, have brought a new perspective to addressing 3D reconstruction challenges in fast-motion and low-light scenarios. Most methods use the Neural Radiance Field (NeRF) for event-based photorealistic 3D reconstruction. However, these NeRF methods suffer from time-consuming training and inference, as well as limited scene-editing capabilities of implicit representations. To address these problems, we propose Event-3DGS, the first event-based reconstruction using 3D Gaussian splatting (3DGS) for synthesizing novel views freely from event streams. Technically, we first propose an event-based 3DGS framework that directly processes event data and reconstructs 3D scenes by simultaneously optimizing scenario and sensor parameters.
Trump hails growing ties with UAE on last leg of Gulf tour
President Donald Trump has hailed deepening ties between the United States and the United Arab Emirates and said that the latter will invest 1.4 trillion in the former's artificial intelligence sector over the next decade. "I have absolutely no doubt that the relationship will only get bigger and better," Trump said on Thursday at a meeting with UAE President Sheikh Mohamed bin Zayed Al Nahyan, on the final leg of his three-country tour of the Gulf region that saw him strike a series of lucrative tech, business and military deals that he said amounted to 10 trillion. Sheikh Mohammed said the UAE remained "committed to working with the United States to advance peace and stability in our region and globally". The deal with UAE is expected to enable the Gulf country to build data centres vital to developing artificial intelligence models. The countries did not say which AI chips could be included in UAE data centres.
Trump's Computer Chip Deals With Saudi Arabia and UAE Divide US Government
Over the course of a three-day trip to the Middle East, President Trump and his emissaries from Silicon Valley have transformed the Persian Gulf from an artificial-intelligence neophyte into an A.I. power broker. They have reached an enormous deal with the United Arab Emirates to deliver hundreds of thousands of today's most advanced chips from Nvidia annually to build one of the world's largest data center hubs in the region, three people familiar with the talks said. The shipments would begin this year, and include roughly 100,000 chips for G42, an Emirati A.I. firm, with the rest going to U.S. cloud service providers. The administration revealed the agreement on Thursday in an announcement unveiling a new A.I. campus in Abu Dhabi supported by 5 gigawatts of electrical power. It would the largest such project outside of the United States and help U.S. companies serve customers in Africa, Europe and Asia, the administration said.