unified
Orthogonium : A Unified, Efficient Library of Orthogonal and 1-Lipschitz Building Blocks
Boissin, Thibaut, Mamalet, Franck, Lafargue, Valentin, Serrurier, Mathieu
Orthogonal and 1-Lipschitz neural network layers are essential building blocks in robust deep learning architectures, crucial for certified adversarial robustness, stable generative models, and reliable recurrent networks. Despite significant advancements, existing implementations remain fragmented, limited, and computationally demanding. To address these issues, we introduce Orthogonium , a unified, efficient, and comprehensive PyTorch library providing orthogonal and 1-Lipschitz layers. Orthogonium provides access to standard convolution features-including support for strides, dilation, grouping, and transposed-while maintaining strict mathematical guarantees. Its optimized implementations reduce overhead on large scale benchmarks such as ImageNet. Moreover, rigorous testing within the library has uncovered critical errors in existing implementations, emphasizing the importance of standardized and reliable tools. Orthogonium thus significantly lowers adoption barriers, enabling scalable experimentation and integration across diverse applications requiring orthogonality and robust Lipschitz constraints. Orthogonium is available at https://github.com/deel-ai/orthogonium.
A Unified, Scalable Framework for Neural Population Decoding
Our ability to use deep learning approaches to decipher neural activity would likely benefit from greater scale, in terms of both the model size and the datasets. However, the integration of many neural recordings into one unified model is challenging, as each recording contains the activity of different neurons from different individual animals. In this paper, we introduce a training framework and architecture designed to model the population dynamics of neural activity across diverse, large-scale neural recordings. Our method first tokenizes individual spikes within the dataset to build an efficient representation of neural events that captures the fine temporal structure of neural activity. We then employ cross-attention and a PerceiverIO backbone to further construct a latent tokenization of neural population activities. Utilizing this architecture and training framework, we construct a large-scale multi-session model trained on large datasets from seven nonhuman primates, spanning over 158 different sessions of recording from over 27,373 neural units and over 100 hours of recordings. In a number of different tasks, we demonstrate that our pretrained model can be rapidly adapted to new, unseen sessions with unspecified neuron correspondence, enabling few-shot performance with minimal labels. This work presents a powerful new approach for building deep learning tools to analyze neural data and stakes out a clear path to training at scale for neural decoding models.
Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI
Ni, Fei, Zhang, Min, Li, Pengyi, Yuan, Yifu, Zhang, Lingfeng, Liu, Yuecheng, Han, Peilong, Kou, Longxin, Ma, Shaojin, Qiao, Jinbin, Bravo, David Gamaliel Arcos, Wang, Yuening, Hu, Xiao, Zhang, Zhanguang, Yao, Xianze, Li, Yutong, Zhang, Zhao, Wen, Ying, Chen, Ying-Cong, Liang, Xiaodan, Lin, Liang, He, Bin, Bou-Ammar, Haitham, Wang, He, Xu, Huazhe, Deng, Jiankang, Luo, Shan, Jiang, Shuqiang, Pan, Wei, Gao, Yang, Zafeiriou, Stefanos, Peters, Jan, Zhuang, Yuzheng, Zhang, Yingxue, Zheng, Yan, Tang, Hongyao, Hao, Jianye
Embodied AI development significantly lags behind large foundation models due to three critical challenges: (1) lack of systematic understanding of core capabilities needed for Embodied AI, making research lack clear objectives; (2) absence of unified and standardized evaluation systems, rendering cross-benchmark evaluation infeasible; and (3) underdeveloped automated and scalable acquisition methods for embodied data, creating critical bottlenecks for model scaling. To address these obstacles, we present Embodied Arena, a comprehensive, unified, and evolving evaluation platform for Embodied AI. Our platform establishes a systematic embodied capability taxonomy spanning three levels (perception, reasoning, task execution), seven core capabilities, and 25 fine-grained dimensions, enabling unified evaluation with systematic research objectives. We introduce a standardized evaluation system built upon unified infrastructure supporting flexible integration of 22 diverse benchmarks across three domains (2D/3D Embodied Q&A, Navigation, Task Planning) and 30+ advanced models from 20+ worldwide institutes. Additionally, we develop a novel LLM-driven automated generation pipeline ensuring scalable embodied evaluation data with continuous evolution for diversity and comprehensiveness. Embodied Arena publishes three real-time leaderboards (Embodied Q&A, Navigation, Task Planning) with dual perspectives (benchmark view and capability view), providing comprehensive overviews of advanced model capabilities. Especially, we present nine findings summarized from the evaluation results on the leaderboards of Embodied Arena. This helps to establish clear research veins and pinpoint critical research problems, thereby driving forward progress in the field of Embodied AI.
No Universal Prompt: Unifying Reasoning through Adaptive Prompting for Temporal Table Reasoning
Rajgaria, Abhishek, Dixit, Kushagra, Vyas, Mayank, Kalalbandi, Harshavardhan, Roth, Dan, Gupta, Vivek
Temporal Table Reasoning is a critical challenge for Large Language Models (LLMs), requiring effective reasoning to extract relevant insights. Despite existence of multiple prompting methods, their impact on table reasoning remains largely unexplored. Furthermore, model performance varies drastically across different table and context structures, making it difficult to determine an optimal approach. This work investigates multiple prompting technique on diverse table types to determine that performance depends on factors such as entity type, table structure, requirement of additional context and question complexity, with "NO" single method consistently outperforming others. To address this, we introduce SEAR, an adaptive prompting framework inspired by human reasoning that dynamically adjusts to context and integrates structured reasoning. Our results demonstrate that SEAR achieves superior performance across all table types compared to baseline prompting techniques. Additionally, we explore the impact of table structure refactoring, finding that a unified representation enhances model reasoning.
Does Worst-Performing Agent Lead the Pack? Analyzing Agent Dynamics in Unified Distributed SGD
Distributed learning is essential to train machine learning algorithms across heterogeneous agents while maintaining data privacy. We conduct an asymptotic analysis of Unified Distributed SGD (UD-SGD), exploring a variety of communication patterns, including decentralized SGD and local SGD within Federated Learning (FL), as well as the increasing communication interval in the FL setting. In this study, we assess how different sampling strategies, such as i.i.d. Our findings not only support existing theories on linear speedup and asymptotic network independence, but also theoretically and empirically show how efficient sampling strategies employed by individual agents contribute to overall convergence in UD-SGD. Simulations reveal that a few agents using highly efficient sampling can achieve or surpass the performance of the majority employing moderately improved strategies, providing new insights beyond traditional analyses focusing on the worst-performing agent.
A Unified, Scalable Framework for Neural Population Decoding
Our ability to use deep learning approaches to decipher neural activity would likely benefit from greater scale, in terms of both the model size and the datasets. However, the integration of many neural recordings into one unified model is challenging, as each recording contains the activity of different neurons from different individual animals. In this paper, we introduce a training framework and architecture designed to model the population dynamics of neural activity across diverse, large-scale neural recordings. Our method first tokenizes individual spikes within the dataset to build an efficient representation of neural events that captures the fine temporal structure of neural activity. We then employ cross-attention and a PerceiverIO backbone to further construct a latent tokenization of neural population activities.
Carvalho, who unplugged school AI chatbot, wants task force to tell him what went wrong
Alberto Carvalho, who remains determined to bring artificial intelligence into district classrooms despite the collapse of the technology company leading the effort, will appoint a task force to examine what went wrong and how to move forward. The schools chief announced the task force in an interview with The Times in advance of Tuesday's annual address to administrators, which is akin to a state-of-the-schools speech. In his public address, Carvalho is expected to highlight academic progress and L.A. Unified School District initiatives. In a recent appearance, he said he was hopeful that standardized test scores would rise at all grade levels in math and English. Although school districts throughout the state have received results -- and can make them public if they wish -- the state has not yet released local or statewide scores.
LAUSD shelves its hyped AI chatbot to help students after collapse of firm that made it
The school district said it dropped its dealings with AllHere, the company that created "Ed," the sun-shaped chatbot, after "we were notified of their financial collapse." AllHere did not respond to an inquiry this week from The Times and the level of its operation is unclear. In a separate development, a major data breach has affected a data cloud company called Snowflake, which has worked with L.A. Unified. The district said Tuesday that there is no connection to the AllHere situation, and that it is working with investigative agencies to assess the damage and which district records were obtained through a third-party contractor. Meanwhile, the district unplugged the chatbot -- for which AllHere had been paid 3 million -- on June 14, less than three months after unveiling the animated figure as an easy-to-use, conversational companion for students and a soon-to-be-indispensable guide for parents.
L.A. school district probes inappropriate images shared at Fairfax High. More AI abuse?
Los Angeles school officials are investigating allegations that inappropriate photos were "created and disseminated within the Fairfax High School community," in what appears to be the latest alleged misuse of technology by students, a district statement said. Last week, Laguna Beach High School administrators announced that they had launched an investigation after a student allegedly created and circulated "inappropriate images" of classmates through the use of artificial intelligence. In January, five Beverly Hills eighth-graders were expelled for their involvement in the creation and sharing of fake nude pictures of classmates. The students superimposed pictures of classmates' faces onto nude bodies generated by artificial intelligence. In total, 16 eighth-grade students were targeted by the pictures, which were shared through messaging apps, according to the district.