Goto

Collaborating Authors

 Large Language Model


Sanctioned Chinese AI Firm SenseTime Releases Image Model Built for Speed

WIRED

With US restrictions limiting its access to advanced tech, SenseTime is doubling down on open source with a new model optimized to run on Chinese-made chips. SenseTime, a Chinese AI company best known for its facial recognition technology, released a new open source model on Tuesday that it claims can both generate and interpret images far faster than top models developed by US competitors. SenseNova U1 could help the company reclaim lost ground after it slipped from its place among the leading players in China's AI development race. The model's secret sauce is its ability to "read" images without translating them to text first, speeding up the process and reducing the amount of computing power required. "The model's entire reasoning process is no longer limited to text. It can reason with images as well," Dahua Lin, cofounder and chief scientist at SenseTime, said in an interview with WIRED.


Your old prompts won't work with GPT-5.5. Try these instead

PCWorld

When you purchase through links in our articles, we may earn a small commission. If you're using long and overly specific prompts with ChatGPT's latest model, you're doing it wrong. OpenAI's latest and most powerful model, GPT-5.5, has been topping benchmark charts and impressing users with its coding and reasoning abilities, not to mention the sheer quantity of facts at its fingertips. But while ChatGPT's latest model doesn't require the hand-holding that older models did, it also gets fussy with the longer, highly detailed prompts that might have worked well in the past. If you're seeing worse performance with GPT-5.5 than you had with previous models, it might be your prompt constructions.


Revisiting Out of distribution Robustness in NLP Benchmark Analysis and LLMs Evaluations

Neural Information Processing Systems

We find that the distribution shift settings in previous studies commonly lack adequate challenges, hindering the accurate evaluation of OOD robustness. To address these issues, we propose a benchmark construction protocol that ensures clear differentiation and challenging distribution shifts. Then we introduce BOSS, a Benchmark suite for Out-of-distribution robustneSS evaluation covering 5 tasks and 20 datasets. Based on BOSS, we conduct a series of experiments on pretrained language models for analysis and evaluation of OOD robustness. First, for vanilla fine-tuning, we examine the relationship between in-distribution (ID) and OOD performance. We identify three typical types that unveil the inner learning mechanism, which could potentially facilitate the forecasting of OOD robustness, correlating with the advancements on ID datasets. Then, we evaluate 5 classic methods on BOSS and find that, despite exhibiting some effectiveness in specific cases, they do not offer significant improvement compared to vanilla fine-tuning. Further, we evaluate 5 LLMs with various adaptation paradigms and find that when sufficient ID data is available, fine-tuning domain-specific models outperform LLMs on ID examples significantly.


Victims Allege OpenAI Is Responsible for Mass Shooting

Mother Jones

A new lawsuit underscores key questions about the Tumbler Ridge killer's use of ChatGPT. A community vigil in Tumbler Ridge two days after the rural community experienced one of Canada's deadliest shootings Paige Taylor White/AFP/Getty Get your news from a source that's not owned and controlled by oligarchs. Victims of the Tumbler Ridge mass shooting and their families sued OpenAI and its CEO, Sam Altman, in US district court in San Francisco on Wednesday, claiming various negligence, product liability, and other violations. The civil complaints are the latest in a wave of litigation against OpenAI alleging that its globally popular chatbot, ChatGPT, helped people commit lethal violence. The complaints were filed by families of multiple victims wounded and killed at Tumbler Ridge Secondary School in British Columbia, Canada, where a suicidal 18-year-old opened fire on February 10.


The Oligarchy Is Afraid of Itself Too

Mother Jones

Musk v. Altman is a fight over how much power is too much in Silicon Valley. Get your news from a source that's not owned and controlled by oligarchs. In May 2016, Elon Musk did something out of character that he has now spent years of his life trying to undo: He made what he believed to be a charitable donation. The world's richest man is also among its stingiest. Musk's private foundation often doles out less than the minimum percentage required by law.


When Robots Have Their ChatGPT Moment, Remember These Pincers

WIRED

From sorting chicken nuggets to screwing in light bulbs, Eka's robots are eerily lifelike. But do they have real physical smarts? It starts gingerly pawing around the table, as if searching for its glasses on the nightstand. It gently positions the bulb between its two pincers. The claw goes chasing it across the table. After a few nips, the bulb is back in its grasp. In more than a decade of writing about robots, I have never seen one move so naturally.



In the coming AI future, Britain must not end up at the mercy of US tech giants Rafael Behr

The Guardian

Kendall says AI is the'currency of the future'. Kendall says AI is the'currency of the future'. Trump is volatile, capricious and unreasonable - but he belongs to the old world of analogue power. D onald Trump is not impressed by soft power. He respects hard men with military muscle.


SpeAr: A Spectral Approach for Zero-Shot Node Classification

Neural Information Processing Systems

Zero-shot node classification is a vital task in the field of graph data processing, aiming to identify nodes of classes unseen during the training process. Prediction bias is one of the primary challenges in zero-shot node classification, referring to the model's propensity to misclassify nodes of unseen classes as seen classes. However, most methods introduce external knowledge to mitigate the bias, inadequately leveraging the inherent cluster information within the unlabeled nodes. To address this issue, we employ spectral analysis coupled with learnable class prototypes to discover the implicit cluster structures within the graph, providing a more comprehensive understanding of classes. In this paper, we propose a spectral approach for zero-shot node classification (SpeAr). Specifically, we establish an approximate relationship between minimizing the spectral contrastive loss and performing spectral decomposition on the graph, thereby enabling effective node characterization through loss minimization. Subsequently, the class prototypes are iteratively refined based on the learned node representations, initialized with the semantic vectors. Finally, extensive experiments verify the effectiveness of the SpeAr, which can further alleviate the bias problem.