Media
'Fear really drives him': is Alex Karp of Palantir the world's scariest CEO?
'Palantir is the embodiment, in a lot of ways, of him' Alex Karp. 'Palantir is the embodiment, in a lot of ways, of him' Alex Karp. 'Fear really drives him': is Alex Karp of Palantir the world's scariest CEO? His company is potentially creating the ultimate state surveillance tool, and Karp has recently been on a striking political and philosophical journey. I n a recent interview, Alex Karp said that his company Palantir was "the most important software company in America and therefore in the world". He may well be right.
Chance of more showers in L.A., with a new storm set to hit Thursday
Things to Do in L.A. Tap to enable a layout that focuses on the article. Chance of more showers in L.A., with a new storm set to hit Thursday A driver navigates a flooded street during a storm Monday in Santa Barbara. This is read by an automated voice. Please report any issues or inconsistencies here . Showers could linger in Los Angeles on Tuesday following four straight days of rain -- and even more rain is likely on Thursday and Friday.
Don't blindly trust what AI tells you, says Google's Sundar Pichai
Don't blindly trust what AI tells you, says Google's Sundar Pichai People should not blindly trust everything AI tools tell them, the boss of Google's parent company Alphabet told the BBC. In an exclusive interview, chief executive Sundar Pichai said that AI models are prone to errors and urged people to use them alongside other tools. Mr Pichai said it highlighted the importance of having a rich information ecosystem, rather than solely relying on AI technology. This is why people also use Google search, and we have other products that are more grounded in providing accurate information. While AI tools were helpful if you want to creatively write something, Mr Pichai said people have to learn to use these tools for what they're good at, and not blindly trust everything they say.
Google boss warns 'no company is going to be immune' if AI bubble bursts
Google boss warns'no company is going to be immune' if AI bubble bursts Every company would be affected if the AI bubble were to burst, the head of Google's parent firm Alphabet has told the BBC. Speaking exclusively to BBC News, Sundar Pichai said while the growth of artificial intelligence (AI) investment had been an extraordinary moment, there was some irrationality in the current AI boom. It comes amid fears in Silicon Valley and beyond of a bubble as the value of AI tech companies has soared in recent months and companies spend big on the burgeoning industry. Asked whether Google would be immune to the impact of the AI bubble bursting, Mr Pichai said the tech giant could weather that potential storm, but also issued a warning. I think no company is going to be immune, including us, he said.
On the Entropy Calibration of Language Models
Cao, Steven, Valiant, Gregory, Liang, Percy
We study the problem of entropy calibration, which asks whether a language model's entropy over generations matches its log loss on human text. Past work found that models are miscalibrated, with entropy per step increasing (and text quality decreasing) as generations grow longer. This error accumulation is a fundamental problem in autoregressive models, and the standard solution is to truncate the distribution, which improves text quality at the cost of diversity. In this paper, we ask: is miscalibration likely to improve with scale, and is it theoretically possible to calibrate without tradeoffs? To build intuition, we first study a simplified theoretical setting to characterize the scaling behavior of miscalibration with respect to dataset size. We find that the scaling behavior depends on the power law exponent of the data distribution -- in particular, for a power law exponent close to 1, the scaling exponent is close to 0, meaning that miscalibration improves very slowly with scale. Next, we measure miscalibration empirically in language models ranging from 0.5B to 70B parameters. We find that the observed scaling behavior is similar to what is predicted by the simplified setting: our fitted scaling exponents for text are close to 0, meaning that larger models accumulate error at a similar rate as smaller ones. This scaling (or, lack thereof) provides one explanation for why we sample from larger models with similar amounts of truncation as smaller models, even though the larger models are of higher quality. However, truncation is not a satisfying solution because it comes at the cost of increased log loss. In theory, is it even possible to reduce entropy while preserving log loss? We prove that it is possible, if we assume access to a black box which can fit models to predict the future entropy of text.
Out-of-Context Misinformation Detection via Variational Domain-Invariant Learning with Test-Time Training
Yang, Xi, Zhang, Han, Lin, Zhijian, Hu, Yibiao, Han, Hong
Out-of-context misinformation (OOC) is a low-cost form of misinformation in news reports, which refers to place authentic images into out-of-context or fabricated image-text pairings. This problem has attracted significant attention from researchers in recent years. Current methods focus on assessing image-text consistency or generating explanations. However, these approaches assume that the training and test data are drawn from the same distribution. When encountering novel news domains, models tend to perform poorly due to the lack of prior knowledge. To address this challenge, we propose \textbf{VDT} to enhance the domain adaptation capability for OOC misinformation detection by learning domain-invariant features and test-time training mechanisms. Domain-Invariant Variational Align module is employed to jointly encodes source and target domain data to learn a separable distributional space domain-invariant features. For preserving semantic integrity, we utilize domain consistency constraint module to reconstruct the source and target domain latent distribution. During testing phase, we adopt the test-time training strategy and confidence-variance filtering module to dynamically updating the VAE encoder and classifier, facilitating the model's adaptation to the target domain distribution. Extensive experiments conducted on the benchmark dataset NewsCLIPpings demonstrate that our method outperforms state-of-the-art baselines under most domain adaptation settings.
GCAgent: Long-Video Understanding via Schematic and Narrative Episodic Memory
Yeo, Jeong Hun, Chung, Sangyun, Park, Sungjune, Kim, Dae Hoe, Moon, Jinyoung, Ro, Yong Man
Long-video understanding remains a significant challenge for Multimodal Large Language Models (MLLMs) due to inherent token limitations and the complexity of capturing long-term temporal dependencies. Existing methods often fail to capture the global context and complex event relationships necessary for deep video reasoning. To address this, we introduce GCAgent, a novel Global-Context-Aware Agent framework that achieves comprehensive long-video understanding. Our core innovation is the Schematic and Narrative Episodic Memory. This memory structurally models events and their causal and temporal relations into a concise, organized context, fundamentally resolving the long-term dependency problem. Operating in a multi-stage Perception-Action-Reflection cycle, our GCAgent utilizes a Memory Manager to retrieve relevant episodic context for robust, context-aware inference. Extensive experiments confirm that GCAgent significantly enhances long-video understanding, achieving up to 23.5\% accuracy improvement on the Video-MME Long split over a strong MLLM baseline. Furthermore, our framework establishes state-of-the-art performance among comparable 7B-scale MLLMs, achieving 73.4\% accuracy on the Long split and the highest overall average (71.9\%) on the Video-MME benchmark, validating our agent-based reasoning paradigm and structured memory for cognitively-inspired long-video understanding.
Read Between the Lines: A Benchmark for Uncovering Political Bias in Bangla News Articles
Lia, Nusrat Jahan, Dipta, Shubhashis Roy, Zehady, Abdullah Khan, Islam, Naymul, Chakraborty, Madhusodan, Wasif, Abdullah Al
Detecting media bias is crucial, specifically in the South Asian region. Despite this, annotated datasets and computational studies for Bangla political bias research remain scarce. Crucially because, political stance detection in Bangla news requires understanding of linguistic cues, cultural context, subtle biases, rhetorical strategies, code-switching, implicit sentiment, and socio-political background. To address this, we introduce the first benchmark dataset of 200 politically significant and highly debated Bangla news articles, labeled for government-leaning, government-critique, and neutral stances, alongside diagnostic analyses for evaluating large language models (LLMs). Our comprehensive evaluation of 28 proprietary and open-source LLMs shows strong performance in detecting government-critique content (F1 up to 0.83) but substantial difficulty with neutral articles (F1 as low as 0.00). Models also tend to over-predict government-leaning stances, often misinterpreting ambiguous narratives. This dataset and its associated diagnostics provide a foundation for advancing stance detection in Bangla media research and offer insights for improving LLM performance in low-resource languages.