restriction
9d411e87d0f37059f40fb27c5de00ba0-Supplemental-Datasets_and_Benchmarks_Track.pdf
The following section is answers to questions listed in datasheets for datasets.858 A.1 Motivation859 Question: For what purpose was the dataset created? Was there a specific task in mind?860 Was there a specific gap that needed to be filled? Answer: To evaluate the linguistic robustness of language models across diverse English862 varieties by transforming Standard American English (SAE) datasets.863 Question: Who created the dataset (e.g., which team, research group) and on behalf of864 which entity (e.g., company, institution, organization)?865 Answer: The authors of this paper.866 Question: Who funded the creation of the dataset? If there is an associated grant, please867 provide the name of the grantor and the grant name and number.868
SentinelKilnDB: ALarge-Scale Dataset and Benchmark for OBBBrick Kiln Detection in South Asia Using Satellite Imagery Supplementary Information
The questions are presented in blue, with our corresponding responses shown in black. For what purpose was the dataset created? Was there a specific task in mind? This dataset was created for academic and research purposes to advance scientific understanding and support policy development on air quality and sustainability issues. The findings highlight important opportunities to improve regulatory compliance and encourage the adoption of cleaner technologies within the brick kiln sector, which is a significant contributor to regional air pollution. Beyond its environmental relevance, this dataset is especially valuable for the fields of object detection and computer vision. It provides a large-scale, hand-validated collection of brick kiln locations annotated with oriented bounding boxes (OBBs) on freely available Sentinel-2 satellite imagery.
Is Putin Finally Feeling Pressure?
Is Vladimir Putin Finally Feeling Pressure? The Russian President is facing growing domestic discontent after a series of successful attacks by the Ukrainian Army, including a major attack on Moscow. The war in Ukraine, which not long ago seemed to be turning in favor of Vladimir Putin's invading Russian Army, appears to have undergone another reversal. Thanks in part to its drone campaign, the Ukrainians have, according to some analysts, " turned the tide," putting pressure on Putin to potentially accept a ceasefire in the coming months. At the same time, there have been bubbles of discontent forming within Russia, over the cost of the war and government crackdowns on internet access. To understand what might be happening in Russia, and how the Putin regime might respond, I recently e-mailed several rounds of questions to Tatiana Stanovaya, a senior fellow at the Carnegie Russia Eurasia Center, and the founder of the political analysis organization R.Politik. Our conversation, edited for length and clarity, is below.
The Download: a reality check for geoengineering and the science of interoception
Plus: SpaceX is now valued higher than Amazon. Solar geoengineering, the controversial idea that we could deliberately intervene in the climate system to counteract global warming, is moving beyond computer simulations and into the practical engineering challenges required to make it real. Researchers are now working on aircraft, materials, and other systems for solar geoengineering. But as they delve into these details, they're finding that even early deployment would require significant new infrastructure, time, and investment. Find out what happens when solar geoengineering encounters the realities of trying to cool the planet . Scientists have a word for how we sense ourselves from the inside: interoception.
3e5b0db387078ac4968fd536d3c3a019-Supplemental-Datasets_and_Benchmarks_Track.pdf
For models trained for multi-image input, text prompt is:850 Which objects are present in both images? You can think of your answer in any way (e.g. For models where we first concatenate the input images, the text prompt is:855 There are two images provided, one on the left and the other on the right.856 Which objects are present in both images? You can think of your answer in any way (e.g. We used the following procedure to guide our creation of images.
Five big questions about the UK's under-16s social media ban
Five big questions about the UK's under-16s social media ban After the government's announcement on Monday, we know a social media ban is coming for under-16s in the UK . However, details on which apps are and are not included, besides those named by the government, and how the measures will extend to gaming sites like Roblox, remain sparse. And many are already asking whether enforcing the ban will mean cracking down on virtual private networks (VPNs), which can disguise someone's location online. Ministers have said they will provide an update on further restrictions like potential curfews, curbing of addictive features like infinite scroll and AI chatbots, in July. But here are some of the big unanswered questions about the UK social media ban.
Knee-Deep in C-RASP: ATransformer Depth Hierarchy
It has been observed that transformers with greater depth (that is, more layers) have more capabilities, but can we establish formally which capabilities are gained? We answer this question with a theoretical proof followed by an empirical study. First, we consider transformers that round to fixed precision except inside attention. We show that this subclass of transformers is expressively equivalent to the programming language C-RASP and this equivalence preserves depth. Second, we prove that deeper C-RASP programs are more expressive than shallower C-RASP programs, implying that deeper transformers are more expressive than shallower transformers (within the subclass mentioned above). The same is also proven for transformers with positional encodings (like RoPE and ALiBi). These results are established by studying a temporal logic with counting operators equivalent to C-RASP. Finally, we provide empirical evidence that our theory predicts the depth required for transformers without positional encodings to length-generalize on a family of sequential dependency tasks.
The UK Places a Sweeping Ban on Social Media for Kids Under 16
The UK government is introducing a ban on social media for children and a minimum age for some chatbots in an attempt to shield young people from dangerous corners of the web. UK prime minister Keir Starmer has been leading the charge on under-16 social media regulation. Children under the age of 16 will be banned from social media platforms in the UK, under new measures announced by prime minister Keir Starmer on Monday. "The need for action could not be clearer. Social media is making our children unhappy and unsafe," said Starmer, in an X post .
Online Learning of Neural Networks
We study online learning of feedforward neural networks with the sign activation function that implement functions from the unit ball in $\mathbb{R}^d$ to a finite label set $\mathcal{Y} = \{1, \ldots, Y \}$. First, we characterize a margin condition that is sufficient and in some cases necessary for online learnability of a neural network: Every neuron in the first hidden layer classifies all instances with some margin $\gamma$ bounded away from zero. Quantitatively, we prove that for any net, the optimal mistake bound is at most approximately $\mathtt{TS}(d,\gamma)$, which is the $(d,\gamma)$-totally-separable-packing number, a more restricted variation of the standard $(d,\gamma)$-packing number. We complement this result by constructing a net on which any learner makes $\mathtt{TS}(d,\gamma)$ many mistakes. We also give a quantitative lower bound of approximately $\mathtt{TS}(d,\gamma) \geq \max\{1/(\gamma \sqrt{d})^d, d\}$ when $\gamma \geq 1/2$, implying that for some nets and input sequences every learner will err for $\exp(d)$ many times, and that a dimension-free mistake bound is almost always impossible.
Bayesian Multiplicity Correction in the Probabilistic Forward Stepwise Framework
Womack, Andrew, Taylor-Rodriguez, Daniel
We develop a natural Bayesian multiplicity-correcting prior distribution within the probabilistic forward stepwise representation of model space priors for regression problems. The proposed prior, obtained from making an analogy to the Holm procedure, exhibits behavior closely aligned with that of the Matryoshka doll prior. We compare both priors to several other priors, including some recently put forward as objective choices for model space prior probabilities. Our comparisons indicate that adequate multiplicity correction requires a degree of sparsity that many recommended priors do not provide, and we argue that multiplicity correction itself offers a principled and transparent criterion for specifying model space priors in regression.