Indian Ocean
Smart cities, smarter public health
Over the course of the last two years, we interviewed mayors, city officials, urban planners, academics, and citizens in cities around the world to identify the trends that are making urban living more sustainable, affordable, and human. One theme that emerged was cities' increasingly important role in ensuring the health and well-being of their residents.4 Cities currently represent just 3% of the world's territory but harbor 55% of the world's population. By 2050, it's estimated that 70% of the world's population will live in urban centers.5 At an economic level, cities generate around 80% of the global GDP,6 and are responsible for 80% of energy consumption and more than 70% of carbon emissions and global waste.7
Hashish and pirates: How AI is cleaning up the high seas
On August 8th, 2021, Spanish police and customs agents intercepted the cargo ship NATALIA on suspicion of narcotics trafficking. The ship was en route from Lebanon via Iskenderun, Turkey, to Lagos, Nigeria, and hidden on board was nearly 20 tons of hashish worth $470 million. That may sound like the opening scene of an action flick, but it's the kind of occurrence that happens more frequently than you might expect on the high seas. Drug smuggling, illegal fishing, and piracy are constant threats. Following a number of recent piracy incidents in the Gulf of Aden, Iran, Russia, and China recently began naval and air drills seeking to counter maritime piracy.
Large Language Models with Controllable Working Memory
Li, Daliang, Rawat, Ankit Singh, Zaheer, Manzil, Wang, Xin, Lukasik, Michal, Veit, Andreas, Yu, Felix, Kumar, Sanjiv
Large language models (LLMs) have led to a series of breakthroughs in natural language processing (NLP), owing to their excellent understanding and generation abilities. Remarkably, what further sets these models apart is the massive amounts of world knowledge they internalize during pretraining. While many downstream applications provide the model with an informational context to aid its performance on the underlying task, how the model's world knowledge interacts with the factual information presented in the context remains under explored. As a desirable behavior, an LLM should give precedence to the context whenever it contains task-relevant information that conflicts with the model's memorized knowledge. This enables model predictions to be grounded in the context, which can then be used to update or correct specific model predictions without frequent retraining. By contrast, when the context is irrelevant to the task, the model should ignore it and fall back on its internal knowledge. In this paper, we undertake a first joint study of the aforementioned two properties, namely controllability and robustness, in the context of LLMs. We demonstrate that state-of-the-art T5 and PaLM (both pretrained and finetuned) could exhibit poor controllability and robustness, which do not scale with increasing model size. As a solution, we propose a novel method - Knowledge Aware FineTuning (KAFT) - to strengthen both controllability and robustness by incorporating counterfactual and irrelevant contexts to standard supervised datasets. Our comprehensive evaluation showcases the utility of KAFT across model architectures and sizes.
Review of coreference resolution in English and Persian
Mohammadi, Hassan Haji, Talebpour, Alireza, Aznaveh, Ahmad Mahmoudi, Yazdani, Samaneh
Coreference resolution (CR) is one of the most challenging areas of natural language processing. This task seeks to identify all textual references to the same real-world entity. Research in this field is divided into coreference resolution and anaphora resolution. Due to its application in textual comprehension and its utility in other tasks such as information extraction systems, document summarization, and machine translation, this field has attracted considerable interest. Consequently, it has a significant effect on the quality of these systems. This article reviews the existing corpora and evaluation metrics in this field. Then, an overview of the coreference algorithms, from rule-based methods to the latest deep learning techniques, is provided. Finally, coreference resolution and pronoun resolution systems in Persian are investigated.
ENS-10: A Dataset For Post-Processing Ensemble Weather Forecasts
Ashkboos, Saleh, Huang, Langwen, Dryden, Nikoli, Ben-Nun, Tal, Dueben, Peter, Gianinazzi, Lukas, Kummer, Luca, Hoefler, Torsten
Post-processing ensemble prediction systems can improve the reliability of weather forecasting, especially for extreme event prediction. In recent years, different machine learning models have been developed to improve the quality of weather post-processing. However, these models require a comprehensive dataset of weather simulations to produce high-accuracy results, which comes at a high computational cost to generate. This paper introduces the ENS-10 dataset, consisting of ten ensemble members spanning 20 years (1998-2017). The ensemble members are generated by perturbing numerical weather simulations to capture the chaotic behavior of the Earth. To represent the three-dimensional state of the atmosphere, ENS-10 provides the most relevant atmospheric variables at 11 distinct pressure levels and the surface at 0.5-degree resolution for forecast lead times T=0, 24, and 48 hours (two data points per week). We propose the ENS-10 prediction correction task for improving the forecast quality at a 48-hour lead time through ensemble post-processing. We provide a set of baselines and compare their skill at correcting the predictions of three important atmospheric variables. Moreover, we measure the baselines' skill at improving predictions of extreme weather events using our dataset. The ENS-10 dataset is available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This Week's Awesome Tech Stories From Around the Web (Through November 5)
Having AIs Train Robot Dogs to Balance Makes Them a Lot Cheaper Jeremy Tsu New Scientist "An AI has been used to train a small robot dog to perform cleaning tasks. The hardware cost a total of $6300, which is less than a tenth of the price tag of the well-known robot dogs built by US tech firm Boston Dynamics. This type of self-taught robotic body coordination relies on an AI training regimen that could pave the way for affordable robot dogs and possibly even humanoid robots that could be used as helpers in homes and workplaces." Google Plans Giant AI Language Model Supporting World's 1,000 Most Spoken Languages James Vincent The Verge "i'The way we get to 1,000 languages is not by building 1,000 different models. Languages are like organisms, they've evolved from one another and they have certain similarities. And we can find some pretty spectacular advances in what we call zero-shot learning when we incorporate data from a new language into our 1,000 language model and get the ability to translate [what it's learned] from a high-resource language to a low-resource language,' says [Zoubin Ghahramani, vice president of research at Google AI]. Genetically Modified Mosquitoes Cut the Insect's Number by 96 Percent Miriam Fauzia New Scientist "Although not a permanent fix, periodically releasing such mosquitoes could reduce the burden of infections including dengue, malaria, and Zika.
Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast
Bi, Kaifeng, Xie, Lingxi, Zhang, Hengheng, Chen, Xin, Gu, Xiaotao, Tian, Qi
In this paper, we present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast. For this purpose, we establish a data-driven environment by downloading $43$ years of hourly global weather data from the 5th generation of ECMWF reanalysis (ERA5) data and train a few deep neural networks with about $256$ million parameters in total. The spatial resolution of forecast is $0.25^\circ\times0.25^\circ$, comparable to the ECMWF Integrated Forecast Systems (IFS). More importantly, for the first time, an AI-based method outperforms state-of-the-art numerical weather prediction (NWP) methods in terms of accuracy (latitude-weighted RMSE and ACC) of all factors (e.g., geopotential, specific humidity, wind speed, temperature, etc.) and in all time ranges (from one hour to one week). There are two key strategies to improve the prediction accuracy: (i) designing a 3D Earth Specific Transformer (3DEST) architecture that formulates the height (pressure level) information into cubic data, and (ii) applying a hierarchical temporal aggregation algorithm to alleviate cumulative forecast errors. In deterministic forecast, Pangu-Weather shows great advantages for short to medium-range forecast (i.e., forecast time ranges from one hour to one week). Pangu-Weather supports a wide range of downstream forecast scenarios, including extreme weather forecast (e.g., tropical cyclone tracking) and large-member ensemble forecast in real-time. Pangu-Weather not only ends the debate on whether AI-based methods can surpass conventional NWP methods, but also reveals novel directions for improving deep learning weather forecast systems.
Scaling Multimodal Pre-Training via Cross-Modality Gradient Harmonization
Wu, Junru, Liang, Yi, Han, Feng, Akbari, Hassan, Wang, Zhangyang, Yu, Cong
Self-supervised pre-training recently demonstrates success on large-scale multimodal data, and state-of-the-art contrastive learning methods often enforce the feature consistency from cross-modality inputs, such as video/audio or video/text pairs. Despite its convenience to formulate and leverage in practice, such cross-modality alignment (CMA) is only a weak and noisy supervision, since two modalities can be semantically misaligned even they are temporally aligned. For example, even in the (often adopted) instructional videos, a speaker can sometimes refer to something that is not visually present in the current frame; and the semantic misalignment would only be more unpredictable for the raw videos collected from unconstrained internet sources. We conjecture that might cause conflicts and biases among modalities, and may hence prohibit CMA from scaling up to training with larger and more heterogeneous data. This paper first verifies our conjecture by observing that, even in the latest VATT pre-training using only narrated videos, there exist strong gradient conflicts between different CMA losses within the same sample triplet (video, audio, text), indicating them as the noisy source of supervision. We then propose to harmonize such gradients during pre-training, via two techniques: (i) cross-modality gradient realignment: modifying different CMA loss gradients for one sample triplet, so that their gradient directions are in more agreement; and (ii) gradient-based curriculum learning: leveraging the gradient conflict information on an indicator of sample noisiness, to develop a curriculum learning strategy to prioritize training with less noisy sample triplets. Applying those gradient harmonization techniques to pre-training VATT on the HowTo100M dataset, we consistently improve its performance on different downstream tasks. Moreover, we are able to scale VATT pre-training to more complicated non-narrative Youtube8M dataset to further improve the state-of-the-arts.
Crime Prediction using Machine Learning with a Novel Crime Dataset
Shohan, Faisal Tareque, Akash, Abu Ubaida, Ibrahim, Muhammad, Alam, Mohammad Shafiul
Crime is an unlawful act that carries legal repercussions. Bangladesh has a high crime rate due to poverty, population growth, and many other socio-economic issues. For law enforcement agencies, understanding crime patterns is essential for preventing future criminal activity. For this purpose, these agencies need structured crime database. This paper introduces a novel crime dataset that contains temporal, geographic, weather, and demographic data about 6574 crime incidents of Bangladesh. We manually gather crime news articles of a seven year time span from a daily newspaper archive. We extract basic features from these raw text. Using these basic features, we then consult standard service-providers of geo-location and weather data in order to garner these information related to the collected crime incidents. Furthermore, we collect demographic information from Bangladesh National Census data. All these information are combined that results in a standard machine learning dataset. Together, 36 features are engineered for the crime prediction task. Five supervised machine learning classification algorithms are then evaluated on this newly built dataset and satisfactory results are achieved. We also conduct exploratory analysis on various aspects the dataset. This dataset is expected to serve as the foundation for crime incidence prediction systems for Bangladesh and other countries. The findings of this study will help law enforcement agencies to forecast and contain crime as well as to ensure optimal resource allocation for crime patrol and prevention.
What's your favorite scary movie? AI reimagines classic horror film posters
Artificial intelligence has reimagined movie posters of popular horror films just in time for Halloween - and the results are teeming with blood, gore and terror. A graphic design team inputted key words like mask, black cloak and blood to inspire the AI-powered app Wonder that brought the nightmares to life. The popular 1996 slasher film Scream features a woman with blue eyes and covering her mouth on its movie poster, but the AI created a hooded figure with a mask that is dripping in blood that is'arguably even more terrifying than the original.' The visuals were created using an app that asks users to describe what they want to see in the digital artwork, which has become a new medium recently. A graphic design team inputted key words like mask, black cloak and blood to inspire the AI-powered app Wonder that brought the nightmares to life.