Pacific Ocean
JamPatoisNLI: A Jamaican Patois Natural Language Inference Dataset
Armstrong, Ruth-Ann, Hewitt, John, Manning, Christopher
JamPatoisNLI provides the first dataset for natural language inference in a creole language, Jamaican Patois. Many of the most-spoken low-resource languages are creoles. These languages commonly have a lexicon derived from a major world language and a distinctive grammar reflecting the languages of the original speakers and the process of language birth by creolization. This gives them a distinctive place in exploring the effectiveness of transfer from large monolingual or multilingual pretrained models. While our work, along with previous work, shows that transfer from these models to low-resource languages that are unrelated to languages in their training set is not very effective, we would expect stronger results from transfer to creoles. Indeed, our experiments show considerably better results from few-shot learning of JamPatoisNLI than for such unrelated languages, and help us begin to understand how the unique relationship between creoles and their high-resource base languages affect cross-lingual transfer. JamPatoisNLI, which consists of naturally-occurring premises and expert-written hypotheses, is a step towards steering research into a traditionally underserved language and a useful benchmark for understanding cross-lingual NLP.
An Interpretable Model of Climate Change Using Correlative Learning
Anderson, Charles, Stock, Jason
Determining changes in global temperature and precipitation that may indicate climate change is complicated by annual variations. One approach for finding potential climate change indicators is to train a model that predicts the year from annual means of global temperatures and precipitations. Such data is available from the CMIP6 ensemble of simulations. Here a two-hidden-layer neural network trained on this data successfully predicts the year. Differences among temperature and precipitation patterns for which the model predicts specific years reveal changes through time. To find these optimal patterns, a new way of interpreting what the neural network has learned is explored. Alopex, a stochastic correlative learning algorithm, is used to find optimal temperature and precipitation maps that best predict a given year. These maps are compared over multiple years to show how temperature and precipitations patterns indicative of each year change over time.
Progress and Challenges for the Application of Machine Learning for Neglected Tropical Diseases
Khew, Chung Yuen, Akbar, Rahmad, Assaad, Norfarhan Mohd.
Neglected tropical diseases (NTDs) continue to affect the livelihood of individuals in countries in the Southeast Asia and Western Pacific region. These diseases have been long existing and have caused devastating health problems and economic decline to people in low- and middle-income (developing) countries. An estimated 1.7 billion of the world's population suffer one or more NTDs annually, this puts approximately one in five individuals at risk for NTDs. In addition to health and social impact, NTDs inflict significant financial burden to patients, close relatives, and are responsible for billions of dollars lost in revenue from reduced labor productivity in developing countries alone. There is an urgent need to better improve the control and eradication or elimination efforts towards NTDs. This can be achieved by utilizing machine learning tools to better the surveillance, prediction and detection program, and combat NTDs through the discovery of new therapeutics against these pathogens. This review surveys the current application of machine learning tools for NTDs and the challenges to elevate the state-of-the-art of NTDs surveillance, management, and treatment.
A Deep Learning Architecture for Passive Microwave Precipitation Retrievals using CloudSat and GPM Data
Rahimi, Reyhaneh, Vahedizadeh, Sajad, Ebtehaj, Ardeshir
This paper presents an algorithm that relies on a series of dense and deep neural networks for passive microwave retrieval of precipitation. The neural networks learn from coincidences of brightness temperatures from the Global Precipitation Measurement (GPM) Microwave Imager (GMI) with the active precipitating retrievals from the Dual-frequency Precipitation Radar (DPR) onboard GPM as well as those from the {CloudSat} Profiling Radar (CPR). The algorithm first detects the precipitation occurrence and phase and then estimates its rate, while conditioning the results to some key ancillary information including parameters related to cloud microphysical properties. The results indicate that we can reconstruct the DPR rainfall and CPR snowfall with a detection probability of more than 0.95 while the probability of a false alarm remains below 0.08 and 0.03, respectively. Conditioned to the occurrence of precipitation, the unbiased root mean squared error in estimation of rainfall (snowfall) rate using DPR (CPR) data is less than 0.8 (0.1) mm/hr over oceans and land. Beyond methodological developments, comparing the results with ERA5 reanalysis and official GPM products demonstrates that the uncertainty in global satellite snowfall retrievals continues to be large while there is a good agreement among rainfall products. Moreover, the results indicate that CPR active snowfall data can improve passive microwave estimates of global snowfall while the current CPR rainfall retrievals should only be used for detection and not estimation of rates.
FECAM: Frequency Enhanced Channel Attention Mechanism for Time Series Forecasting
Jiang, Maowei, Zeng, Pengyu, Wang, Kai, Liu, Huan, Chen, Wenbo, Liu, Haoran
Time series forecasting is a long-standing challenge due to the real-world information is in various scenario (e.g., energy, weather, traffic, economics, earthquake warning). However some mainstream forecasting model forecasting result is derailed dramatically from ground truth. We believe it's the reason that model's lacking ability of capturing frequency information which richly contains in real world datasets. At present, the mainstream frequency information extraction methods are Fourier transform(FT) based. However, use of FT is problematic due to Gibbs phenomenon. If the values on both sides of sequences differ significantly, oscillatory approximations are observed around both sides and high frequency noise will be introduced. Therefore We propose a novel frequency enhanced channel attention that adaptively modelling frequency interdependencies between channels based on Discrete Cosine Transform which would intrinsically avoid high frequency noise caused by problematic periodity during Fourier Transform, which is defined as Gibbs Phenomenon. We show that this network generalize extremely effectively across six real-world datasets and achieve state-of-the-art performance, we further demonstrate that frequency enhanced channel attention mechanism module can be flexibly applied to different networks. This module can improve the prediction ability of existing mainstream networks, which reduces 35.99% MSE on LSTM, 10.01% on Reformer, 8.71% on Informer, 8.29% on Autoformer, 8.06% on Transformer, etc., at a slight computational cost ,with just a few line of code. Our codes and data are available at https://github.com/Zero-coder/FECAM.
Interactive data prep widget for notebooks powered by Amazon SageMaker Data Wrangler
According to a 2020 survey of data scientists conducted by Anaconda, data preparation is one of the critical steps in machine learning (ML) and data analytics workflows, and often very time consuming for data scientists. Data scientists spend about 66% of their time on data preparation and analysis tasks, including loading (19%), cleaning (26%), and visualizing data (21%). Amazon SageMaker Studio is the first fully integrated development environment (IDE) for ML. With a single click, data scientists and developers can quickly spin up Studio notebooks to explore datasets and build models. If you prefer a GUI-based and interactive interface, you can use Amazon SageMaker Data Wrangler, with over 300 built in visualizations, analyses, and transformations to efficiently process data backed by Spark without writing a single line of code.
Amazon to warn customers on limitations of its AI
Inc (AMZN.O) is planning to roll out warning cards for software sold by its cloud-computing division, in light of ongoing concern that artificially intelligent systems can discriminate against different groups, the company told Reuters. Akin to lengthy nutrition labels, Amazon's so-called AI Service Cards will be public so its business customers can see the limitations of certain cloud services, such as facial recognition and audio transcription. The goal would be to prevent mistaken use of its technology, explain how its systems work and manage privacy, Amazon said. The company is not the first to publish such warnings. International Business Machines Corp (IBM.N), a smaller player in the cloud, did so years ago.
San Francisco approves plan to allow police robots to use deadly force in emergency situations
San Francisco leaders voted to allow the city's police department to use potentially lethal robots in emergency situations. "Under this policy, SFPD is authorized to use these robots to carry out deadly force in extremely limited situations when risk to loss of life to members of the public or officers is imminent and outweighs any other force option available," City Supervisor Rafael Mandelman wrote on Twitter. San Francisco leaders voted 8-3 on Tuesday in support of the new policy. The San Francisco Police Department has 17 robots, but none are armed with guns, and the department has no plans to do so. The department could deploy robots equipped with explosive charges "to contact, incapacitate, or disorient violent, armed, or dangerous suspect" during emergency situations when lives are at risk, according to a police department spokesperson.
San Francisco police given power to use killer robots
Officials in San Francisco have voted to give the city's police the power to use potentially lethal, remote-controlled robots in emergency situations. The 8-3 vote in favour of the move followed an emotionally charged two-hour debate and came despite strong objections from civil liberties and other police oversight groups in the city on the west coast of the United States. Supervisor Connie Chan, a member of the committee that forwarded the proposal to the full board, said she understood concerns over use of force but that "according to state law, we are required to approve the use of these equipments. So here we are, and it's definitely not an easy discussion." The San Francisco Police Department (SFPD) has said it does not have pre-armed robots and has no plans to arm robots with guns.
San Francisco approves police proposal to use potentially deadly robots
Police in San Francisco will be allowed to deploy potentially lethal, remote-controlled robots in emergency situations. The controversial policy was approved after weeks of scrutiny and a heated debate among the city's board of supervisors during their meeting on Tuesday. Police oversight groups, the ACLU and San Francisco's public defender had urged the 11-member body to reject the police's use of equipment proposal. Opponents of the policy said it would lead to further militarization of a police force already too aggressive with underserved communities. They said the parameters under which use would be allowed were too vague.