Goto

Collaborating Authors

 Pacific Ocean


Beyond Dashboards: The Future Of Analytics And Business Intelligence?

#artificialintelligence

Analytics and business intelligence (BI) have long been understood to be fundamental to business success. Today, powerful technologies, including artificial intelligence (AI) and machine learning (ML), make it possible to gain deeper insights into all areas of business activity in order to drive efficiency, reduce waste and gain a better understanding of customers. Truly benefiting from analytics – particularly the most advanced and powerful analytics techniques involving AI – requires developing a top-to-bottom culture of data literacy throughout an organization and this, in my experience, is where many businesses are still failing. This is highlighted by one particular statistic that came up during my recent webinar conversation with Amir Orad, CEO of Sisense. Orad told me that according to his observations, 80 percent of employees in the average organization simply aren't leveraging the analytics that, in theory, are available to them.


Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast

arXiv.org Artificial Intelligence

In this paper, we present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast. For this purpose, we establish a data-driven environment by downloading $43$ years of hourly global weather data from the 5th generation of ECMWF reanalysis (ERA5) data and train a few deep neural networks with about $256$ million parameters in total. The spatial resolution of forecast is $0.25^\circ\times0.25^\circ$, comparable to the ECMWF Integrated Forecast Systems (IFS). More importantly, for the first time, an AI-based method outperforms state-of-the-art numerical weather prediction (NWP) methods in terms of accuracy (latitude-weighted RMSE and ACC) of all factors (e.g., geopotential, specific humidity, wind speed, temperature, etc.) and in all time ranges (from one hour to one week). There are two key strategies to improve the prediction accuracy: (i) designing a 3D Earth Specific Transformer (3DEST) architecture that formulates the height (pressure level) information into cubic data, and (ii) applying a hierarchical temporal aggregation algorithm to alleviate cumulative forecast errors. In deterministic forecast, Pangu-Weather shows great advantages for short to medium-range forecast (i.e., forecast time ranges from one hour to one week). Pangu-Weather supports a wide range of downstream forecast scenarios, including extreme weather forecast (e.g., tropical cyclone tracking) and large-member ensemble forecast in real-time. Pangu-Weather not only ends the debate on whether AI-based methods can surpass conventional NWP methods, but also reveals novel directions for improving deep learning weather forecast systems.


Make it pop! Do we really need the Beatles to sound new?

The Guardian

Yellow Submarine, Ringo Starr's turn on Revolver, has been a gateway for children into the music of the Beatles since its release in 1966. A new reissue of the album makes that relationship more explicit: Giles Martin, son of original producer George and the sonic custodian of the Beatles catalogue, says his "de-mixing" of the album – using AI to separate individual instruments that were originally squeezed together on four tracks – was done in part with a playlist-listening younger audience in mind. Martin recently told Variety that his teenage children listen to old and new music side by side, veering from Fleetwood Mac to Billie Eilish and Olivia Rodrigo. "[W]hat I want to make sure is that when people hear the Beatles, that it has the same dynamic as the other stuff they're listening to," he said. He added that 1969's Abbey Road, recorded on a then luxuriant eight tracks and the first Beatles album not released in mono, stands out from the band's catalogue as "it sounds more hi-fi than the other Beatles albums".


Review on Monitoring, Operation and Maintenance of Smart Offshore Wind Farms

arXiv.org Artificial Intelligence

In recent years, with the development of wind energy, the number and scale of wind farms have been developing rapidly. Since offshore wind farms have the advantages of stable wind speed, being clean renewable, non-polluting, and the non-occupation of cultivated land, they have gradually become a new trend in the wind power industry all over the world. The operation and maintenance of offshore wind powe has been developing in the direction of digitization and intelligence. It is of great significance to carry ou research on the monitoring, operation, and maintenance of offshore wind farms, which will be of benefit fo the reduction of the operation and maintenance costs, the improvement of the power generation efficiency improvement of the stability of offshore wind farm systems, and the building of smart offshore wind farms This paper will mainly summarize the monitoring, operation, and maintenance of offshore wind farms, with particular focus on the following points: monitoring of "offshore wind power engineering and biological and environment", the monitoring of power equipment, and the operation and maintenance of smart offshore wind farms. Finally, the future research challenges in relation to the monitoring, operation, and maintenance of smart offshore wind farms are proposed, and the future research directions in this field are explored especially in marine environment monitoring, weather and climate prediction, intelligent monitoring of powe equipment, and digital platforms.


Understanding Adverse Biological Effect Predictions Using Knowledge Graphs

arXiv.org Artificial Intelligence

Extrapolation of adverse biological (toxic) effects of chemicals is an important contribution to expand available hazard data in (eco)toxicology without the use of animals in laboratory experiments. In this work, we extrapolate effects based on a knowledge graph (KG) consisting of the most relevant effect data as domain-specific background knowledge. An effect prediction model, with and without background knowledge, was used to predict mean adverse biological effect concentration of chemicals as a prototypical type of stressors. The background knowledge improves the model prediction performance by up to 40\% in terms of $R^2$ (\ie coefficient of determination). We use the KG and KG embeddings to provide quantitative and qualitative insights into the predictions. These insights are expected to improve the confidence in effect prediction. Larger scale implementation of such extrapolation models should be expected to support hazard and risk assessment, by simplifying and reducing testing needs.


Credit-Based Congestion Pricing: Equilibrium Properties and Optimal Scheme Design

arXiv.org Artificial Intelligence

Credit-based congestion pricing (CBCP) has emerged as a mechanism to alleviate the social inequity concerns of road congestion pricing - a promising strategy for traffic congestion mitigation - by providing low-income users with travel credits to offset some of their toll payments. While CBCP offers immense potential for addressing inequity issues that hamper the practical viability of congestion pricing, the deployment of CBCP in practice is nascent, and the potential efficacy and optimal design of CBCP schemes have yet to be formalized. In this work, we study the design of CBCP schemes to achieve particular societal objectives and investigate their influence on traffic patterns when routing heterogeneous users with different values of time (VoTs) in a multi-lane highway with an express lane. We introduce a new non-atomic congestion game model of a mixed-economy, wherein eligible users receive travel credits while the remaining ineligible users pay out-of-pocket to use the express lane. In this setting, we investigate the effect of CBCP schemes on traffic patterns by characterizing the properties (i.e., existence, comparative statics) of the corresponding Nash equilibria and, in the setting when eligible users have time-invariant VoTs, develop a convex program to compute these equilibria. We further present a bi-level optimization framework to design optimal CBCP schemes to achieve a central planner's societal objectives. Finally, we conduct numerical experiments based on a case study of the San Mateo 101 Express Lanes Project, one of the first North American CBCP pilots. Our results demonstrate the potential of CBCP to enable low-income travelers to avail of the travel time savings provided by congestion pricing on express lanes while having comparatively low impacts on the travel costs of other road users.


ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation

arXiv.org Artificial Intelligence

We study the text generation task under the approach of pre-trained language models (PLMs). Typically, an auto-regressive (AR) method is adopted for generating texts in a token-by-token manner. Despite many advantages of AR generation, it usually suffers from inefficient inference. Therefore, non-autoregressive (NAR) models are proposed to generate all target tokens simultaneously. However, NAR models usually generate texts of lower quality due to the absence of token dependency in the output text. In this paper, we propose ELMER: an efficient and effective PLM for NAR text generation to explicitly model the token dependency during NAR generation. By leveraging the early exit technique, ELMER enables the token generations at different layers, according to their prediction confidence (a more confident token will exit at a lower layer). Besides, we propose a novel pre-training objective, Layer Permutation Language Modeling, to pre-train ELMER by permuting the exit layer for each token in sequences. Experiments on three text generation tasks show that ELMER significantly outperforms NAR models and further narrows the performance gap with AR PLMs (\eg ELMER (29.92) vs BART (30.61) ROUGE-L in XSUM) while achieving over 10 times inference speedup.


Effective and Efficient Query-aware Snippet Extraction for Web Search

arXiv.org Artificial Intelligence

Query-aware webpage snippet extraction is widely used in search engines to help users better understand the content of the returned webpages before clicking. Although important, it is very rarely studied. In this paper, we propose an effective query-aware webpage snippet extraction method named DeepQSE, aiming to select a few sentences which can best summarize the webpage content in the context of input query. DeepQSE first learns query-aware sentence representations for each sentence to capture the fine-grained relevance between query and sentence, and then learns document-aware query-sentence relevance representations for snippet extraction. Since the query and each sentence are jointly modeled in DeepQSE, its online inference may be slow. Thus, we further propose an efficient version of DeepQSE, named Efficient-DeepQSE, which can significantly improve the inference speed of DeepQSE without affecting its performance. The core idea of Efficient-DeepQSE is to decompose the query-aware snippet extraction task into two stages, i.e., a coarse-grained candidate sentence selection stage where sentence representations can be cached, and a fine-grained relevance modeling stage. Experiments on two real-world datasets validate the effectiveness and efficiency of our methods.


TASA: Deceiving Question Answering Models by Twin Answer Sentences Attack

arXiv.org Artificial Intelligence

We present Twin Answer Sentences Attack (TASA), an adversarial attack method for question answering (QA) models that produces fluent and grammatical adversarial contexts while maintaining gold answers. Despite phenomenal progress on general adversarial attacks, few works have investigated the vulnerability and attack specifically for QA models. In this work, we first explore the biases in the existing models and discover that they mainly rely on keyword matching between the question and context, and ignore the relevant contextual relations for answer prediction. Based on two biases above, TASA attacks the target model in two folds: (1) lowering the model's confidence on the gold answer with a perturbed answer sentence; (2) misguiding the model towards a wrong answer with a distracting answer sentence. Equipped with designed beam search and filtering methods, TASA can generate more effective attacks than existing textual attack methods while sustaining the quality of contexts, in extensive experiments on five QA datasets and human evaluations.


Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation

arXiv.org Artificial Intelligence

Testing Machine Learning (ML) models and AI-Infused Applications (AIIAs), or systems that contain ML models, is highly challenging. In addition to the challenges of testing classical software, it is acceptable and expected that statistical ML models sometimes output incorrect results. A major challenge is to determine when the level of incorrectness, e.g., model accuracy or F1 score for classifiers, is acceptable and when it is not. In addition to business requirements that should provide a threshold, it is a best practice to require any proposed ML solution to out-perform simple baseline models, such as a decision tree. We have developed complexity measures, which quantify how difficult given observations are to assign to their true class label; these measures can then be used to automatically determine a baseline performance threshold. These measures are superior to the best practice baseline in that, for a linear computation cost, they also quantify each observation' classification complexity in an explainable form, regardless of the classifier model used. Our experiments with both numeric synthetic data and real natural language chatbot data demonstrate that the complexity measures effectively highlight data regions and observations that are likely to be misclassified.