Materials
An Introduction to Kernel and Operator Learning Methods for Homogenization by Self-consistent Clustering Analysis
Huang, Owen, Saha, Sourav, Guo, Jiachen, Liu, Wing Kam
Recent advances in operator learning theory have improved our knowledge about learning maps between infinite dimensional spaces. However, for large-scale engineering problems such as concurrent multiscale simulation for mechanical properties, the training cost for the current operator learning methods is very high. The article presents a thorough analysis on the mathematical underpinnings of the operator learning paradigm and proposes a kernel learning method that maps between function spaces. We first provide a survey of modern kernel and operator learning theory, as well as discuss recent results and open problems. From there, the article presents an algorithm to how we can analytically approximate the piecewise constant functions on R for operator learning. This implies the potential feasibility of success of neural operators on clustered functions. Finally, a k-means clustered domain on the basis of a mechanistic response is considered and the Lippmann-Schwinger equation for micro-mechanical homogenization is solved. The article briefly discusses the mathematics of previous kernel learning methods and some preliminary results with those methods. The proposed kernel operator learning method uses graph kernel networks to come up with a mechanistic reduced order method for multiscale homogenization.
CREPE: Open-Domain Question Answering with False Presuppositions
Yu, Xinyan Velocity, Min, Sewon, Zettlemoyer, Luke, Hajishirzi, Hannaneh
Information seeking users often pose questions with false presuppositions, especially when asking about unfamiliar topics. Most existing question answering (QA) datasets, in contrast, assume all questions have well defined answers. We introduce CREPE, a QA dataset containing a natural distribution of presupposition failures from online information-seeking forums. We find that 25% of questions contain false presuppositions, and provide annotations for these presuppositions and their corrections. Through extensive baseline experiments, we show that adaptations of existing open-domain QA models can find presuppositions moderately well, but struggle when predicting whether a presupposition is factually correct. This is in large part due to difficulty in retrieving relevant evidence passages from a large text corpus. CREPE provides a benchmark to study question answering in the wild, and our analyses provide avenues for future work in better modeling and further studying the task.
WikiWhy: Answering and Explaining Cause-and-Effect Questions
Ho, Matthew, Sharma, Aditya, Chang, Justin, Saxon, Michael, Levy, Sharon, Lu, Yujie, Wang, William Yang
As large language models (LLMs) grow larger and more sophisticated, assessing their "reasoning" capabilities in natural language grows more challenging. Recent question answering (QA) benchmarks that attempt to assess reasoning are often limited by a narrow scope of covered situations and subject matters. We introduce WikiWhy, a QA dataset built around a novel auxiliary task: explaining why an answer is true in natural language. WikiWhy contains over 9,000 "why" question-answer-rationale triples, grounded on Wikipedia facts across a diverse set of topics. Each rationale is a set of supporting statements connecting the question to the answer. WikiWhy serves as a benchmark for the reasoning capabilities of LLMs because it demands rigorous explicit rationales for each answer to demonstrate the acquisition of implicit commonsense knowledge, which is unlikely to be easily memorized. GPT-3 baselines achieve only 38.7% human-evaluated correctness in the end-to-end answer & explain condition, leaving significant room for future improvements.
Rolf Schmitz, Co-Founder & Co-CEO of CollectiveCrunch – Interview Series
Rolf Schmitz is the Co-Founder & Co-CEO of CollectiveCrunch, a platform changing the world's understanding of forests by providing the most accurate, scalable, timely analytics globally and enabling sustainable forestry and bring transparency to carbon trading markets. Rolf is an Engineer by education and holds an MBA from Manchester Business School. He has deep experience in global Business Development and Sales, having built teams in Asia, USA and Europe. Could you share the genesis story behind CollectiveCrunch? We are steeped in handling large amounts of data and deriving insights from them.
203 Cyber Monday Deals Still Going Strong Right Now
CYBER MONDAY IS officially over, but many of our favorite deals are still available. If you took an extended holiday from screens or celebrated Buy Nothing Weekend, fear not, some deals remain. We combed through our many Cyber Monday guides and picked out the items still on sale. It's unclear how long they'll last, and many may even start to expire before the day's end, but have a look if you're still hunting for holiday bargains. We test products year-round and handpicked these deals. Products that are sold out or no longer discounted as of publishing will be crossed out . We'll update this guide throughout the week. If you buy something using links in our stories, we may earn a commission. This helps support our journalism. Check out our Best Cheap Phones and the Best iPhones guides for more recommendations and context. You can also find more picks in our Best Tablets, Best iPads, and Best Amazon Fire Tablets guides. Samsung's fourth-gen folding phones still have a futuristic feel ...
HR Data Architect at Syngenta Group - Bracknell, United Kingdom
Syngenta is one of the world's leading agriculture companies. Our ambition is to help safely feed the world while taking care of the planet. We aim to improve the sustainability, quality and safety of agriculture with world class science and innovative crop solutions. Our technologies enable millions of farmers around the world to make better use of limited agricultural resources. With 28,000 people in more than 90 countries we are working to transform how crops are grown.
Automatic Discovery of Multi-perspective Process Model using Reinforcement Learning
Sim, Sunghyun, Liu, Ling, Bae, Hyerim
Process mining is a methodology for the derivation and analysis of process models based on the event log. When process mining is employed to analyze business processes, the process discovery step, the conformance checking step, and the enhancements step are repeated. If a user wants to analyze a process from multiple perspectives (such as activity perspectives, originator perspectives, and time perspectives), the above procedure, inconveniently, has to be repeated over and over again. Although past studies involving process mining have applied detailed stepwise methodologies, no attempt has been made to incorporate and optimize multi-perspective process mining procedures. This paper contributes to developing a solution approach to this problem. First, we propose an automatic discovery framework of a multi-perspective process model based on deep Q-Learning. Our Dual Experience Replay with Experience Distribution (DERED) approach can automatically perform process model discovery steps, conformance check steps, and enhancements steps. Second, we propose a new method that further optimizes the experience replay (ER) method, one of the key algorithms of deep Q-learning, to improve the learning performance of reinforcement learning agents. Finally, we validate our approach using six real-world event datasets collected in port logistics, steel manufacturing, finance, IT, and government administration. We show that our DERED approach can provide users with multi-perspective, high-quality process models that can be employed more conveniently for multi-perspective process mining.
Energy-Motivated Equivariant Pretraining for 3D Molecular Graphs
Jiao, Rui, Han, Jiaqi, Huang, Wenbing, Rong, Yu, Liu, Yang
Pretraining molecular representation models without labels is fundamental to various applications. Conventional methods mainly process 2D molecular graphs and focus solely on 2D tasks, making their pretrained models incapable of characterizing 3D geometry and thus defective for downstream 3D tasks. In this work, we tackle 3D molecular pretraining in a complete and novel sense. In particular, we first propose to adopt an equivariant energy-based model as the backbone for pretraining, which enjoys the merits of fulfilling the symmetry of 3D space. Then we develop a node-level pretraining loss for force prediction, where we further exploit the Riemann-Gaussian distribution to ensure the loss to be E(3)-invariant, enabling more robustness. Moreover, a graph-level noise scale prediction task is also leveraged to further promote the eventual performance. We evaluate our model pretrained from a large-scale 3D dataset GEOM-QM9 on two challenging 3D benchmarks: MD17 and QM9. Experimental results demonstrate the efficacy of our method against current state-of-the-art pretraining approaches, and verify the validity of our design for each proposed component.
Balanced Semi-Supervised Generative Adversarial Network for Damage Assessment from Low-Data Imbalanced-Class Regime
Gao, Yuqing, Zhai, Pengyuan, Mosalam, Khalid M.
In recent years, applying deep learning (DL) to assess structural damages has gained growing popularity in vision-based structural health monitoring (SHM). However, both data deficiency and class-imbalance hinder the wide adoption of DL in practical applications of SHM. Common mitigation strategies include transfer learning, over-sampling, and under-sampling, yet these ad-hoc methods only provide limited performance boost that varies from one case to another. In this work, we introduce one variant of the Generative Adversarial Network (GAN), named the balanced semi-supervised GAN (BSS-GAN). It adopts the semi-supervised learning concept and applies balanced-batch sampling in training to resolve low-data and imbalanced-class problems. A series of computer experiments on concrete cracking and spalling classification were conducted under the low-data imbalanced-class regime with limited computing power. The results show that the BSS-GAN is able to achieve better damage detection in terms of recall and $F_\beta$ score than other conventional methods, indicating its state-of-the-art performance.
Design Space Exploration and Explanation via Conditional Variational Autoencoders in Meta-model-based Conceptual Design of Pedestrian Bridges
Balmer, Vera M., Kuhn, Sophia V., Bischof, Rafael, Salamanca, Luis, Kaufmann, Walter, Perez-Cruz, Fernando, Kraus, Michael A.
For conceptual design, engineers rely on conventional iterative (often manual) techniques. Emerging parametric models facilitate design space exploration based on quantifiable performance metrics, yet remain time-consuming and computationally expensive. Pure optimisation methods, however, ignore qualitative aspects (e.g. aesthetics or construction methods). This paper provides a performance-driven design exploration framework to augment the human designer through a Conditional Variational Autoencoder (CVAE), which serves as forward performance predictor for given design features as well as an inverse design feature predictor conditioned on a set of performance requests. The CVAE is trained on 18'000 synthetically generated instances of a pedestrian bridge in Switzerland. Sensitivity analysis is employed for explainability and informing designers about (i) relations of the model between features and/or performances and (ii) structural improvements under user-defined objectives. A case study proved our framework's potential to serve as a future co-pilot for conceptual design studies of pedestrian bridges and beyond.