eel
LeanReasoner: Boosting Complex Logical Reasoning with Lean
Jiang, Dongwei, Fonseca, Marcio, Cohen, Shay B.
Large language models (LLMs) often struggle with complex logical reasoning due to logical inconsistencies and the inherent difficulty of such reasoning. We use Lean, a theorem proving framework, to address these challenges. By formalizing logical reasoning problems into theorems within Lean, we can solve them by proving or disproving the corresponding theorems. This method reduces the risk of logical inconsistencies with the help of Lean's symbolic solver. It also enhances our ability to treat complex reasoning tasks by using Lean's extensive library of theorem proofs. Our method achieves state-of-the-art performance on the FOLIO dataset and achieves performance near this level on ProofWriter. Notably, these results were accomplished by fine-tuning on fewer than 100 in-domain samples for each dataset.
- Asia > Middle East > Republic of Türkiye (0.08)
- North America > United States > California > San Mateo County > Menlo Park (0.04)
Synergistic Signals: Exploiting Co-Engagement and Semantic Links via Graph Neural Networks
Huang, Zijie, Li, Baolin, Asgharzadeh, Hafez, Cocos, Anne, Liu, Lingyi, Cox, Evan, Wise, Colby, Lamkhede, Sudarshan
Given a set of candidate entities (e.g. movie titles), the ability to identify similar entities is a core capability of many recommender systems. Most often this is achieved by collaborative filtering approaches, i.e. if users co-engage with a pair of entities frequently enough, the embeddings should be similar. However, relying on co-engagement data alone can result in lower-quality embeddings for new and unpopular entities. We study this problem in the context recommender systems at Netflix. We observe that there is abundant semantic information such as genre, content maturity level, themes, etc. that complements co-engagement signals and provides interpretability in similarity models. To learn entity similarities from both data sources holistically, we propose a novel graph-based approach called SemanticGNN. SemanticGNN models entities, semantic concepts, collaborative edges, and semantic edges within a large-scale knowledge graph and conducts representation learning over it. Our key technical contributions are twofold: (1) we develop a novel relation-aware attention graph neural network (GNN) to handle the imbalanced distribution of relation types in our graph; (2) to handle web-scale graph data that has millions of nodes and billions of edges, we develop a novel distributed graph training paradigm. The proposed model is successfully deployed within Netflix and empirical experiments indicate it yields up to 35% improvement in performance on similarity judgment tasks.
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- Asia > Singapore (0.05)
- North America > United States > Texas > Travis County > Austin (0.04)
- (3 more...)
- Media > Film (0.91)
- Leisure & Entertainment (0.91)
- Information Technology > Services (0.57)
How might JPL look for life on watery worlds? With the help of this slithering robot
Engineers at NASA's Jet Propulsion Laboratory are taking artificial intelligence to the next level -- by sending it into space disguised as a robotic snake. With the sun beating down on JPL's Mars Yard, the robot lifts its "head" off a glossy surface of faux ice to scan the world around it. It maps its surroundings, analyzes potential obstacles and chooses the safest path through a valley of fake boulders to the destination it has been instructed to reach. Once it has a plan in place, the 14-foot-long robot lowers its head, engages its 48 motors and slowly slithers forward. Its cautious movements are propelled by the clockwise or counterclockwise turns of the spiral connectors that link its 10 body segments, sending the cyborg in a specific direction.
- Government > Regional Government > North America Government > United States Government (0.70)
- Government > Space Agency (0.56)
EEL: Efficiently Encoding Lattices for Reranking
Singhal, Prasann, Xu, Jiacheng, Ye, Xi, Durrett, Greg
Standard decoding approaches for conditional text generation tasks typically search for an output hypothesis with high model probability, but this may not yield the best hypothesis according to human judgments of quality. Reranking to optimize for "downstream" metrics can better optimize for quality, but many metrics of interest are computed with pre-trained language models, which are slow to apply to large numbers of hypotheses. We explore an approach for reranking hypotheses by using Transformers to efficiently encode lattices of generated outputs, a method we call EEL. With a single Transformer pass over the entire lattice, we can approximately compute a contextualized representation of each token as if it were only part of a single hypothesis in isolation. We combine this approach with a new class of token-factored rerankers (TFRs) that allow for efficient extraction of high reranker-scoring hypotheses from the lattice. Empirically, our approach incurs minimal degradation error compared to the exponentially slower approach of encoding each hypothesis individually. When applying EEL with TFRs across three text generation tasks, our results show both substantial speedup compared to naive reranking and often better performance on downstream metrics than comparable approaches.
- Europe > United Kingdom (0.28)
- Europe > Italy > Tuscany > Florence (0.04)
- Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)
- (6 more...)
Sample-Specific Root Causal Inference with Latent Variables
Strobl, Eric V., Lasko, Thomas A.
Root causal analysis seeks to identify the set of initial perturbations that induce an unwanted outcome. In prior work, we defined sample-specific root causes of disease using exogenous error terms that predict a diagnosis in a structural equation model. We rigorously quantified predictivity using Shapley values. However, the associated algorithms for inferring root causes assume no latent confounding. We relax this assumption by permitting confounding among the predictors. We then introduce a corresponding procedure called Extract Errors with Latents (EEL) for recovering the error terms up to contamination by vertices on certain paths under the linear non-Gaussian acyclic model. EEL also identifies the smallest sets of dependent errors for fast computation of the Shapley values. The algorithm bypasses the hard problem of estimating the underlying causal graph in both cases. Experiments highlight the superior accuracy and robustness of EEL relative to its predecessors.
Evolutionary Ensemble Learning for Multivariate Time Series Prediction
Song, Hui, Qin, A. K., Salim, Flora D.
Multivariate time series (MTS) prediction plays a key role in many fields such as finance, energy and transport, where each individual time series corresponds to the data collected from a certain data source, so-called channel. A typical pipeline of building an MTS prediction model (PM) consists of selecting a subset of channels among all available ones, extracting features from the selected channels, and building a PM based on the extracted features, where each component involves certain optimization tasks, i.e., selection of channels, feature extraction (FE) methods, and PMs as well as configuration of the selected FE method and PM. Accordingly, pursuing the best prediction performance corresponds to optimizing the pipeline by solving all of its involved optimization problems. This is a non-trivial task due to the vastness of the solution space. Different from most of the existing works which target at optimizing certain components of the pipeline, we propose a novel evolutionary ensemble learning framework to optimize the entire pipeline in a holistic manner. In this framework, a specific pipeline is encoded as a candidate solution and a multi-objective evolutionary algorithm is applied under different population sizes to produce multiple Pareto optimal sets (POSs). Finally, selective ensemble learning is designed to choose the optimal subset of solutions from the POSs and combine them to yield final prediction by using greedy sequential selection and least square methods. We implement the proposed framework and evaluate our implementation on two real-world applications, i.e., electricity consumption prediction and air quality prediction. The performance comparison with state-of-the-art techniques demonstrates the superiority of the proposed approach.
- Asia > China (0.14)
- Oceania > Australia > Victoria > Melbourne (0.14)
- North America > United States (0.14)
- Energy > Oil & Gas (0.46)
- Energy > Power Industry (0.34)