finder
FINDER: Feature Inference on Noisy Datasets using Eigenspace Residuals
Murphy, Trajan, Dogra, Akshunna S., Gu, Hanfeng, Meredith, Caleb, Kon, Mark, Castrillion-Candas, Julio Enrique
''Noisy'' datasets (regimes with low signal to noise ratios, small sample sizes, faulty data collection, etc) remain a key research frontier for classification methods with both theoretical and practical implications. We introduce FINDER, a rigorous framework for analyzing generic classification problems, with tailored algorithms for noisy datasets. FINDER incorporates fundamental stochastic analysis ideas into the feature learning and inference stages to optimally account for the randomness inherent to all empirical datasets. We construct ''stochastic features'' by first viewing empirical datasets as realizations from an underlying random field (without assumptions on its exact distribution) and then mapping them to appropriate Hilbert spaces. The Kosambi-Karhunen-Loéve expansion (KLE) breaks these stochastic features into computable irreducible components, which allow classification over noisy datasets via an eigen-decomposition: data from different classes resides in distinct regions, identified by analyzing the spectrum of the associated operators. We validate FINDER on several challenging, data-deficient scientific domains, producing state of the art breakthroughs in: (i) Alzheimer's Disease stage classification, (ii) Remote sensing detection of deforestation. We end with a discussion on when FINDER is expected to outperform existing methods, its failure modes, and other limitations.
- North America > United States > California (0.28)
- Europe > United Kingdom > England > Greater London > London (0.14)
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- (7 more...)
- Research Report > Experimental Study (0.48)
- Research Report > New Finding (0.46)
- Research Report > Promising Solution (0.34)
- Information Technology > Data Science (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.67)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)
Program of Thoughts for Financial Reasoning: Leveraging Dynamic In-Context Examples and Generative Retrieval
Khatuya, Subhendu, Naidu, Shashwat, Goyal, Pawan, Ganguly, Niloy
Despite continuous advancements in the capabilities of large language models (LLMs), numerical reasoning remains a challenging area. Techniques like chain-of-thought prompting, tree-of-thought prompting, and program-of-thought prompting guide LLMs through intermediate reasoning steps. Although in-context learning with few-shot prompting has improved performance, LLMs still lag behind state-of-the-art models on financial numerical reasoning datasets such as FinQA and ConvFinQA. In this work, we introduce FINDER, a novel two-step framework, to enhance LLMs' capabilities in financial numerical reasoning. The first step utilizes a generative retriever to extract relevant facts from unstructured data, including both text and tables. This is followed by context-aware Program of Thought prompting with dynamic selection of in-context examples. Our model FINDER achieves a new state-of-the-art performance on both the FinQA and ConvFinQA datasets, surpassing previous benchmarks with execution accuracy improvements of 5.98% and 4.05%, respectively.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > Thailand > Bangkok > Bangkok (0.04)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- (13 more...)
- Workflow (0.66)
- Research Report > Promising Solution (0.34)
A Omitted proofs of Section 3
H . Using these observations we bound the growth function, Π In this proof we consider the notation introduced above in Section B.1. The proof of Theorem 7 will occupy this section. H . (II) For each label null = 2,...,k, there is at most 1 index j [n ] such that h Consider a setting of an iterative game in which a "chooser" player picks a "hidden" We now consider a slight modification of the setting defined above. Next, we are ready to prove the main result, stated in Theorem 7. C.3 Proof of Theorem 7 Proof. (see Definition 1).
Learning a Continue-Thinking Token for Enhanced Test-Time Scaling
Ringel, Liran, Tolochinsky, Elad, Romano, Yaniv
Test-time scaling has emerged as an effective approach for improving language model performance by utilizing additional compute at inference time. Recent studies have shown that overriding end-of-thinking tokens (e.g., replacing "" with "Wait") can extend reasoning steps and improve accuracy. In this work, we explore whether a dedicated continue-thinking token can be learned to trigger extended reasoning. We augment a distilled version of DeepSeek-R1 with a single learned "<|continue-thinking|>" token, training only its embedding via reinforcement learning while keeping the model weights frozen. Our experiments show that this learned token achieves improved accuracy on standard math benchmarks compared to both the baseline model and a test-time scaling approach that uses a fixed token (e.g., "Wait") for budget forcing. In particular, we observe that in cases where the fixed-token approach enhances the base model's accuracy, our method achieves a markedly greater improvement. For example, on the GSM8K benchmark, the fixed-token approach yields a 1.3% absolute improvement in accuracy, whereas our learned-token method achieves a 4.2% improvement over the base model that does not use budget forcing.
- North America > United States > Texas (0.04)
- North America > United States > Alaska (0.04)
- Asia > Middle East > Israel (0.04)
Stochastic Quasi-Newton Optimization in Large Dimensions Including Deep Network Training
Suman, Uttam, Mamajiwala, Mariya, Saxena, Mukul, Tyagi, Ankit, Roy, Debasish
Our proposal is on a new stochastic optimizer for non-convex and possibly non-smooth objective functions typically defined over large dimensional design spaces. Towards this, we have tried to bridge noise-assisted global search and faster local convergence, the latter being the characteristic feature of a Newton-like search. Our specific scheme -- acronymed FINDER (Filtering Informed Newton-like and Derivative-free Evolutionary Recursion), exploits the nonlinear stochastic filtering equations to arrive at a derivative-free update that has resemblance with the Newton search employing the inverse Hessian of the objective function. Following certain simplifications of the update to enable a linear scaling with dimension and a few other enhancements, we apply FINDER to a range of problems, starting with some IEEE benchmark objective functions to a couple of archetypal data-driven problems in deep networks to certain cases of physics-informed deep networks. The performance of the new method vis-\'a-vis the well-known Adam and a few others bears evidence to its promise and potentialities for large dimensional optimization problems of practical interest.
- North America > United States (0.28)
- Asia > India (0.28)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
- Information Technology > Mathematics of Computing (0.93)
Find Everything: A General Vision Language Model Approach to Multi-Object Search
Choi, Daniel, Fung, Angus, Wang, Haitong, Tan, Aaron Hao
In various real-world robot applications, MOS describes the problem of locating multiple objects efficiently [1], in domains such as warehouse management [2, 3], construction inspection [4], or hospitality [5, 6, 7], and retail assistance [8, 9]. Existing MOS methods can be categorized into: 1) probabilistic planning (PP) [1, 10, 11, 12], and 2) deep reinforcement learning (DRL) methods [13, 14, 15, 16, 17, 18, 19, 20]. PP methods utilize Partially Observable Markov Decision Processes (POMDPs) to estimate belief states and plan actions under uncertainty in object locations, while DRL methods optimizes action selection using a reward function [21]. However, both approaches face challenges such as inefficient exploration due to limited semantic modeling between objects and scenes [18], and poor generlization caused by the sim-to-real gap [19]. Recently, Large Foundation Models (LFMs) such as vision-language models (VLMs) and large language models (LLMs) have been applied to single object search (SOS) tasks by using either: 1) VLMs (e.g., CLIP, BLIP, etc.) to generate scene-level embeddings that capture the semantic correlations between the robot's environment and the target object to guide the robot towards regions with high target object likelihood [19, 22, 23, 24, 25]; or, 2) VLMs/LLMs to generate scene captions that describe both the spatial layout and semantic details of the robot's environment which are then used to plan the robot's actions [26, 27, 28, 29, 30, 31, 32]. However, these SOS methods have limitations: 1) they cannot be directly applied to MOS, as they lack explicit mechanisms to track and reason about multiple objects simultaneously, and 2) scene-level embeddings are often noisy and coarse [33], which cannot be effectively applied in object-dense environments. In such cases, fine-grained, object-level embeddings are needed. In this paper, we introduce Finder, the first MOS approach that leverages VLMs to locate multiple target objects in various unknown environments.
- North America > Canada > Ontario > Toronto (0.14)
- Europe > Switzerland (0.04)
Toward Digitalization: A Secure Approach to Find a Missing Person Using Facial Recognition Technology
Ayon, Abid Faisal, Alam, S M Maksudul
Facial Recognition is a technique, based on machine learning technology that can recognize a human being analyzing his facial profile, and is applied in solving various types of realworld problems nowadays. In this paper, a common real-world problem, finding a missing person has been solved in a secure and effective way with the help of facial recognition technology. Although there exist a few works on solving the problem, the proposed work is unique with respect to its security, design, and feasibility. Impeding intruders in participating in the processes and giving importance to both finders and family members of a missing person are two of the major features of this work. The proofs of the works of our system in finding a missing person have been described in the result section of the paper. The advantages that our system provides over the other existing systems can be realized from the comparisons, described in the result summary section of the paper. The work is capable of providing a worthy solution to find a missing person on the digital platform.
- Africa > Zimbabwe (0.05)
- Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.04)
Robot dogs, tech bros and virtual Geisha girls: when SXSW came to Sydney
A simultaneously familiar and slightly terrifying robot dog wanders through the audience of a session at the Sydney edition of South by South West. On stage, the panellists opine about a future increasingly defined by artificial intelligence and automation. "It's going to get much, much more significant," says Ed Santow, the former human rights commissioner and current director of policy and governance at the UTS Human Technology Institute. "And for many people that will be a good thing, [but] for a lot of people it'll be really, really hard." The robot is creepy but its fan is as noisy as a ps4 so it's not sneaking up on anyone.
- Leisure & Entertainment (0.72)
- Banking & Finance (0.71)
- Media (0.51)
Astronomical source finding services for the CIRASA visual analytic platform
Riggi, S., Bordiu, C., Vitello, F., Tudisco, G., Sciacca, E., Magro, D., Sortino, R., Pino, C., Molinaro, M., Benedettini, M., Leurini, S., Bufano, F., Raciti, M., Becciani, U.
Innovative developments in data processing, archiving, analysis, and visualization are nowadays unavoidable to deal with the data deluge expected in next-generation facilities for radio astronomy, such as the Square Kilometre Array (SKA) and its precursors. In this context, the integration of source extraction and analysis algorithms into data visualization tools could significantly improve and speed up the cataloguing process of large area surveys, boosting astronomer productivity and shortening publication time. To this aim, we are developing a visual analytic platform (CIRASA) for advanced source finding and classification, integrating state-of-the-art tools, such as the CAESAR source finder, the ViaLactea Visual Analytic (VLVA) and Knowledge Base (VLKB). In this work, we present the project objectives and the platform architecture, focusing on the implemented source finding services.
- Europe > Italy > Friuli Venezia Giulia > Trieste Province > Trieste (0.04)
- Europe > Spain > Galicia > Madrid (0.04)
- Europe > Middle East > Malta > Eastern Region > Northern Harbour District > Msida (0.04)
- (2 more...)