finch
FINCH: Financial Intelligence using Natural language for Contextualized SQL Handling
Singh, Avinash Kumar, Sarmah, Bhaskarjit, Pasquali, Stefano
Text-to-SQL, the task of translating natural language questions into SQL queries, has long been a central challenge in NLP. While progress has been significant, applying it to the financial domain remains especially difficult due to complex schema, domain-specific terminology, and high stakes of error. Despite this, there is no dedicated large-scale financial dataset to advance research, creating a critical gap. To address this, we introduce a curated financial dataset (FINCH) comprising 292 tables and 75,725 natural language-SQL pairs, enabling both fine-tuning and rigorous evaluation. Building on this resource, we benchmark reasoning models and language models of varying scales, providing a systematic analysis of their strengths and limitations in financial Text-to-SQL tasks. Finally, we propose a finance-oriented evaluation metric (FINCH Score) that captures nuances overlooked by existing measures, offering a more faithful assessment of model performance.
- North America > United States > New York (0.04)
- Europe > Italy > Tuscany > Florence (0.04)
- Asia > Middle East > Jordan (0.04)
- Asia > India > Telangana > Hyderabad (0.04)
The way a child plays is the way they live': how therapists are using video games to help vulnerable children
Oleksii Sukhorukov's son was 12 when the Russian invasion of Ukraine began. For months, the family existed in a state of trauma and disarray: Sukhorukov was forced to give up his work in the entertainment industry, which had included virtual reality and video games; they became isolated from friends and relatives. But amid the chaos, his boy had one outlet: Minecraft. Whatever was happening outside, he'd boot up Mojang's block-building video game and escape. "After 24 February 2022, I began to see the game in a completely different light," says Sukhorukov.
- Oceania > Australia (0.05)
- North America > United States > Virginia (0.05)
- North America > United States > Massachusetts (0.05)
- (4 more...)
Robin: A multi-agent system for automating scientific discovery
Ghareeb, Ali Essam, Chang, Benjamin, Mitchener, Ludovico, Yiu, Angela, Szostkiewicz, Caralyn J., Laurent, Jon M., Razzak, Muhammed T., White, Andrew D., Hinks, Michaela M., Rodriques, Samuel G.
Scientific discovery is driven by the iterative process of background research, hypothesis generation, experimentation, and data analysis. Despite recent advancements in applying artificial intelligence to scientific discovery, no system has yet automated all of these stages in a single workflow. Here, we introduce Robin, the first multi-agent system capable of fully automating the key intellectual steps of the scientific process. By integrating literature search agents with data analysis agents, Robin can generate hypotheses, propose experiments, interpret experimental results, and generate updated hypotheses, achieving a semi-autonomous approach to scientific discovery. By applying this system, we were able to identify a novel treatment for dry age-related macular degeneration (dAMD), the major cause of blindness in the developed world. Robin proposed enhancing retinal pigment epithelium phagocytosis as a therapeutic strategy, and identified and validated a promising therapeutic candidate, ripasudil. Ripasudil is a clinically-used rho kinase (ROCK) inhibitor that has never previously been proposed for treating dAMD. To elucidate the mechanism of ripasudil-induced upregulation of phagocytosis, Robin then proposed and analyzed a follow-up RNA-seq experiment, which revealed upregulation of ABCA1, a critical lipid efflux pump and possible novel target. All hypotheses, experimental plans, data analyses, and data figures in the main text of this report were produced by Robin. As the first AI system to autonomously discover and validate a novel therapeutic candidate within an iterative lab-in-the-loop framework, Robin establishes a new paradigm for AI-driven scientific discovery.
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)
- North America > United States > California > San Francisco County > San Francisco (0.04)
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
- (3 more...)
- Workflow (1.00)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.93)
- Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (1.00)
- Health & Medicine > Therapeutic Area > Oncology (1.00)
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- (2 more...)
FINCH: Locally Visualizing Higher-Order Feature Interactions in Black Box Models
Kleinau, Anna, Preim, Bernhard, Meuschke, Monique
In an era where black-box AI models are integral to decision-making across industries, robust methods for explaining these models are more critical than ever. While these models leverage complex feature interplay for accurate predictions, most explanation methods only assign relevance to individual features. There is a research gap in methods that effectively illustrate interactions between features, especially in visualizing higher-order interactions involving multiple features, which challenge conventional representation methods. To address this challenge in local explanations focused on individual instances, we employ a visual, subset-based approach to reveal relevant feature interactions. Our visual analytics tool FINCH uses coloring and highlighting techniques to create intuitive, human-centered visualizations, and provides additional views that enable users to calibrate their trust in the model and explanations. We demonstrate FINCH in multiple case studies, demonstrating its generalizability, and conducted an extensive human study with machine learning experts to highlight its helpfulness and usability. With this approach, FINCH allows users to visualize feature interactions involving any number of features locally.
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
Peng, Bo, Goldstein, Daniel, Anthony, Quentin, Albalak, Alon, Alcaide, Eric, Biderman, Stella, Cheah, Eugene, Du, Xingjian, Ferdinan, Teddy, Hou, Haowen, Kazienko, Przemysław, GV, Kranthi Kiran, Kocoń, Jan, Koptyra, Bartłomiej, Krishna, Satyapriya, McClelland, Ronald Jr., Muennighoff, Niklas, Obeid, Fares, Saito, Atsushi, Song, Guangyu, Tu, Haoqin, Woźniak, Stanisław, Zhang, Ruichong, Zhao, Bingchen, Zhao, Qihang, Zhou, Peng, Zhu, Jian, Zhu, Rui-Jie
We present Eagle (RWKV-5) and Finch (RWKV-6), sequence models improving upon the RWKV (RWKV-4) architecture. Our architectural design advancements include multi-headed matrix-valued states and a dynamic recurrence mechanism that improve expressivity while maintaining the inference efficiency characteristics of RNNs. We introduce a new multilingual corpus with 1.12 trillion tokens and a fast tokenizer based on greedy matching for enhanced multilinguality. We trained four Eagle models, ranging from 0.46 to 7.5 billion parameters, and two Finch models with 1.6 and 3.1 billion parameters and find that they achieve competitive performance across a wide variety of benchmarks. We release all our models on HuggingFace under the Apache 2.0 license. Models at: https://huggingface.co/RWKV Training code at: https://github.com/RWKV/RWKV-LM Inference code at: https://github.com/RWKV/ChatRWKV Time-parallel training code at: https://github.com/RWKV/RWKV-infctx-trainer
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- North America > United States > Ohio (0.04)
- (12 more...)
- Health & Medicine (0.67)
- Education (0.67)
- Media (0.67)
- Leisure & Entertainment (0.46)
Model Evaluation for Domain Identification of Unknown Classes in Open-World Recognition: A Proposal
Alfarisy, Gusti Ahmad Fanshuri, Malik, Owais Ahmed, Hong, Ong Wee
Open-World Recognition (OWR) is an emerging field that makes a machine learning model competent in rejecting the unknowns, managing them, and incrementally adding novel samples to the base knowledge. However, this broad objective is not practical for an agent that works on a specific task. Not all rejected samples will be used for learning continually in the future. Some novel images in the open environment may not belong to the domain of interest. Hence, identifying the unknown in the domain of interest is essential for a machine learning model to learn merely the important samples. In this study, we propose an evaluation protocol for estimating a model's capability in separating unknown in-domain (ID) and unknown out-of-domain (OOD). We evaluated using three approaches with an unknown domain and demonstrated the possibility of identifying the domain of interest using the pre-trained parameters through traditional transfer learning, Automated Machine Learning (AutoML), and Nearest Class Mean (NCM) classifier with First Integer Neighbor Clustering Hierarchy (FINCH). We experimented with five different domains: garbage, food, dogs, plants, and birds. The results show that all approaches can be used as an initial baseline yielding a good accuracy. In addition, a Balanced Accuracy (BACCU) score from a pre-trained model indicates a tendency to excel in one or more domains of interest. We observed that MobileNetV3 yielded the highest BACCU score for the garbage domain and surpassed complex models such as the transformer network. Meanwhile, our results also suggest that a strong representation in the pre-trained model is important for identifying unknown classes in the same domain. This study could open the bridge toward open-world recognition in domain-specific tasks where the relevancy of the unknown classes is vital.
- Asia > Brunei (0.14)
- North America > United States > New York > New York County > New York City (0.04)
- Oceania > New Zealand > South Island > Marlborough District > Blenheim (0.04)
- (9 more...)
'We have a bias problem': California bill addresses race and gender in venture capital funding
California would become the first state to require venture capital firms to disclose the race and gender of the founders of the companies they fund, under a bill currently awaiting governor Gavin Newsom's signature. The business community strongly opposes the legislation, characterizing it as an example of bureaucratic overreach. But civil rights groups and female entrepreneurs say it could go a long way toward equalizing opportunity in Silicon Valley, where startup capital overwhelmingly flows to white men. According to the business data firm PitchBook, companies founded by all-female teams accounted for just 2% of venture capital funding last year. Those led by Black women and Latinas received even less, 0.85%, according to a report from Project Diane, a research effort focused on female founders.
- North America > United States > California (0.99)
- North America > United States > New York > New York County > New York City (0.05)
- Europe > Germany (0.05)
- Law (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- Banking & Finance > Capital Markets (1.00)
Boosting Language Models Reasoning with Chain-of-Knowledge Prompting
Wang, Jianing, Sun, Qiushi, Chen, Nuo, Li, Xiang, Gao, Ming
Recently, Chain-of-Thought (CoT) prompting has delivered success on complex reasoning tasks, which aims at designing a simple prompt like ``Let's think step by step'' or multiple in-context exemplars with well-designed rationales to elicit Large Language Models (LLMs) to generate intermediate reasoning steps. However, the generated rationales often come with mistakes, making unfactual and unfaithful reasoning chains. To mitigate this brittleness, we propose a novel Chain-of-Knowledge (CoK) prompting, where we aim at eliciting LLMs to generate explicit pieces of knowledge evidence in the form of structure triple. This is inspired by our human behaviors, i.e., we can draw a mind map or knowledge map as the reasoning evidence in the brain before answering a complex question. Benefiting from CoK, we additionally introduce a F^2-Verification method to estimate the reliability of the reasoning chains in terms of factuality and faithfulness. For the unreliable response, the wrong evidence can be indicated to prompt the LLM to rethink. Extensive experiments demonstrate that our method can further improve the performance of commonsense, factual, symbolic, and arithmetic reasoning tasks.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > United States > Pennsylvania (0.05)
- Europe > United Kingdom (0.05)
- (17 more...)
Leveraging triplet loss for unsupervised action segmentation
Bueno-Benito, E., Tura, B., Dimiccoli, M.
In this paper, we propose a novel fully unsupervised framework that learns action representations suitable for the action segmentation task from the single input video itself, without requiring any training data. Our method is a deep metric learning approach rooted in a shallow network with a triplet loss operating on similarity distributions and a novel triplet selection strategy that effectively models temporal and semantic priors to discover actions in the new representational space. Under these circumstances, we successfully recover temporal boundaries in the learned action representations with higher quality compared with existing unsupervised approaches. The proposed method is evaluated on two widely used benchmark datasets for the action segmentation task and it achieves competitive performance by applying a generic clustering algorithm on the learned representations.
Dialpad Introduces AI-Powered Customer Intelligence
This week Dialpad announced the evolution of its "TrueCaaS" strategy, using the phrase "AI-Powered Customer Intelligence." The company had been using TrueCaaS to describe its single cloud software stack that can deliver Unified Communications as a Service (UCaaS) and Contact Center as a Service (CCaaS). One of the primary benefits of a converged platform is having a single data lake with which to perform analytics to make business decisions. Most communications vendors offer UCaaS or CCaaS and then partner for the other capability. Dialpad is one of the few that has built a single, cloud native stack to deliver both.