lisbon
Rewarding explainability in drug repurposing with knowledge graphs
Drug repurposing often starts as a hypothesis: a known compound might help treat a disease beyond its original indication. Knowledge graphs are a natural place to look for such hypotheses because they encode biomedical entities (drugs, genes, phenotypes, diseases) and their relations. In KG terms, that repurposing can be framed as a triple (). However, many link prediction methods trade away interpretability for raw accuracy, making it hard for scientists to see why a suggested drug should work. We argue that for AI to function as a reliable scientific tool, it must deliver scientifically grounded explanations, not just scores.
- Europe > Portugal > Lisbon > Lisbon (0.08)
- Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.05)
- Africa (0.05)
Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark
Cheng, Ziming, Xu, Binrui, Gong, Lisheng, Song, Zuhe, Zhou, Tianshuo, Zhong, Shiqi, Ren, Siyu, Chen, Mingxiang, Meng, Xiangchao, Zhang, Yuxin, Li, Yanlin, Ren, Lei, Chen, Wei, Huang, Zhiyuan, Zhan, Mingjie, Wang, Xiaojie, Feng, Fangxiang
With enhanced capabilities and widespread applications, Multimodal Large Language Models (MLLMs) are increasingly required to process and reason over multiple images simultaneously. However, existing MLLM benchmarks focus either on single-image visual reasoning or on multi-image understanding tasks with only final-answer evaluation, leaving the reasoning capabilities of MLLMs over multi-image inputs largely underexplored. To address this gap, we introduce the $\textbf{Multimodal Multi-image Reasoning Benchmark (MMRB)}$, the first benchmark designed to evaluate structured visual reasoning across multiple images. MMRB comprises $\textbf{92 sub-tasks}$ covering spatial, temporal, and semantic reasoning, with multi-solution, CoT-style annotations generated by GPT-4o and refined by human experts. A derivative subset is designed to evaluate multimodal reward models in multi-image scenarios. To support fast and scalable evaluation, we propose a sentence-level matching framework using open-source LLMs. Extensive baseline experiments on $\textbf{40 MLLMs}$, including 9 reasoning-specific models and 8 reward models, demonstrate that open-source MLLMs still lag significantly behind commercial MLLMs in multi-image reasoning tasks. Furthermore, current multimodal reward models are nearly incapable of handling multi-image reward ranking tasks.
- Europe > Portugal > Lisbon > Lisbon (0.06)
- Asia > China (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- Asia > Singapore (0.04)
The Hottest Startups in Lisbon in 2024
Two years ago, Jon Fath moved with his family to Portugal from the Netherlands with the sole purpose of launching a fintech startup there. "This country is brimming with talent and ambition," Fath says. "I thank Lisbon for welcoming me, along with so many other expats and entrepreneurs, so warmly." Indeed, it's no surprise that the European Commission named Lisbon as 2023's European Capital of Innovation, while the Financial Times, in partnership with Statista, ranked two Portuguese startup hubs in Europe's top ten startup hubs--including the Unicorn Factory Lisboa, which launched in 2022 and has already supported more than 820 startups and helped raise more than 1 billion ( 1.1 billion) . "Portugal offers unique advantages, such as its climate, safety, and cost of living, which make it an attractive choice over countries in central or northern Europe," says Nuno Pereira, CEO of Paynest.
- Europe > Portugal > Lisbon > Lisbon (1.00)
- North America > United States (0.29)
- Europe > Northern Europe (0.25)
- (7 more...)
- Banking & Finance (1.00)
- Government > Regional Government (0.68)
- Health & Medicine > Therapeutic Area (0.49)
Exploring the Impact of the Output Format on the Evaluation of Large Language Models for Code Translation
Macedo, Marcos, Tian, Yuan, Cogo, Filipe R., Adams, Bram
Code translation between programming languages is a long-existing and critical task in software engineering, facilitating the modernization of legacy systems, ensuring cross-platform compatibility, and enhancing software performance. With the recent advances in large language models (LLMs) and their applications to code translation, there is an increasing need for comprehensive evaluation of these models. In this study, we empirically analyze the generated outputs of eleven popular instruct-tuned LLMs with parameters ranging from 1B up to 46.7B on 3,820 translation pairs across five languages, including C, C++, Go, Java, and Python. Our analysis found that between 26.4% and 73.7% of code translations produced by our evaluated LLMs necessitate post-processing, as these translations often include a mix of code, quotes, and text rather than being purely source code. Overlooking the output format of these models can inadvertently lead to underestimation of their actual performance. This is particularly evident when evaluating them with execution-based metrics such as Computational Accuracy (CA). Our results demonstrate that a strategic combination of prompt engineering and regular expression can effectively extract the source code from the model generation output. In particular, our method can help eleven selected models achieve an average Code Extraction Success Rate (CSR) of 92.73%. Our findings shed light on and motivate future research to conduct more reliable benchmarks of LLMs for code translation.
- North America > Canada > Ontario > Kingston (0.14)
- Europe > Portugal > Lisbon > Lisbon (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- (2 more...)
Semantically Aligned Question and Code Generation for Automated Insight Generation
Singha, Ananya, Chopra, Bhavya, Khatry, Anirudh, Gulwani, Sumit, Henley, Austin Z., Le, Vu, Parnin, Chris, Singh, Mukul, Verbruggen, Gust
Automated insight generation is a common tactic for helping knowledge workers, such as data scientists, to quickly understand the potential value of new and unfamiliar data. Unfortunately, automated insights produced by large-language models can generate code that does not correctly correspond (or align) to the insight. In this paper, we leverage the semantic knowledge of large language models to generate targeted and insightful questions about data and the corresponding code to answer those questions. Then through an empirical study on data from Open-WikiTable, we show that embeddings can be effectively used for filtering out semantically unaligned pairs of question and code. Additionally, we found that generating questions and code together yields more diverse questions.
- Europe > Portugal > Lisbon > Lisbon (0.05)
- Asia > India (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- (6 more...)
Welcome Your New AI Teammate: On Safety Analysis by Leashing Large Language Models
Nouri, Ali, Cabrero-Daniel, Beatriz, Törner, Fredrik, Sivencrona, Hȧkan, Berger, Christian
DevOps is a necessity in many industries, including the development of Autonomous Vehicles. In those settings, there are iterative activities that reduce the speed of SafetyOps cycles. One of these activities is "Hazard Analysis & Risk Assessment" (HARA), which is an essential step to start the safety requirements specification. As a potential approach to increase the speed of this step in SafetyOps, we have delved into the capabilities of Large Language Models (LLMs). Our objective is to systematically assess their potential for application in the field of safety engineering. To that end, we propose a framework to support a higher degree of automation of HARA with LLMs. Despite our endeavors to automate as much of the process as possible, expert review remains crucial to ensure the validity and correctness of the analysis results, with necessary modifications made accordingly.
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.06)
- Europe > Portugal > Lisbon > Lisbon (0.06)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > Albania > Durrës County > Durrës (0.04)
- Automobiles & Trucks (0.48)
- Information Technology (0.35)
Translating between SQL Dialects for Cloud Migration
Zmigrod, Ran, Alamir, Salwa, Liu, Xiaomo
Migrations of systems from on-site premises to the cloud has been a fundamental endeavor by many industrial institutions. A crucial component of such cloud migrations is the transition of databases to be hosted online. In this work, we consider the difficulties of this migration for SQL databases. While SQL is one of the prominent methods for storing database procedures, there are a plethora of different SQL dialects (e.g., MySQL, Postgres, etc.) which can complicate migrations when the on-premise SQL dialect differs to the dialect hosted on the cloud. Tools exist by common cloud provides such as AWS and Azure to aid in translating between dialects in order to mitigate the majority of the difficulties. However, these tools do not successfully translate $100\%$ of the code. Consequently, software engineers must manually convert the remainder of the untranslated database. For large organizations, this task quickly becomes intractable and so more innovative solutions are required. We consider this challenge a novel yet vital industrial research problem for any large corporation that is considering cloud migrations. Furthermore, we introduce potential avenues of research to tackle this challenge that have yielded promising preliminary results.
- Europe > Portugal > Lisbon > Lisbon (0.06)
- Europe > United Kingdom > England > Greater London > London (0.05)
- Oceania > Australia > Queensland > Brisbane (0.04)
- (6 more...)
- Information Technology > Services (0.69)
- Banking & Finance (0.47)
Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning
Raymond, Christian, Chen, Qi, Xue, Bing, Zhang, Mengjie
In this paper, we develop upon the topic of loss function learning, an emergent meta-learning paradigm that aims to learn loss functions that significantly improve the performance of the models trained under them. Specifically, we propose a new meta-learning framework for task and model-agnostic loss function learning via a hybrid search approach. The framework first uses genetic programming to find a set of symbolic loss functions. Second, the set of learned loss functions is subsequently parameterized and optimized via unrolled differentiation. The versatility and performance of the proposed framework are empirically validated on a diverse set of supervised learning tasks. Results show that the learned loss functions bring improved convergence, sample efficiency, and inference performance on tabulated, computer vision, and natural language processing problems, using a variety of task-specific neural network architectures.
- Europe > Portugal > Lisbon > Lisbon (0.05)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- Oceania > New Zealand > North Island > Wellington Region > Wellington (0.04)
- (3 more...)
Towards Engineering Fair and Equitable Software Systems for Managing Low-Altitude Airspace Authorizations
Gohar, Usman, Hunter, Michael C., Marczak-Czajka, Agnieszka, Lutz, Robyn R., Cohen, Myra B., Cleland-Huang, Jane
Small Unmanned Aircraft Systems (sUAS) have gained widespread adoption across a diverse range of applications. This has introduced operational complexities within shared airspaces and an increase in reported incidents, raising safety concerns. In response, the U.S. Federal Aviation Administration (FAA) is developing a UAS Traffic Management (UTM) system to control access to airspace based on an sUAS's predicted ability to safely complete its mission. However, a fully automated system capable of swiftly approving or denying flight requests can be prone to bias and must consider safety, transparency, and fairness to diverse stakeholders. In this paper, we present an initial study that explores stakeholders' perspectives on factors that should be considered in an automated system. Results indicate flight characteristics and environmental conditions were perceived as most important but pilot and drone capabilities should also be considered. Further, several respondents indicated an aversion to any AI-supported automation, highlighting the need for full transparency in automated decision-making. Results provide a societal perspective on the challenges of automating UTM flight authorization decisions and help frame the ongoing design of a solution acceptable to the broader sUAS community.
- Europe > United Kingdom (0.14)
- Europe > Portugal > Lisbon > Lisbon (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- (11 more...)
- Research Report > New Finding (1.00)
- Questionnaire & Opinion Survey (1.00)
- Overview (1.00)
- Research Report > Experimental Study (0.68)
- Transportation > Infrastructure & Services (1.00)
- Transportation > Air (1.00)
- Aerospace & Defense > Aircraft (1.00)
- (3 more...)
C4Q: A Chatbot for Quantum
Aragonés-Soria, Yaiza, Oriol, Manuel
Quantum computing is a growing field that promises many real-world applications such as quantum cryptography or quantum finance. The number of people able to use quantum computing is however still very small. This limitation comes from the difficulty to understand the concepts and to know how to start coding. Therefore, there is a need for tools that can assist non-expert in overcoming this complexity. One possibility would be to use existing conversational agents. Unfortunately ChatGPT and other Large-Language Models produce inaccurate results. This article presents C4Q, a chatbot that answers accurately basic questions and guides users when trying to code quantum programs. Contrary to other approaches C4Q uses a pre-trained large language model only to discover and classify user requests. It then generates an accurate answer using an own engine. Thanks to this architectural design, C4Q's answers are always correct, and thus C4Q can become a support tool that makes quantum computing more available to non-experts.
- Europe > Portugal > Lisbon > Lisbon (0.05)
- North America > United States > New York > New York County > New York City (0.05)
- North America > Costa Rica > Heredia Province > Heredia (0.04)
- (4 more...)
- Education (0.68)
- Information Technology > Security & Privacy (0.34)