Goto

Collaborating Authors

 best combination


Metric Learning for Temporal Sequence Alignment

Neural Information Processing Systems

In this paper, we propose to learn a Mahalanobis distance to perform alignment of multivariate time series. The learning examples for this task are time series for which the true alignment is known. We cast the alignment problem as a structured prediction task, and propose realistic losses between alignments for which the optimization is tractable. We provide experiments on real data in the audio-toaudio context, where we show that the learning of a similarity measure leads to improvements in the performance of the alignment task. We also propose to use this metric learning framework to perform feature selection and, from basic audio features, build a combination of these with better alignment performance.


Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models

arXiv.org Artificial Intelligence

To efficiently select optimal dataset combinations for enhancing multi-task learning (MTL) performance in large language models, we proposed a novel framework that leverages a neural network to predict the best dataset combinations. The framework iteratively refines the selection, greatly improving efficiency, while being model-, dataset-, and domain-independent. Through experiments on 12 biomedical datasets across four tasks - named entity recognition, relation extraction, event extraction, and text classification-we demonstrate that our approach effectively identifies better combinations, even for tasks that may seem unpromising from a human perspective. This verifies that our framework provides a promising solution for maximizing MTL potential.


A Geometric Nash Approach in Tuning the Learning Rate in Q-Learning Algorithm

arXiv.org Artificial Intelligence

This paper proposes a geometric approach for estimating the α value in Q learning. We establish a systematic framework that optimizes the α parameter, thereby enhancing learning efficiency and stability. Our results show that there is a relationship between the learning rate and the angle between a vector T (total time steps in each episode of learning) and R (the reward vector for each episode). The concept of angular bisector between vectors T and R and Nash Equilibrium provide insight into estimating α such that the algorithm minimizes losses arising from explorationexploitation trade-off. Keywords: Q Learning, Reinforcement Learning, Nash Equilibrium, Learning Rate, α, Stability of Equilibrium 1 - Introduction Reinforcement Learning (RL) algorithms, particularly Q-learning, are pivotal in enabling agents to learn optimal strategies through interaction with environments.


Generation of Granular-Balls for Clustering Based on the Principle of Justifiable Granularity

arXiv.org Artificial Intelligence

Efficient and robust data clustering remains a challenging task in the field of data analysis. Recent efforts have explored the integration of granular-ball (GB) computing with clustering algorithms to address this challenge, yielding promising results. However, existing methods for generating GBs often rely on single indicators to measure GB quality and employ threshold-based or greedy strategies, potentially leading to GBs that do not accurately capture the underlying data distribution. To address these limitations, this article introduces a novel GB generation method. The originality of this method lies in leveraging the principle of justifiable granularity to measure the quality of a GB for clustering tasks. To be precise, we define the coverage and specificity of a GB and introduce a comprehensive measure for assessing GB quality. Utilizing this quality measure, the method incorporates a binary tree pruning-based strategy and an anomaly detection method to determine the best combination of sub-GBs for each GB and identify abnormal GBs, respectively. Compared to previous GB generation methods, the new method maximizes the overall quality of generated GBs while ensuring alignment with the data distribution, thereby enhancing the rationality of the generated GBs. Experimental results obtained from both synthetic and publicly available datasets underscore the effectiveness of the proposed GB generation method, showcasing improvements in clustering accuracy and normalized mutual information.


Using Combinatorial Optimization to Design a High quality LLM Solution

arXiv.org Artificial Intelligence

We introduce a novel LLM based solution design approach that utilizes combinatorial optimization and sampling. Specifically, a set of factors that influence the quality of the solution are identified. They typically include factors that represent prompt types, LLM inputs alternatives, and parameters governing the generation and design alternatives. Identifying the factors that govern the LLM solution quality enables the infusion of subject matter expert knowledge. Next, a set of interactions between the factors are defined and combinatorial optimization is used to create a small subset $P$ that ensures all desired interactions occur in $P$. Each element $p \in P$ is then developed into an appropriate benchmark. Applying the alternative solutions on each combination, $p \in P$ and evaluating the results facilitate the design of a high quality LLM solution pipeline. The approach is especially applicable when the design and evaluation of each benchmark in $P$ is time-consuming and involves manual steps and human evaluation. Given its efficiency the approach can also be used as a baseline to compare and validate an autoML approach that searches over the factors governing the solution.


Metric Learning for Temporal Sequence Alignment Damien Garreau Rémi Lajugie ENS Francis Bach

Neural Information Processing Systems

In this paper, we propose to learn a Mahalanobis distance to perform alignment of multivariate time series. The learning examples for this task are time series for which the true alignment is known. We cast the alignment problem as a structured prediction task, and propose realistic losses between alignments for which the optimization is tractable. We provide experiments on real data in the audio-toaudio context, where we show that the learning of a similarity measure leads to improvements in the performance of the alignment task. We also propose to use this metric learning framework to perform feature selection and, from basic audio features, build a combination of these with better alignment performance.


Learning and evolution: factors influencing an effective combination

arXiv.org Artificial Intelligence

The interplay between learning and evolution has been studied for decades, but it is still a very controversial topic. Despite the huge amount of work, to what extent the interaction between learning and evolution actually fosters the development of successful behaviors is still a matter of debate in the scientific community. Indeed, as it is well described in [1-2], there exist some controversial arguments about the effect of learning on evolution. Some studies revealed how learning accelerates evolution [3-15], while other works demonstrated that learning does not provide any advantage on the course of evolution [16-24]. As explained in [25], Evolution and learning (or phylogenetic and ontogenetic adaptation) are two forms of biological adaptation that differ in space and time. Evolution is a process of selective reproduction and substitution based on the existence of a population of individuals displaying variability at the genetic level.


Optimize Any Python, Swift, or Java Object with Reinforcement Learning

#artificialintelligence

Improve AI is a machine learning platform for making apps self-improving, meaning they optimize their own data structures and variables to improve revenue and conversions. With Improve AI v7.2, you can now optimize the variables of any Java, Swift, or Python object with reinforcement learning. The new optimize() method finds the best combination of variable values given current conditions. Optimized objects are created immediately, on the fly, with zero network latency. Improve AI can optimize any object or JSON-encodable dictionary in Swift, Java, or Python to find the best combination of variables given current conditions.


Big Data in soccer: Creating an xG model - Damavis Blog

#artificialintelligence

The ability to collect and process large amounts of data represents additional value for many companies in today's market. The world of sports has been no exception, starting with baseball with the emergence of SABRmetrics in the 1980s, through motor racing to sports such as basketball and soccer more recently. The creation of models and metrics through artificial intelligence allows sports fans to analyze the game from another perspective and, for their professionals, to gain a competitive advantage over their rivals. In the case of soccer, probably the most popular metric is the one known as expected goal (xG). The xG is intended to measure the probability that a shot will result in a goal, taking into account variables such as the position of the shot, the position of the goalkeeper or the part of the body with which the shot is taken. As it is a probability, it should take values between 0 and 1, so that for the clearest opportunities (for example, a shot inside the small area without a goalkeeper) it takes values close to 1, and for shots further away or with greater difficulty it tends to 0. This metric is very useful for coaching staffs and scouting teams to evaluate the finishing or chance-creating ability of different players.


Hyperparameter Tuning with Grid Search and Random Search

#artificialintelligence

Hyperparameters are parameters that are defined before training to specify how we want model training to happen. We have full control over hyperparameter settings and by doing that we control the learning process. For example in the random forest model n_estimators (number of decision trees we want to have) is a hyperparameter. It can be set to any integer value but of course, setting it to 10 or 1000 changes the learning process significantly. Parameters, on the other hand, are found during the training. We have no control over parameter values as they are the result of model training.