Goto

Collaborating Authors

 South America


Conformal and kNN Predictive Uncertainty Quantification Algorithms in Metric Spaces

arXiv.org Machine Learning

This paper introduces a framework for uncertainty quantification in regression models defined in metric spaces. Leveraging a newly defined notion of homoscedasticity, we develop a conformal prediction algorithm that offers finite-sample coverage guarantees and fast convergence rates of the oracle estimator. In heteroscedastic settings, we forgo these non-asymptotic guarantees to gain statistical efficiency, proposing a local $k$--nearest--neighbor method without conformal calibration that is adaptive to the geometry of each particular nonlinear space. Both procedures work with any regression algorithm and are scalable to large data sets, allowing practitioners to plug in their preferred models and incorporate domain expertise. We prove consistency for the proposed estimators under minimal conditions. Finally, we demonstrate the practical utility of our approach in personalized--medicine applications involving random response objects such as probability distributions and graph Laplacians.


Domain-Adaptive Small Language Models for Structured Tax Code Prediction

arXiv.org Artificial Intelligence

Every day, multinational firms process thousands of transactions, each of which must adhere to tax regulations that vary by jurisdiction and are often nuanced. The determination of product and service tax codes, such as HSN or SAC is a major use case in Tax compliance. An accurate determination of such codes is imperative to avoid any tax penalties. This paper proposes a domain-adaptive small language model (SLM) with an encoder-decoder architecture for the enhanced prediction of product and service tax codes. In this approach, we address the problem of predicting hierarchical tax code sequences using unstructured product and services data. We employ an SLM based upon encoder-decoder architecture as this enables sequential generation of tax codes to capture the hierarchical dependencies present within the tax codes. Our experiments demonstrate that encoder-decoder SLMs can be successfully applied to the sequential prediction of structured tax codes, a domain that remains comparatively unexplored in current NLP research. In this paper, we demonstrate the superior performance of the domain-adaptive encoder-decoder SLMs over flat classifiers when applied to the Harmonized System of Nomenclature (HSN), and achieve superior results compared to decoder-only and encoder-only architectures for structured sequence generation tasks. This approach can also be scaled to other government-mandated tax commodity codes, such as United Nations Standard Products and Services Codes (UNSPSC), or Brazil's Nomenclatura Comum do Mercosul (NCM).


England players racially abused during Argentina game

BBC News

England's players were racially abused during their second Test victory over Argentina in San Juan on 12 July. Team officials lodged a complaint to governing body World Rugby over the incident that occurred when the visitors' replacements were warming up in the first half. "While it is clear that an incident took place, we regret that the individuals responsible could not be identified," said World Rugby, adding their investigation included witness statements and video analysis. "Intense efforts were made to identify the small group of five or seven individuals responsible within a crowd of over 20,000 spectators," said Gabriel Travaglini, president of the Union Argentina de Rugby (UAR). "Unfortunately, despite an exhaustive search, it was not possible to identify the perpetrators. "We strongly condemn all acts of racism and stand in solidarity with the England rugby players who felt aggrieved." He added that the UAR would work with World Rugby to educate fans. There have been several recent high-profile cases of discriminatory behaviour in Argentine sport. In 2020, Pablo Matera and Guido Petti, both of whom played in the match in San Juan, were suspended from the team after racist remarks they had made on social media several years earlier were unearthed. In 2024, Chelsea footballer Enzo Fernandez apologised to team-mates after being filmed joining in with a chant that questioned the heritage of France's black and mixed race players. "Rugby completely condemns discriminatory behaviour of any kind," said World Rugby chairman Brett Robinson. "We offer our full support to the players involved and want them to know that rugby stands with them in opposing racism.


Conceptual and Design Principles for a Self-Referential Algorithm Mimicking Neuronal Assembly Functions

arXiv.org Artificial Intelligence

However, the epistemological approach differs from that of so-called "grounded cognition". We can summarise this difference as follows: while grounded cognition analyses the experience of a living system from the point of view of an observer, we adopt the point of view of the system itself, defined by the need to preserve the biological properties essential for its survival. Therefore, our proposal implies the idea that the system is self-referential, since it operates with the aim of being able to continue operating. The method is based on an algorithmic schema that we called Environment Generative Operator (EGO) and uses an object language developed for this purpose, that we called E-language. EGO simulates cognitive processes by manipulating E-language strings. Among all the feasible ones, an EGO model called "EGO-P" (Supplementary Material 2) was implemented and tested, achieving the expected objectives. Repositories 2 and 3, as all the others mentioned in the article, can be accessed via the corresponding link in the bibliography. E-language has various mathematical properties. Those useful for this work have been demonstrated and are available in Supplementary Material 1.


Livestream of RoboCup2025

AIHub

RoboCup2025 is currently taking place in Salvador, Brazil. With day one of the main competition complete, things are hotting up across the many different leagues. From soccer to rescue, from industrial to home scenarios, teams are putting their robots through their paces across a variety of tasks and matches. If you would like to catch up on the action from the first day, you can watch the recording of the livestream below. This includes coverage of the teams competing, interviews with participants and organisers, and insights into RoboCup and the various leagues.


Netflix uses generative AI in one of its shows for first time

The Guardian

Netflix has used artificial intelligence in one of its TV shows for the first time, in a move the streaming company's boss said would make films and programmes cheaper and of better quality. Ted Sarandos, a co-chief executive of Netflix, said the Argentinian science fiction series El Eternauta (The Eternaut) was the first it had made that involved using generative AI footage. "We remain convinced that AI represents an incredible opportunity to help creators make films and series better, not just cheaper," he told analysts on Thursday after Netflix reported its second-quarter results. He said the series, which follows survivors of a rapid and devastating toxic snowfall, involved Netflix and visual effects (VFX) artists using AI to show a building collapsing in Buenos Aires. "Using AI-powered tools, they were able to achieve an amazing result with remarkable speed and, in fact, that VFX sequence was completed 10 times faster than it could have been completed with traditional VFX tools and workflows," he said.


VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

arXiv.org Artificial Intelligence

Recent advancements in vision-language models (VLMs) have improved performance by increasing the number of visual tokens, which are often significantly longer than text tokens. However, we observe that most real-world scenarios do not require such an extensive number of visual tokens. While the performance drops significantly in a small subset of OCR-related tasks, models still perform accurately in most other general VQA tasks with only 1/4 resolution. Therefore, we propose to dynamically process distinct samples with different resolutions, and present a new paradigm for visual token compression, namely, VisionThink. It starts with a downsampled image and smartly decides whether it is sufficient for problem solving. Otherwise, the model could output a special token to request the higher-resolution image. Compared to existing Efficient VLM methods that compress tokens using fixed pruning ratios or thresholds, VisionThink autonomously decides whether to compress tokens case by case. As a result, it demonstrates strong fine-grained visual understanding capability on OCR-related tasks, and meanwhile saves substantial visual tokens on simpler tasks. We adopt reinforcement learning and propose the LLM-as-Judge strategy to successfully apply RL to general VQA tasks. Moreover, we carefully design a reward function and penalty mechanism to achieve a stable and reasonable image resize call ratio. Extensive experiments demonstrate the superiority, efficiency, and effectiveness of our method. Our code is available at https://github.com/dvlab-research/VisionThink.


Higher-Order Pattern Unification Modulo Similarity Relations

arXiv.org Artificial Intelligence

The combination of higher-order theories and fuzzy logic can be useful in decision-making tasks that involve reasoning across abstract functions and predicates, where exact matches are often rare or unnecessary. Developing efficient reasoning and computational techniques for such a combined formalism presents a significant challenge. In this paper, we adopt a more straightforward approach aiming at integrating two well-established and computationally well-behaved components: higher-order patterns on one side and fuzzy equivalences expressed through similarity relations based on minimum T-norm on the other. We propose a unification algorithm for higher-order patterns modulo these similarity relations and prove its termination, soundness, and completeness. This unification problem, like its crisp counterpart, is unitary. The algorithm computes a most general unifier with the highest degree of approximation when the given terms are unifiable.


Imbalanced Regression Pipeline Recommendation

arXiv.org Artificial Intelligence

Imbalanced problems are prevalent in various real-world scenarios and are extensively explored in classification tasks. However, they also present challenges for regression tasks due to the rarity of certain target values. A common alternative is to employ balancing algorithms in preprocessing to address dataset imbalance. However, due to the variety of resampling methods and learning models, determining the optimal solution requires testing many combinations. Furthermore, the learning model, dataset, and evaluation metric affect the best strategies. This work proposes the Meta-learning for Imbalanced Regression (Meta-IR) framework, which diverges from existing literature by training meta-classifiers to recommend the best pipeline composed of the resampling strategy and learning model per task in a zero-shot fashion. The meta-classifiers are trained using a set of meta-features to learn how to map the meta-features to the classes indicating the best pipeline. We propose two formulations: Independent and Chained. Independent trains the meta-classifiers to separately indicate the best learning algorithm and resampling strategy. Chained involves a sequential procedure where the output of one meta-classifier is used as input for another to model intrinsic relationship factors. The Chained scenario showed superior performance, suggesting a relationship between the learning algorithm and the resampling strategy per task. Compared with AutoML frameworks, Meta-IR obtained better results. Moreover, compared with baselines of six learning algorithms and six resampling algorithms plus no resampling, totaling 42 (6 X 7) configurations, Meta-IR outperformed all of them. The code, data, and further information of the experiments can be found on GitHub: https://github.com/JusciAvelino/Meta-IR.


Selective Quantization Tuning for ONNX Models

arXiv.org Artificial Intelligence

Quantization is a process that reduces the precision of deep neural network models to lower model size and computational demands, often at the cost of accuracy. However, fully quantized models may exhibit sub-optimal performance below acceptable levels and face deployment challenges on low-end hardware accelerators due to practical constraints. To address these issues, quantization can be selectively applied to only a subset of layers, but selecting which layers to exclude is non-trivial. To this direction, we propose TuneQn, a suite enabling selective quantization, deployment and execution of ONNX models across various CPU and GPU devices, combined with profiling and multi-objective optimization. TuneQn generates selectively quantized ONNX models, deploys them on different hardware, measures performance on metrics like accuracy and size, performs Pareto Front minimization to identify the best model candidate and visualizes the results. To demonstrate the effectiveness of TuneQn, we evaluated TuneQn on four ONNX models with two quantization settings across CPU and GPU devices. As a result, we demonstrated that our utility effectively performs selective quantization and tuning, selecting ONNX model candidates with up to a $54.14$% reduction in accuracy loss compared to the fully quantized model, and up to a $72.9$% model size reduction compared to the original model.