comix
- North America > United States (0.04)
- Europe > Spain (0.04)
- Europe > Italy (0.04)
- Asia > Middle East > Jordan (0.04)
CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding
The comic domain is rapidly advancing with the development of single-page analysis and synthesis models. However, evaluation metrics and datasets lag behind, often limited to small-scale or single-style test sets. We introduce a novel benchmark, CoMix, designed to evaluate the multi-task capabilities of models in comic analysis. Unlike existing benchmarks that focus on isolated tasks such as object detection or text recognition, CoMix addresses a broader range of tasks including object detection, speaker identification, character re-identification, reading order, and multi-modal reasoning tasks like character naming and dialogue generation. Our benchmark comprises three existing datasets with expanded annotations to support multi-task evaluation. To mitigate the over-representation of manga-style data, we have incorporated a new dataset of carefully selected American comic-style books, thereby enriching the diversity of comic styles. CoMix is designed to assess pre-trained models in zero-shot and limited fine-tuning settings, probing their transfer capabilities across different comic styles and tasks. The validation split of the benchmark is publicly available for research purposes, and an evaluation server for the held-out test split is also provided. Comparative results between human performance and state-of-the-art models reveal a significant performance gap, highlighting substantial opportunities for advancements in comic understanding.
- North America > United States (0.14)
- Europe > Italy (0.14)
- Asia (0.14)
- Government (0.46)
- Media (0.46)
CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding
The comic domain is rapidly advancing with the development of single-page analysis and synthesis models. However, evaluation metrics and datasets lag behind, often limited to small-scale or single-style test sets. We introduce a novel benchmark, CoMix, designed to evaluate the multi-task capabilities of models in comic analysis. Unlike existing benchmarks that focus on isolated tasks such as object detection or text recognition, CoMix addresses a broader range of tasks including object detection, speaker identification, character re-identification, reading order, and multi-modal reasoning tasks like character naming and dialogue generation. Our benchmark comprises three existing datasets with expanded annotations to support multi-task evaluation.
COMIX: Compositional Explanations using Prototypes
Sivaprasad, Sarath, Kangin, Dmitry, Angelov, Plamen, Fritz, Mario
Aligning machine representations with human understanding is key to improving interpretability of machine learning (ML) models. When classifying a new image, humans often explain their decisions by decomposing the image into concepts and pointing to corresponding regions in familiar images. Current ML explanation techniques typically either trace decision-making processes to reference prototypes, generate attribution maps highlighting feature importance, or incorporate intermediate bottlenecks designed to align with human-interpretable concepts. The proposed method, named COMiX, classifies an image by decomposing it into regions based on learned concepts and tracing each region to corresponding ones in images from the training dataset, assuring that explanations fully represent the actual decision-making process. We dissect the test image into selected internal representations of a neural network to derive prototypical parts (primitives) and match them with the corresponding primitives derived from the training data. In a series of qualitative and quantitative experiments, we theoretically prove and demonstrate that our method, in contrast to post hoc analysis, provides fidelity of explanations and shows that the efficiency is competitive with other inherently interpretable architectures. Notably, it shows substantial improvements in fidelity and sparsity metrics, including 48.82% improvement in the C-insertion score on the ImageNet dataset over the best state-of-the-art baseline. Neural networks (NNs) have been successfully applied across various computer vision tasks, achieving notable results in safety-critical domains such as medical image classification (Huang et al., 2023), autonomous driving (Geiger et al., 2012), and robotics (Robinson et al., 2023) amongst others. However, explaining their decisions remains an ongoing research challenge (Samek et al., 2021). The two key factors in interpreting neural network decisions are: (1) representing the reasoning behind the prediction in human-understandable terms and (2) ensuring that the explanations accurately reflect the underlying computations of the neural network.
- North America > Canada > Newfoundland and Labrador > Newfoundland (0.04)
- Europe > Germany (0.04)
- Health & Medicine (0.88)
- Information Technology (0.66)
CoMIX: A Multi-agent Reinforcement Learning Training Architecture for Efficient Decentralized Coordination and Independent Decision Making
Minelli, Giovanni, Musolesi, Mirco
Robust coordination skills enable agents to operate cohesively in shared environments, together towards a common goal and, ideally, individually without hindering each other's progress. To this end, this paper presents Coordinated QMIX (CoMIX), a novel training framework for decentralized agents that enables emergent coordination through flexible policies, allowing at the same time independent decision-making at individual level. CoMIX models selfish and collaborative behavior as incremental steps in each agent's decision process. This allows agents to dynamically adapt their behavior to different situations balancing independence and collaboration. Experiments using a variety of simulation environments demonstrate that CoMIX outperforms baselines on collaborative tasks. The results validate our incremental policy approach as effective technique for improving coordination in multi-agent systems.
- North America > United States > Montana (0.04)
- Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
de Witt, Christian Schroeder, Peng, Bei, Kamienny, Pierre-Alexandre, Torr, Philip, Böhmer, Wendelin, Whiteson, Shimon
Deep multi-agent reinforcement learning (MARL) holds the promise of automating many real-world cooperative robotic manipulation and transportation tasks. Nevertheless, decentralised cooperative robotic control has received less attention from the deep reinforcement learning community, as compared to single-agent robotics and multi-agent games with discrete actions. To address this gap, this paper introduces Multi-Agent Mujoco, an easily extensible multi-agent benchmark suite for robotic control in continuous action spaces. The benchmark tasks are diverse and admit easily configurable partially observable settings. Inspired by the success of single-agent continuous value-based algorithms in robotic control, we also introduce COMIX, a novel extension to a common discrete action multi-agent $Q$-learning algorithm. We show that COMIX significantly outperforms state-of-the-art MADDPG on a partially observable variant of a popular particle environment and matches or surpasses it on Multi-Agent Mujoco. Thanks to this new benchmark suite and method, we can now pose an interesting question: what is the key to performance in such settings, the use of value-based methods instead of policy gradients, or the factorisation of the joint $Q$-function? To answer this question, we propose a second new method, FacMADDPG, which factors MADDPG's critic. Experimental results on Multi-Agent Mujoco suggest that factorisation is the key to performance.
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- North America > United States > Montana (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.04)
- Europe > Denmark (0.04)
Routine Design for Mechanical Engineering
The system described in this article is currently working in the field at the Sales Department of EKATO, one of the world's most successful manufacturers of industrial mixing machines. It was developed in close cooperation with the Fraunhofer Institute for Information and Data Processing (IITB) during a three-year period. Industrial mixing machines, better known as agitators, are widely used in industrial manufacturing. They are especially useful for the chemical and pharmaceutical industries, food production, and biotechnology. The basic structure of an industrial agitator is shown in figure 1.
Routine Design for Mechanical Engineering
Brinkop, Axel, Laudwein, Norbert, Maasen, Rudiger
COMIX (configuration of mixing machines) is a system that assists members of the EKATO Sales Department in designing a mixing machine that fulfills the requirements of a customer. It is used to help the engineer design the requested machine and prepare an offer that's to be submitted to the customer. comix integrates more traditional software techniques with explicit knowledge representation and constraint propagation. During the process of routine design, some design decisions have to be made with uncertainty. By including knowledge from process technology and company experience in the mechanical design, a sufficiently high degree of flexibility is achieved that the system can even assist in difficult design situations. The success of the system can be measured by the increase in the quantity and the quality of the submitted offers.
- Europe > Germany > Rhineland-Palatinate > Kaiserslautern (0.04)
- Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.04)
- North America > United States > California > San Mateo County > Menlo Park (0.04)
- Europe > Germany > North Rhine-Westphalia > Düsseldorf Region > Düsseldorf (0.04)