Instructional Material
Minimal Sequent Calculus for Teaching First-Order Logic: Lessons Learned
We present MiniCalc, a web app for teaching first-order logic, based on a so-called minimal sequent calculus. We explain the sequent calculus in Section 2. More than 100 computer science students have used versions of MiniCalc in a course on automated reasoning in the period 2021-2024. The web app MiniCalc 1.0 has not yet been announced, but it is available here: https://proof.compute.dtu.dk/MiniCalc.zip Installation is easy: Just unpack MiniCalc.zip in a new directory and open index.html in a browser. MiniCalc displays the proof editor to the left and the result about the default example proof to the right. We explain the default example proof in Section 3. The files in the above zip are from 12 February 2024 and we are not aware of bugs as of 1 December 2024.
V-EfficientNets: Vector-Valued Efficiently Scaled Convolutional Neural Network Models
Neto, Guilherme Vieira, Valle, Marcos Eduardo
EfficientNet models are convolutional neural networks optimized for parameter allocation by jointly balancing network width, depth, and resolution. Renowned for their exceptional accuracy, these models have become a standard for image classification tasks across diverse computer vision benchmarks. While traditional neural networks learn correlations between feature channels during training, vector-valued neural networks inherently treat multidimensional data as coherent entities, taking for granted the inter-channel relationships. This paper introduces vector-valued EfficientNets (V-EfficientNets), a novel extension of EfficientNet designed to process arbitrary vector-valued data. The proposed models are evaluated on a medical image classification task, achieving an average accuracy of 99.46% on the ALL-IDB2 dataset for detecting acute lymphoblastic leukemia. V-EfficientNets demonstrate remarkable efficiency, significantly reducing parameters while outperforming state-of-the-art models, including the original EfficientNet. The source code is available at https://github.com/mevalle/v-nets.
GenAI in Entrepreneurship: a systematic review of generative artificial intelligence in entrepreneurship research: current issues and future directions
Kusetogullari, Anna, Kusetogullari, Huseyin, Andersson, Martin, Gorschek, Tony
Generative Artificial Intelligence (GenAI) and Large Language Models (LLMs) are recognized to have significant effects on industry and business dynamics, not least because of their impact on the preconditions for entrepreneurship. There is still a lack of knowledge of GenAI as a theme in entrepreneurship research. This paper presents a systematic literature review aimed at identifying and analyzing the evolving landscape of research on the effects of GenAI on entrepreneurship. We analyze 83 peer-reviewed articles obtained from leading academic databases: Web of Science and Scopus. Using natural language processing and unsupervised machine learning techniques with TF-IDF vectorization, Principal Component Analysis (PCA), and hierarchical clustering, five major thematic clusters are identified: (1) Digital Transformation and Behavioral Models, (2) GenAI-Enhanced Education and Learning Systems, (3) Sustainable Innovation and Strategic AI Impact, (4) Business Models and Market Trends, and (5) Data-Driven Technological Trends in Entrepreneurship. Based on the review, we discuss future research directions, gaps in the current literature, as well as ethical concerns raised in the literature. We highlight the need for more macro-level research on GenAI and LLMs as external enablers for entrepreneurship and for research on effective regulatory frameworks that facilitate business experimentation, innovation, and further technology development.
T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction
Peng, Kun, Tong, Chaodong, Cao, Cong, Peng, Hao, Li, Qian, Wu, Guanlin, Jiang, Lei, Liu, Yanbing, Yu, Philip S.
Aspect sentiment triplet extraction (ASTE) aims to extract triplets composed of aspect terms, opinion terms, and sentiment polarities from given sentences. The table tagging method is a popular approach to addressing this task, which encodes a sentence into a 2-dimensional table, allowing for the tagging of relations between any two words. Previous efforts have focused on designing various downstream relation learning modules to better capture interactions between tokens in the table, revealing that a stronger capability to capture relations can lead to greater improvements in the model. Motivated by this, we attempt to directly utilize transformer layers as downstream relation learning modules. Due to the powerful semantic modeling capability of transformers, it is foreseeable that this will lead to excellent improvement. However, owing to the quadratic relation between the length of the table and the length of the input sentence sequence, using transformers directly faces two challenges: overly long table sequences and unfair local attention interaction. To address these challenges, we propose a novel Table-Transformer (T-T) for the tagging-based ASTE method. Specifically, we introduce a stripe attention mechanism with a loop-shift strategy to tackle these challenges. The former modifies the global attention mechanism to only attend to a 2-dimensional local attention window, while the latter facilitates interaction between different attention windows. Extensive and comprehensive experiments demonstrate that the T-T, as a downstream relation learning module, achieves state-of-the-art performance with lower computational costs.
An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education
Sajja, Ramteja, Sermet, Yusuf, Demir, Ibrahim
Recent advances in AI have catalyzed the adoption of intelligent educational tools, yet many semantic retrieval systems remain ill-suited to the unique linguistic and structural characteristics of academic content. This study presents two open-source embedding models fine-tuned for educational question answering, particularly in the context of course syllabi. A synthetic dataset of 3,197 sentence pairs, spanning synonymous terminology, paraphrased questions, and implicit-explicit mappings, was constructed through a combination of manual curation and large language model (LLM)-assisted generation. Two training strategies were evaluated: (1) a baseline model fine-tuned using MultipleNegativesRankingLoss (MNRL), and (2) a dual-loss model that combines MNRL with CosineSimilarityLoss to improve both semantic ranking and similarity calibration. Evaluations were conducted on 28 university course syllabi using a fixed set of natural language questions categorized into course, faculty, and teaching assistant information. Results demonstrate that both fine-tuned models outperform strong open-source baselines, including all-MiniLM-L6-v2 and multi-qa-MiniLM-L6-cos-v1, and that the dual-loss model narrows the performance gap with high-performing proprietary embeddings such as OpenAI's text-embedding-3 series. This work contributes reusable, domain-aligned embedding models and provides a replicable framework for educational semantic retrieval, supporting downstream applications such as academic chatbots, retrieval-augmented generation (RAG) systems, and learning management system (LMS) integrations.
Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of Mind
Abrini, Mouad, Abend, Omri, Acklin, Dina, Admoni, Henny, Aichinger, Gregor, Alon, Nitay, Ashktorab, Zahra, Atreja, Ashish, Auron, Moises, Aufreiter, Alexander, Awasthi, Raghav, Banerjee, Soumya, Barnby, Joe M., Basappa, Rhea, Bergsmann, Severin, Bouneffouf, Djallel, Callaghan, Patrick, Cavazza, Marc, Chaminade, Thierry, Chernova, Sonia, Chetouan, Mohamed, Choudhury, Moumita, Cleeremans, Axel, Cywinski, Jacek B., Cuzzolin, Fabio, Deng, Hokin, Diamond, N'yoma, Di Pasquasio, Camilla, Dumas, Guillaume, van Duijn, Max, Dwarikanath, Mahapatra, Gao, Qingying, Goel, Ashok, Goldstein, Rebecca, Gombolay, Matthew, Gonzalez, Gabriel Enrique, Halilovic, Amar, Halmdienst, Tobias, Islam, Mahimul, Jara-Ettinger, Julian, Kastel, Natalie, Keydar, Renana, Khanna, Ashish K., Khoramshahi, Mahdi, Kim, JiHyun, Kim, MiHyeon, Kim, YoungBin, Krivic, Senka, Krasnytskyi, Nikita, Kumar, Arun, Kwon, JuneHyoung, Lee, Eunju, Lee, Shane, Lewis, Peter R., Li, Xue, Li, Yijiang, Lewandowski, Michal, Lloyd, Nathan, Luebbers, Matthew B., Luo, Dezhi, Lyu, Haiyun, Mahapatra, Dwarikanath, Maheshwari, Kamal, Mainali, Mallika, Mathur, Piyush, Mederitsch, Patrick, Miura, Shuwa, de Miranda, Manuel Preston, Mirsky, Reuth, Mishra, Shreya, Moorman, Nina, Morrison, Katelyn, Muchovej, John, Nessler, Bernhard, Nessler, Felix, Nguyen, Hieu Minh Jord, Ortego, Abby, Papay, Francis A., Pasquali, Antoine, Rahimi, Hamed, Raghu, Charumathi, Royka, Amanda, Sarkadi, Stefan, Scheuerman, Jaelle, Schmid, Simon, Schrater, Paul, Sen, Anik, Sheikhbahaee, Zahra, Shi, Ke, Simmons, Reid, Singh, Nishant, Smith, Mason O., van der Meulen, Ramira, Solaki, Anthia, Sun, Haoran, Szolga, Viktor, Taylor, Matthew E., Taylor, Travis, Van Waveren, Sanne, Vargas, Juan David, Verbrugge, Rineke, Wagner, Eitan, Weisz, Justin D., Wen, Ximing, Yeoh, William, Zhang, Wenlong, Zhao, Michelle, Zilberstein, Shlomo
The ability to attribute mental states--such as beliefs, intentions, desires, and emotions--to oneself and others, is essential for predicting behavior. Thus ToM principles are crucial to enable better interpretation and response to human actions and intentions as AI systems evolve towards greater interactivity. The purpose of this volume is to provide an open access and curated anthology for the ToM and AI research community. The first Theory of Mind for AI (ToM4AI) workshop took place on March 3, 2025, as part of the AAAI workshop series. It was an epic gathering of researchers from diverse fields, ranging from psychology, cognitive science, neuroscience, robotics, and AI, to explore the implications of ToM in developing advanced AI systems.
Machine Learning: a Lecture Note
This lecture note is intended to prepare early-year master's and PhD students in data science or a related discipline with foundational ideas in machine learning. It starts with basic ideas in modern machine learning with classification as a main target task. These basic ideas include loss formulation, backpropagation, stochastic gradient descent, generalization, model selection as well as fundamental blocks of artificial neural networks. Based on these basic ideas, the lecture note explores in depth the probablistic approach to unsupervised learning, covering directed latent variable models, product of experts, generative adversarial networks and autoregressive models. Finally, the note ends by covering a diverse set of further topics, such as reinforcement learning, ensemble methods and meta-learning. After reading this lecture note, a student should be ready to embark on studying and researching more advanced topics in machine learning and more broadly artificial intelligence.
A Tutorial on Discriminative Clustering and Mutual Information
Ohl, Louis, Mattei, Pierre-Alexandre, Precioso, Frédéric
To cluster data is to separate samples into distinctive groups that should ideally have some cohesive properties. Today, numerous clustering algorithms exist, and their differences lie essentially in what can be perceived as ``cohesive properties''. Therefore, hypotheses on the nature of clusters must be set: they can be either generative or discriminative. As the last decade witnessed the impressive growth of deep clustering methods that involve neural networks to handle high-dimensional data often in a discriminative manner; we concentrate mainly on the discriminative hypotheses. In this paper, our aim is to provide an accessible historical perspective on the evolution of discriminative clustering methods and notably how the nature of assumptions of the discriminative models changed over time: from decision boundaries to invariance critics. We notably highlight how mutual information has been a historical cornerstone of the progress of (deep) discriminative clustering methods. We also show some known limitations of mutual information and how discriminative clustering methods tried to circumvent those. We then discuss the challenges that discriminative clustering faces with respect to the selection of the number of clusters. Finally, we showcase these techniques using the dedicated Python package, GemClus, that we have developed for discriminative clustering.
Hierarchical Task Decomposition for Execution Monitoring and Error Recovery: Understanding the Rationale Behind Task Demonstrations
Willibald, Christoph, Lee, Dongheui
Multi-step manipulation tasks where robots interact with their environment and must apply process forces based on the perceived situation remain challenging to learn and prone to execution errors. Accurately simulating these tasks is also difficult. Hence, it is crucial for robust task performance to learn how to coordinate end-effector pose and applied force, monitor execution, and react to deviations. To address these challenges, we propose a learning approach that directly infers both low- and high-level task representations from user demonstrations on the real system. We developed an unsupervised task segmentation algorithm that combines intention recognition and feature clustering to infer the skills of a task. We leverage the inferred characteristic features of each skill in a novel unsupervised anomaly detection approach to identify deviations from the intended task execution. Together, these components form a comprehensive framework capable of incrementally learning task decisions and new behaviors as new situations arise. Compared to state-of-the-art learning techniques, our approach significantly reduces the required amount of training data and computational complexity while efficiently learning complex in-contact behaviors and recovery strategies. Our proposed task segmentation and anomaly detection approaches outperform state-of-the-art methods on force-based tasks evaluated on two different robotic systems.
CombiBench: Benchmarking LLM Capability for Combinatorial Mathematics
Liu, Junqi, Lin, Xiaohan, Bayer, Jonas, Dillies, Yael, Jiang, Weijie, Liang, Xiaodan, Soletskyi, Roman, Wang, Haiming, Xie, Yunzhou, Xiong, Beibei, Yang, Zhengfeng, Zhang, Jujian, Zhi, Lihong, Li, Jia, Liu, Zhengying
Neurosymbolic approaches integrating large language models with formal reasoning have recently achieved human-level performance on mathematics competition problems in algebra, geometry and number theory. In comparison, combinatorics remains a challenging domain, characterized by a lack of appropriate benchmarks and theorem libraries. To address this gap, we introduce CombiBench, a comprehensive benchmark comprising 100 combinatorial problems, each formalized in Lean~4 and paired with its corresponding informal statement. The problem set covers a wide spectrum of difficulty levels, ranging from middle school to IMO and university level, and span over ten combinatorial topics. CombiBench is suitable for testing IMO solving capabilities since it includes all IMO combinatorial problems since 2000 (except IMO 2004 P3 as its statement contain an images). Furthermore, we provide a comprehensive and standardized evaluation framework, dubbed Fine-Eval (for $\textbf{F}$ill-in-the-blank $\textbf{in}$ L$\textbf{e}$an Evaluation), for formal mathematics. It accommodates not only proof-based problems but also, for the first time, the evaluation of fill-in-the-blank questions. Using Fine-Eval as the evaluation method and Kimina Lean Server as the backend, we benchmark several LLMs on CombiBench and observe that their capabilities for formally solving combinatorial problems remain limited. Among all models tested (none of which has been trained for this particular task), Kimina-Prover attains the best results, solving 7 problems (out of 100) under both ``with solution'' and ``without solution'' scenarios. We open source the benchmark dataset alongside with the code of the proposed evaluation method at https://github.com/MoonshotAI/CombiBench/.