Instructional Material
The Michigan Robotics Undergraduate Curriculum: Defining the Discipline of Robotics for Equity and Excellence
Jenkins, Odest Chadwicke, Grizzle, Jessy, Atkins, Ella, Stirling, Leia, Rouse, Elliott, Guzdial, Mark, Provost, Damen, Mann, Kimberly, Millunchick, Joanna
The Michigan Robotics Undergraduate Program owes a tremendous debt of gratitude to many people across our Robotics Institute and Robotics Department, the University of Michigan, the College of Engineering, the State of Michigan, and the greater national and global robotics community. Creating a first-of-a-kind robotics program is an incredibly bold and daring undertaking that would not be possible without the support, contributions, empathy, and insights from all corners of our amazing university (Go Blue!). While it would be impossible to recognize everyone who played important roles in realizing the Robotics Major, we would like to acknowledge some individuals who were especially critical to the formation of the program. We must first thank Dean Alec Gallimore and the College of Engineering for their visionary leadership throughout our evolution. Under the guidance and stewardship of Dean Gallimore, the Robotics Institute was able to grow, thrive, and prove it has the right stuff to become a viable academic department and undergraduate program. None of this would be possible without your confidence in us and willingness to innovate for the Common Good. The Robotics Institute owes its origins to Dawn Tilbury - the founding Director of the Robotics Institute (in 2014 under Dean David Munson) and now the inaugural Chair of the Robotics Department - and her foresight to envision what has become the home of Michigan Robotics - the Ford Motor Company Robotics Building. Nadine Sarter, Associate Dean Michael Wellman, and the Robotics Future Committee did tremendous work between 2018-20 to explore the potential and opportunities for Michigan to establish a department and undergraduate program in robotics. Their work identified the path for Michigan to earn distinguished leadership in robotics.
Training Spiking Neural Networks Using Lessons From Deep Learning
Eshraghian, Jason K., Ward, Max, Neftci, Emre, Wang, Xinxin, Lenz, Gregor, Dwivedi, Girish, Bennamoun, Mohammed, Jeong, Doo Seok, Lu, Wei D.
The brain is the perfect place to look for inspiration to develop more efficient neural networks. The inner workings of our synapses and neurons provide a glimpse at what the future of deep learning might look like. This paper serves as a tutorial and perspective showing how to apply the lessons learnt from several decades of research in deep learning, gradient descent, backpropagation and neuroscience to biologically plausible spiking neural neural networks. We also explore the delicate interplay between encoding data as spikes and the learning process; the challenges and solutions of applying gradient-based learning to spiking neural networks (SNNs); the subtle link between temporal backpropagation and spike timing dependent plasticity, and how deep learning might move towards biologically plausible online learning. Some ideas are well accepted and commonly used amongst the neuromorphic engineering community, while others are presented or justified for the first time here. The fields of deep learning and spiking neural networks evolve very rapidly. We endeavour to treat this document as a 'dynamic' manuscript that will continue to be updated as the common practices in training SNNs also change. A series of companion interactive tutorials complementary to this paper using our Python package, snnTorch, are also made available. See https://snntorch.readthedocs.io/en/latest/tutorials/index.html .
LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts
Tu, Shangqing, Zhang, Zheyuan, Yu, Jifan, Li, Chunyang, Zhang, Siyu, Yao, Zijun, Hou, Lei, Li, Juanzi
Teaching assistants have played essential roles in the long history of education. However, few MOOC platforms are providing human or virtual teaching assistants to support learning for massive online students due to the complexity of real-world online education scenarios and the lack of training data. In this paper, we present a virtual MOOC teaching assistant, LittleMu with minimum labeled training data, to provide question answering and chit-chat services. Consisting of two interactive modules of heterogeneous retrieval and language model prompting, LittleMu first integrates structural, semi- and unstructured knowledge sources to support accurate answers for a wide range of questions. Then, we design delicate demonstrations named "Chain of Teach" prompts to exploit the large-scale pre-trained model to handle complex uncollected questions. Except for question answering, we develop other educational services such as knowledge-grounded chit-chat. We test the system's performance via both offline evaluation and online deployment. Since May 2020, our LittleMu system has served over 80,000 users with over 300,000 queries from over 500 courses on XuetangX MOOC platform, which continuously contributes to a more convenient and fair education. Our code, services, and dataset will be available at https://github.com/THU-KEG/VTA.
Assessing Student Errors in Experimentation Using Artificial Intelligence and Large Language Models: A Comparative Study with Human Raters
Bewersdorff, Arne, Seßler, Kathrin, Baur, Armin, Kasneci, Enkelejda, Nerdel, Claudia
Identifying logical errors in complex, incomplete or even contradictory and overall heterogeneous data like students' experimentation protocols is challenging. Recognizing the limitations of current evaluation methods, we investigate the potential of Large Language Models (LLMs) for automatically identifying student errors and streamlining teacher assessments. Our aim is to provide a foundation for productive, personalized feedback. Using a dataset of 65 student protocols, an Artificial Intelligence (AI) system based on the GPT-3.5 and GPT-4 series was developed and tested against human raters. Our results indicate varying levels of accuracy in error detection between the AI system and human raters. The AI system can accurately identify many fundamental student errors, for instance, the AI system identifies when a student is focusing the hypothesis not on the dependent variable but solely on an expected observation (acc. = 0.90), when a student modifies the trials in an ongoing investigation (acc. = 1), and whether a student is conducting valid test trials (acc. = 0.82) reliably. The identification of other, usually more complex errors, like whether a student conducts a valid control trial (acc. = .60), poses a greater challenge. This research explores not only the utility of AI in educational settings, but also contributes to the understanding of the capabilities of LLMs in error detection in inquiry-based learning like experimentation.
Reports of the Workshops Held at the 2023 AAAI Conference on Artificial Intelligence
The Workshop Program of the Association for the Advancement of Artificial Intelligence's 37th Conference on Artificial Intelligence (AAAI-23) was held in Washington, DC, USA on February 13-14, 2023. There were 32 workshops in the program: AI for Agriculture and Food Systems, AI for Behavior Change, AI for Credible Elections: A Call to Action with Trusted AI, AI for Energy Innovation, AI for Web Advertising, AI to Accelerate Science and Engineering, AI4EDU: AI for Education, Artificial Intelligence and Diplomacy, Artificial Intelligence for Cyber Security (AICS), Artificial Intelligence for Social Good (AI4SG), Artificial Intelligence Safety (SafeAI), Creative AI Across Modalities, Deep Learning on Graphs: Methods and Applications (DLG-AAAI'23), DEFACTIFY: Multimodal Fact-Checking and Hate Speech Detection, Deployable AI (DAI), DL-Hardware Co-Design for AI Acceleration, Energy Efficient Training and Inference of Transformer Based Models, Graphs and More Complex Structures for Learning and Reasoning (GCLR), Health Intelligence (W3PHIAI-23), Knowledge-Augmented Methods for Natural Language Processing, Modelling Uncertainty in the Financial World (MUFin'23), Multi-Agent Path Finding, Multimodal AI for Financial Forecasting (Muffin), Multimodal AI for Financial Forecasting (Muffin), Privacy-Preserving Artificial Intelligence, Recent Trends in Human-Centric AI, Reinforcement Learning Ready for Production, Scientific Document Understanding, Systems Neuroscience Approach to General Intelligence, Uncertainty Reasoning and Quantification in Decision Making (UDM'23), User-Centric Artificial Intelligence for Assistance in At-Home Tasks, and When Machine Learning Meets Dynamical Systems: Theory and Applications. This report contains summaries of the workshops, which were submitted by some, but not all of the workshop chairs. An increasing world population, coupled with finite arable land, changing diets, and the growing expense of agricultural inputs, is poised to stretch our agricultural systems to their limits. By the end of this century, the earth's population is projected to increase by 45% with available arable land decreasing by 20% coupled with changes in what crops these arable lands can best support; this creates the urgent need to enhance agricultural productivity by 70% before 2050.
1st AI4GOV Training Workshop: Bias In AI
The 1st AI4GOV training workshop titled Bias in AI (focused on fundamentals) is the first organized training workshop with others to follow within the scope of the Horizon Europe project AI4GOV- Trusted AI for Transparent Public Governance fostering Democratic Values. Do you remember the Rick and Morty cartoon? Recently, I learned that this movie even provided inspiration for video game development. A game called elastic man game features Morty as the primary character. This is the official website of the game: https://elastic-man.com
Optical Script Identification for multi-lingual Indic-script
Poddar, Sidhantha, Gupta, Rohan
Script identification and text recognition are some of the major domains in the application of Artificial Intelligence. In this era of digitalization, the use of digital note-taking has become a common practice. Still, conventional methods of using pen and paper is a prominent way of writing. This leads to the classification of scripts based on the method they are obtained. A survey on the current methodologies and state-of-art methods used for processing and identification would prove beneficial for researchers. The aim of this article is to discuss the advancement in the techniques for script pre-processing and text recognition. In India there are twelve prominent Indic scripts, unlike the English language, these scripts have layers of characteristics. Complex characteristics such as similarity in text shape make them difficult to recognize and analyze, thus this requires advance preprocessing methods for their accurate recognition. A sincere attempt is made in this survey to provide a comparison between all algorithms. We hope that this survey would provide insight to a researcher working not only on Indic scripts but also other languages.
Learning (With) Distributed Optimization
A, Aadharsh Aadhithya, S, Abinesh, J, Akshaya, M, Jayanth, Radhakrishnan, Vishnu, V, Sowmya, P, Soman K.
This paper provides an overview of the historical progression of distributed optimization techniques, tracing their development from early duality-based methods pioneered by Dantzig, Wolfe, and Benders in the 1960s to the emergence of the Augmented Lagrangian Alternating Direction Inexact Newton (ALADIN) algorithm. The initial focus on Lagrangian relaxation for convex problems and decomposition strategies led to the refinement of methods like the Alternating Direction Method of Multipliers (ADMM). The resurgence of interest in distributed optimization in the late 2000s, particularly in machine learning and imaging, demonstrated ADMM's practical efficacy and its unifying potential. This overview also highlights the emergence of the proximal center method and its applications in diverse domains. Furthermore, the paper underscores the distinctive features of ALADIN, which offers convergence guarantees for non-convex scenarios without introducing auxiliary variables, differentiating it from traditional augmentation techniques. In essence, this work encapsulates the historical trajectory of distributed optimization and underscores the promising prospects of ALADIN in addressing non-convex optimization challenges.
Adaptive Taxonomy Learning and Historical Patterns Modelling for Patent Classification
Zou, Tao, Yu, Le, Sun, Leilei, Du, Bowen, Wang, Deqing, Zhuang, Fuzhen
Patent classification aims to assign multiple International Patent Classification (IPC) codes to a given patent. Recent methods for automatically classifying patents mainly focus on analyzing the text descriptions of patents. However, apart from the texts, each patent is also associated with some assignees, and the knowledge of their applied patents is often valuable for classification. Furthermore, the hierarchical taxonomy formulated by the IPC system provides important contextual information and enables models to leverage the correlations between IPC codes for more accurate classification. However, existing methods fail to incorporate the above aspects. In this paper, we propose an integrated framework that comprehensively considers the information on patents for patent classification. To be specific, we first present an IPC codes correlations learning module to derive their semantic representations via adaptively passing and aggregating messages within the same level and across different levels along the hierarchical taxonomy. Moreover, we design a historical application patterns learning component to incorporate the corresponding assignee's previous patents by a dual channel aggregation mechanism. Finally, we combine the contextual information of patent texts that contains the semantics of IPC codes, and assignees' sequential preferences to make predictions. Experiments on real-world datasets demonstrate the superiority of our approach over the existing methods. Besides, we present the model's ability to capture the temporal patterns of assignees and the semantic dependencies among IPC codes.
Developing Effective Educational Chatbots with ChatGPT prompts: Insights from Preliminary Tests in a Case Study on Social Media Literacy (with appendix)
Koyuturk, Cansu, Yavari, Mona, Theophilou, Emily, Bursic, Sathya, Donabauer, Gregor, Telari, Alessia, Testa, Alessia, Boiano, Raffaele, Gabbiadini, Alessandro, Hernandez-Leo, Davinia, Ruskov, Martin, Ognibene, Dimitri
Educational chatbots come with a promise of interactive and personalized learning experiences, yet their development has been limited by the restricted free interaction capabilities of available platforms and the difficulty of encoding knowledge in a suitable format. Recent advances in language learning models with zero-shot learning capabilities, such as ChatGPT, suggest a new possibility for developing educational chatbots using a prompt-based approach. We present a case study with a simple system that enables mixed-turn chatbot interactions and discuss the insights and preliminary guidelines obtained from initial tests. We examine ChatGPT's ability to pursue multiple interconnected learning objectives, adapt the educational activity to users' characteristics, such as culture, age, and level of education, and its ability to use diverse educational strategies and conversational styles. Although the results are encouraging, challenges are posed by the limited history maintained for the conversation and the highly structured form of responses by ChatGPT, as well as their variability, which can lead to an unexpected switch of the chatbot's role from a teacher to a therapist. We provide some initial guidelines to address these issues and to facilitate the development of effective educational chatbots.