AITopics

2308.06905

Country:

North America > United States > Michigan (1.00)
North America > United States > Pennsylvania (0.04)
North America > United States > Virginia (0.04)
(4 more...)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Education > Educational Setting > Higher Education (1.00)
Education > Curriculum (1.00)
Automobiles & Trucks (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Robots > Locomotion (0.68)

arXiv.org Artificial IntelligenceAug-13-2023

Training Spiking Neural Networks Using Lessons From Deep Learning

Eshraghian, Jason K., Ward, Max, Neftci, Emre, Wang, Xinxin, Lenz, Gregor, Dwivedi, Girish, Bennamoun, Mohammed, Jeong, Doo Seok, Lu, Wei D.

The brain is the perfect place to look for inspiration to develop more efficient neural networks. The inner workings of our synapses and neurons provide a glimpse at what the future of deep learning might look like. This paper serves as a tutorial and perspective showing how to apply the lessons learnt from several decades of research in deep learning, gradient descent, backpropagation and neuroscience to biologically plausible spiking neural neural networks. We also explore the delicate interplay between encoding data as spikes and the learning process; the challenges and solutions of applying gradient-based learning to spiking neural networks (SNNs); the subtle link between temporal backpropagation and spike timing dependent plasticity, and how deep learning might move towards biologically plausible online learning. Some ideas are well accepted and commonly used amongst the neuromorphic engineering community, while others are presented or justified for the first time here. The fields of deep learning and spiking neural networks evolve very rapidly. We endeavour to treat this document as a 'dynamic' manuscript that will continue to be updated as the common practices in training SNNs also change. A series of companion interactive tutorials complementary to this paper using our Python package, snnTorch, are also made available. See https://snntorch.readthedocs.io/en/latest/tutorials/index.html .

artificial intelligence, machine learning, neuron, (16 more...)

2109.12894

Country:

North America > Cuba (0.04)
North America > United States > Michigan (0.04)
North America > United States > New York (0.04)
(2 more...)

Genre:

Instructional Material > Course Syllabus & Notes (0.85)
Research Report (0.81)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Education > Educational Setting > Online (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceAug-11-2023

LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts

Tu, Shangqing, Zhang, Zheyuan, Yu, Jifan, Li, Chunyang, Zhang, Siyu, Yao, Zijun, Hou, Lei, Li, Juanzi

Teaching assistants have played essential roles in the long history of education. However, few MOOC platforms are providing human or virtual teaching assistants to support learning for massive online students due to the complexity of real-world online education scenarios and the lack of training data. In this paper, we present a virtual MOOC teaching assistant, LittleMu with minimum labeled training data, to provide question answering and chit-chat services. Consisting of two interactive modules of heterogeneous retrieval and language model prompting, LittleMu first integrates structural, semi- and unstructured knowledge sources to support accurate answers for a wide range of questions. Then, we design delicate demonstrations named "Chain of Teach" prompts to exploit the large-scale pre-trained model to handle complex uncollected questions. Except for question answering, we develop other educational services such as knowledge-grounded chit-chat. We test the system's performance via both offline evaluation and online deployment. Since May 2020, our LittleMu system has served over 80,000 users with over 300,000 queries from over 500 courses on XuetangX MOOC platform, which continuously contributes to a more convenient and fair education. Our code, services, and dataset will be available at https://github.com/THU-KEG/VTA.

knowledge, littlemu, query, (12 more...)

2308.05935

Country:

Europe > United Kingdom > England > West Midlands > Birmingham (0.05)
North America > United States > New York > New York County > New York City (0.05)
Asia > China > Beijing > Beijing (0.04)
(7 more...)

Genre:

Instructional Material > Online (1.00)
Instructional Material > Course Syllabus & Notes (0.93)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Bewersdorff, Arne, Seßler, Kathrin, Baur, Armin, Kasneci, Enkelejda, Nerdel, Claudia

Assessing Student Errors in Experimentation Using Artificial Intelligence and Large Language Models: A Comparative Study with Human Raters

arXiv.org Artificial IntelligenceAug-11-2023

Identifying logical errors in complex, incomplete or even contradictory and overall heterogeneous data like students' experimentation protocols is challenging. Recognizing the limitations of current evaluation methods, we investigate the potential of Large Language Models (LLMs) for automatically identifying student errors and streamlining teacher assessments. Our aim is to provide a foundation for productive, personalized feedback. Using a dataset of 65 student protocols, an Artificial Intelligence (AI) system based on the GPT-3.5 and GPT-4 series was developed and tested against human raters. Our results indicate varying levels of accuracy in error detection between the AI system and human raters. The AI system can accurately identify many fundamental student errors, for instance, the AI system identifies when a student is focusing the hypothesis not on the dependent variable but solely on an expected observation (acc. = 0.90), when a student modifies the trials in an ongoing investigation (acc. = 1), and whether a student is conducting valid test trials (acc. = 0.82) reliably. The identification of other, usually more complex errors, like whether a student conducts a valid control trial (acc. = .60), poses a greater challenge. This research explores not only the utility of AI in educational settings, but also contributes to the understanding of the capabilities of LLMs in error detection in inquiry-based learning like experimentation.

ai system, large language model, machine learning, (19 more...)

2308.06088

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
South America > Uruguay > Maldonado > Maldonado (0.04)
North America > United States (0.04)
(4 more...)

Genre:

Instructional Material (1.00)
Research Report > New Finding (0.89)

Industry:

Education > Educational Setting > K-12 Education (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Interactive AI MagazineAug-10-2023, 13:34:57 GMT

Reports of the Workshops Held at the 2023 AAAI Conference on Artificial Intelligence

The Workshop Program of the Association for the Advancement of Artificial Intelligence's 37th Conference on Artificial Intelligence (AAAI-23) was held in Washington, DC, USA on February 13-14, 2023. There were 32 workshops in the program: AI for Agriculture and Food Systems, AI for Behavior Change, AI for Credible Elections: A Call to Action with Trusted AI, AI for Energy Innovation, AI for Web Advertising, AI to Accelerate Science and Engineering, AI4EDU: AI for Education, Artificial Intelligence and Diplomacy, Artificial Intelligence for Cyber Security (AICS), Artificial Intelligence for Social Good (AI4SG), Artificial Intelligence Safety (SafeAI), Creative AI Across Modalities, Deep Learning on Graphs: Methods and Applications (DLG-AAAI'23), DEFACTIFY: Multimodal Fact-Checking and Hate Speech Detection, Deployable AI (DAI), DL-Hardware Co-Design for AI Acceleration, Energy Efficient Training and Inference of Transformer Based Models, Graphs and More Complex Structures for Learning and Reasoning (GCLR), Health Intelligence (W3PHIAI-23), Knowledge-Augmented Methods for Natural Language Processing, Modelling Uncertainty in the Financial World (MUFin'23), Multi-Agent Path Finding, Multimodal AI for Financial Forecasting (Muffin), Multimodal AI for Financial Forecasting (Muffin), Privacy-Preserving Artificial Intelligence, Recent Trends in Human-Centric AI, Reinforcement Learning Ready for Production, Scientific Document Understanding, Systems Neuroscience Approach to General Intelligence, Uncertainty Reasoning and Quantification in Decision Making (UDM'23), User-Centric Artificial Intelligence for Assistance in At-Home Tasks, and When Machine Learning Meets Dynamical Systems: Theory and Applications. This report contains summaries of the workshops, which were submitted by some, but not all of the workshop chairs. An increasing world population, coupled with finite arable land, changing diets, and the growing expense of agricultural inputs, is poised to stretch our agricultural systems to their limits. By the end of this century, the earth's population is projected to increase by 45% with available arable land decreasing by 20% coupled with changes in what crops these arable lands can best support; this creates the urgent need to enhance agricultural productivity by 70% before 2050.

artificial intelligence, behavioral intervention, intervention, (13 more...)

Interactive AI Magazine

Country: North America > United States > District of Columbia > Washington (0.25)

Genre:

Instructional Material (0.57)
Research Report (0.54)

Industry:

Information Technology (1.00)
Food & Agriculture > Agriculture (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.56)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.56)

VideoLectures.NETAug-10-2023, 04:43:42 GMT

1st AI4GOV Training Workshop: Bias In AI

The 1st AI4GOV training workshop titled Bias in AI (focused on fundamentals) is the first organized training workshop with others to follow within the scope of the Horizon Europe project AI4GOV- Trusted AI for Transparent Public Governance fostering Democratic Values. Do you remember the Rick and Morty cartoon? Recently, I learned that this movie even provided inspiration for video game development. A game called elastic man game features Morty as the primary character. This is the official website of the game: https://elastic-man.com

ai4gov training workshop

VideoLectures.NET

Country: Europe > Slovenia > Central Slovenia > Municipality of Ljubljana > Ljubljana (0.40)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (0.77)

Technology: Information Technology > Artificial Intelligence (1.00)

Poddar, Sidhantha, Gupta, Rohan

Optical Script Identification for multi-lingual Indic-script

Script identification and text recognition are some of the major domains in the application of Artificial Intelligence. In this era of digitalization, the use of digital note-taking has become a common practice. Still, conventional methods of using pen and paper is a prominent way of writing. This leads to the classification of scripts based on the method they are obtained. A survey on the current methodologies and state-of-art methods used for processing and identification would prove beneficial for researchers. The aim of this article is to discuss the advancement in the techniques for script pre-processing and text recognition. In India there are twelve prominent Indic scripts, unlike the English language, these scripts have layers of characteristics. Complex characteristics such as similarity in text shape make them difficult to recognize and analyze, thus this requires advance preprocessing methods for their accurate recognition. A sincere attempt is made in this survey to provide a comparison between all algorithms. We hope that this survey would provide insight to a researcher working not only on Indic scripts but also other languages.

evolutionary algorithm, machine learning, pattern recognition, (20 more...)

2308.0578

Country:

Asia > India > West Bengal > Kolkata (0.04)
Asia > Singapore (0.04)
Asia > India > Tamil Nadu > Vellore (0.04)
(3 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)
Instructional Material > Online (0.34)
Instructional Material > Course Syllabus & Notes (0.34)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(5 more...)

Learning (With) Distributed Optimization

A, Aadharsh Aadhithya, S, Abinesh, J, Akshaya, M, Jayanth, Radhakrishnan, Vishnu, V, Sowmya, P, Soman K.

This paper provides an overview of the historical progression of distributed optimization techniques, tracing their development from early duality-based methods pioneered by Dantzig, Wolfe, and Benders in the 1960s to the emergence of the Augmented Lagrangian Alternating Direction Inexact Newton (ALADIN) algorithm. The initial focus on Lagrangian relaxation for convex problems and decomposition strategies led to the refinement of methods like the Alternating Direction Method of Multipliers (ADMM). The resurgence of interest in distributed optimization in the late 2000s, particularly in machine learning and imaging, demonstrated ADMM's practical efficacy and its unifying potential. This overview also highlights the emergence of the proximal center method and its applications in diverse domains. Furthermore, the paper underscores the distinctive features of ALADIN, which offers convergence guarantees for non-convex scenarios without introducing auxiliary variables, differentiating it from traditional augmentation techniques. In essence, this work encapsulates the historical trajectory of distributed optimization and underscores the promising prospects of ALADIN in addressing non-convex optimization challenges.

artificial intelligence, machine learning, optimization problem, (15 more...)

2308.05548

Genre:

Instructional Material (0.93)
Overview (0.73)
Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Adaptive Taxonomy Learning and Historical Patterns Modelling for Patent Classification

Zou, Tao, Yu, Le, Sun, Leilei, Du, Bowen, Wang, Deqing, Zhuang, Fuzhen

Patent classification aims to assign multiple International Patent Classification (IPC) codes to a given patent. Recent methods for automatically classifying patents mainly focus on analyzing the text descriptions of patents. However, apart from the texts, each patent is also associated with some assignees, and the knowledge of their applied patents is often valuable for classification. Furthermore, the hierarchical taxonomy formulated by the IPC system provides important contextual information and enables models to leverage the correlations between IPC codes for more accurate classification. However, existing methods fail to incorporate the above aspects. In this paper, we propose an integrated framework that comprehensively considers the information on patents for patent classification. To be specific, we first present an IPC codes correlations learning module to derive their semantic representations via adaptively passing and aggregating messages within the same level and across different levels along the hierarchical taxonomy. Moreover, we design a historical application patterns learning component to incorporate the corresponding assignee's previous patents by a dual channel aggregation mechanism. Finally, we combine the contextual information of patent texts that contains the semantics of IPC codes, and assignees' sequential preferences to make predictions. Experiments on real-world datasets demonstrate the superiority of our approach over the existing methods. Besides, we present the model's ability to capture the temporal patterns of assignees and the semantic dependencies among IPC codes.

ipc code, machine learning, natural language, (19 more...)

2308.05385

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > China > Beijing > Beijing (0.05)
(15 more...)

Genre:

Research Report (0.82)
Instructional Material > Course Syllabus & Notes (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.88)

Developing Effective Educational Chatbots with ChatGPT prompts: Insights from Preliminary Tests in a Case Study on Social Media Literacy (with appendix)

Koyuturk, Cansu, Yavari, Mona, Theophilou, Emily, Bursic, Sathya, Donabauer, Gregor, Telari, Alessia, Testa, Alessia, Boiano, Raffaele, Gabbiadini, Alessandro, Hernandez-Leo, Davinia, Ruskov, Martin, Ognibene, Dimitri

Educational chatbots come with a promise of interactive and personalized learning experiences, yet their development has been limited by the restricted free interaction capabilities of available platforms and the difficulty of encoding knowledge in a suitable format. Recent advances in language learning models with zero-shot learning capabilities, such as ChatGPT, suggest a new possibility for developing educational chatbots using a prompt-based approach. We present a case study with a simple system that enables mixed-turn chatbot interactions and discuss the insights and preliminary guidelines obtained from initial tests. We examine ChatGPT's ability to pursue multiple interconnected learning objectives, adapt the educational activity to users' characteristics, such as culture, age, and level of education, and its ability to use diverse educational strategies and conversational styles. Although the results are encouraging, challenges are posed by the limited history maintained for the conversation and the highly structured form of responses by ChatGPT, as well as their variability, which can lead to an unexpected switch of the chatbot's role from a teacher to a therapist. We provide some initial guidelines to address these issues and to facilitate the development of effective educational chatbots.

large language model, machine learning, natural language, (19 more...)

2306.10645

Country:

Asia > Middle East > Iran (0.04)
Europe > Italy > Lombardy > Milan (0.04)
Europe > Switzerland (0.04)
(5 more...)

Genre:

Personal > Interview (1.00)
Instructional Material (1.00)
Research Report (0.82)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)
Health & Medicine > Consumer Health (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)