AITopics

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Multimodal Subtask Graph Generation from Instructional Videos

Jang, Yunseok, Sohn, Sungryull, Logeswaran, Lajanugen, Luo, Tiange, Lee, Moontae, Lee, Honglak

Real-world tasks consist of multiple inter-dependent subtasks (e.g., a dirty pan needs to be washed before it can be used for cooking). In this work, we aim to model the causal dependencies between such subtasks from instructional videos describing the task. This is a challenging problem since complete information about the world is often inaccessible from videos, which demands robust learning mechanisms to understand the causal structure of events. We present Multimodal Subtask Graph Generation (MSG2), an approach that constructs a Subtask Graph defining the dependency between a task's subtasks relevant to a task from noisy web videos. Graphs generated by our multimodal approach are closer to human-annotated graphs compared to prior approaches. MSG2 further performs the downstream task of next subtask prediction 85% and 30% more accurately than recent video transformer models in the ProceL and CrossTask datasets, respectively.

logic & formal reasoning, machine learning, natural language, (17 more...)

2302.08672

Country: North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)

Genre:

Research Report (0.81)
Instructional Material > Course Syllabus & Notes (0.61)

Industry:

Education > Educational Technology > Media (0.61)
Education > Educational Technology > Audio & Video (0.61)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.47)

Paaß, Gerhard, Giesselbach, Sven

Foundation Models for Natural Language Processing -- Pre-trained Language Models Integrating Media

This open access book provides a comprehensive overview of the state of the art in research and applications of Foundation Models and is intended for readers familiar with basic Natural Language Processing (NLP) concepts. Over the recent years, a revolutionary new paradigm has been developed for training models for NLP. These models are first pre-trained on large collections of text documents to acquire general syntactic knowledge and semantic information. Then, they are fine-tuned for specific tasks, which they can often solve with superhuman accuracy. When the models are large enough, they can be instructed by prompts to solve new tasks without any fine-tuning. Moreover, they can be applied to a wide range of different media and problem domains, ranging from image and video processing to robot control learning. Because they provide a blueprint for solving many tasks in artificial intelligence, they have been called Foundation Models. After a brief introduction to basic NLP models the main pre-trained language models BERT, GPT and sequence-to-sequence transformer are described, as well as the concepts of self-attention and context-sensitive embedding. Then, different approaches to improving these models are discussed, such as expanding the pre-training criteria, increasing the length of input texts, or including extra knowledge. An overview of the best-performing models for about twenty application areas is then presented, e.g., question answering, translation, story generation, dialog systems, generating images from text, etc. For each application area, the strengths and weaknesses of current models are discussed, and an outlook on further developments is given. In addition, links are provided to freely available program code. A concluding chapter summarizes the economic opportunities, mitigation of risks, and potential developments of AI.

large language model, machine learning, pattern recognition, (32 more...)

2302.08575

Country:

Europe > Ukraine > Kyiv Oblast > Kyiv (0.13)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.13)
North America > Canada > Ontario > Toronto (0.13)
(43 more...)

Genre:

Workflow (1.00)
Summary/Review (1.00)
Research Report > Promising Solution (1.00)
(4 more...)

Industry:

Transportation > Passenger (1.00)
Media > Television (1.00)
Media > News (1.00)
(21 more...)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
(23 more...)

Bacci, Giovanni, Ingólfsdóttir, Anna, Larsen, Kim G., Reynouard, Raphaël

MM Algorithms to Estimate Parameters in Continuous-time Markov Chains

Continuous-time Markov chains (CTMCs) are popular modeling formalism that constitutes the underlying semantics for real-time probabilistic systems such as queuing networks, stochastic process algebras, and calculi for systems biology. Prism and Storm are popular model checking tools that provide a number of powerful analysis techniques for CTMCs. These tools accept models expressed as the parallel composition of a number of modules interacting with each other. The outcome of the analysis is strongly dependent on the parameter values used in the model which govern the timing and probability of events of the resulting CTMC. However, for some applications, parameter values have to be empirically estimated from partially-observable executions. In this work, we address the problem of estimating parameter values of CTMCs expressed as Prism models from a number of partially-observable executions. We introduce the class parametric CTMCs -- CTMCs where transition rates are polynomial functions over a set of parameters -- as an abstraction of CTMCs covering a large class of Prism models. Then, building on a theory of algorithms known by the initials MM, for minorization-maximization, we present iterative maximum likelihood estimation algorithms for parametric CTMCs covering two learning scenarios: when both state-labels and dwell times are observable, or just state-labels are. We conclude by illustrating the use of our technique in a simple but non-trivial case study: the analysis of the spread of COVID-19 in presence of lockdown countermeasures.

artificial intelligence, machine learning, transition, (18 more...)

2302.08588

Country:

Europe > Iceland > Capital Region > Reykjavik (0.04)
Europe > Denmark > North Jutland > Aalborg (0.04)
North America > United States > New York > New York County > New York City (0.04)
(5 more...)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.48)
Health & Medicine > Therapeutic Area > Immunology (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

A Probabilistic Generative Model for Tracking Multi-Knowledge Concept Mastery Probability

Liu, Hengyu, Zhang, Tiancheng, Li, Fan, Yu, Minghe, Yu, Ge

Knowledge tracing aims to track students' knowledge status over time to predict students' future performance accurately. Markov chain-based knowledge tracking (MCKT) models can track knowledge concept mastery probability over time. However, as the number of tracked knowledge concepts increases, the time complexity of MCKT predicting student performance increases exponentially (also called explaining away problem. In addition, the existing MCKT models only consider the relationship between students' knowledge status and problems when modeling students' responses but ignore the relationship between knowledge concepts in the same problem. To address these challenges, we propose an inTerpretable pRobAbilistiC gEnerative moDel (TRACED), which can track students' numerous knowledge concepts mastery probabilities over time. To solve \emph{explain away problem}, we design Long and Short-Term Memory (LSTM)-based networks to approximate the posterior distribution, predict students' future performance, and propose a heuristic algorithm to train LSTMs and probabilistic graphical model jointly. To better model students' exercise responses, we proposed a logarithmic linear model with three interactive strategies, which models students' exercise responses by considering the relationship among students' knowledge status, knowledge concept, and problems. We conduct experiments with four real-world datasets in three knowledge-driven tasks. The experimental results show that TRACED outperforms existing knowledge tracing methods in predicting students' future performance and can learn the relationship among students, knowledge concepts, and problems from students' exercise sequences. We also conduct several case studies. The case studies show that TRACED exhibits excellent interpretability and thus has the potential for personalized automatic feedback in the real-world educational environment.

artificial intelligence, machine learning, student, (16 more...)

2302.08673

Country:

Asia > China > Liaoning Province > Shenyang (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > New York (0.04)
(5 more...)

Genre:

Instructional Material (0.92)
Research Report > New Finding (0.34)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.66)

Learning with Rejection for Abstractive Text Summarization

Cao, Meng, Dong, Yue, He, Jingyi, Cheung, Jackie Chi Kit

State-of-the-art abstractive summarization systems frequently hallucinate content that is not supported by the source document, mainly due to noise in the training dataset. Existing methods opt to drop the noisy samples or tokens from the training set entirely, reducing the effective training set size and creating an artificial propensity to copy words from the source. In this work, we propose a training objective for abstractive summarization based on rejection learning, in which the model learns whether or not to reject potentially noisy tokens. We further propose a regularized decoding objective that penalizes non-factual candidate summaries during inference by using the rejection probability learned during training. We show that our method considerably improves the factuality of generated summaries in automatic and human evaluations when compared to five baseline models and that it does so while increasing the abstractiveness of the generated summaries.

computational linguistic, machine learning, natural language, (17 more...)

2302.08531

Country:

Europe > United Kingdom (0.14)
North America > Canada > Quebec > Montreal (0.14)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(7 more...)

Genre:

Research Report (0.50)
Instructional Material (0.35)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Law > Criminal Law (0.94)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

#artificialintelligenceFeb-15-2023, 14:00:38 GMT

World Customs Organization

The World Customs Organization (WCO) recently conducted a BACUDA Data Analytics workshop for the Maldives Customs Service with 41 participants from the 30th of January to the 1st of February in Male, Maldives. The mission was financed by the Customs Cooperation Fund of Korea (CCF-Korea) and took place under the WCO's BACUDA initiative, the WCO capacity building project on Data Analytics. WCO experts and two BACUDA Scholarship graduates led the workshop. They delivered various sessions to equip the customs officials with the latest data analytics tools and techniques. One of the key highlights of the workshop was a hands-on session where the participants learned how to use Python language to work with the AI HS algorithm developed through the BACUDA project.

participant, workshop, world custom organization, (11 more...)

Country: Asia > Maldives > North Central Province > Kaafu Atoll > Malé (0.26)

Genre: Instructional Material > Course Syllabus & Notes (0.48)

Industry:

Education (0.76)
Law > International Law (0.62)
Government > Immigration & Customs (0.62)
Media > News (0.40)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence (0.87)

#artificialintelligenceFeb-15-2023, 05:51:17 GMT

The 2023 Machine Learning Engineer RoadMap

Learning this fabulous programming language is not just mandatory to start your journey in machine learning. Still, it is an investment in yourself that you may need all your life because you can even shift your career to another one and still use python in that new industry. This is almost the most popular course among python developers which will help you learn the basics of this language and use the Python built-in data structure, accessing the web, which will be very useful when you are trying to get the data from the web, and using python with the database. The course has more than a million students with a 4.8 rating score which is an excellent resource. Alternatively, you can start your Machine Learning Career with R programming language.

library, machine learning engineer roadmap, programming language, (7 more...)

Genre: Instructional Material (0.35)

Industry:

Education > Educational Setting > Online (0.51)
Education > Educational Technology > Educational Software > Computer Based Training (0.31)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.31)

#artificialintelligenceFeb-15-2023, 05:20:29 GMT

GitHub - girafe-ai/ml-course: Open Machine Learning course

Warning, repository has been renamed to represent its current status. This course aims to introduce students to modern state of Machine Learning and Artificial Intelligence. It is designed to take one full year - approximately 2 * 15 lectures and seminars. All learning materials are available here, full list of topics considered in the course are listed in program_*.pdf Although if you don't have any of this, you could substitude it with your diligence because the course provides additional materials to study requirements yourself.

conspect, github, open machine learning course, (1 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education (0.75)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Mittal, Sudip, Chen, Jingdao

AI Security Threats against Pervasive Robotic Systems: A Course for Next Generation Cybersecurity Workforce

arXiv.org Artificial IntelligenceFeb-15-2023

Robotics, automation, and related Artificial Intelligence (AI) systems have become pervasive bringing in concerns related to security, safety, accuracy, and trust. With growing dependency on physical robots that work in close proximity to humans, the security of these systems is becoming increasingly important to prevent cyber-attacks that could lead to privacy invasion, critical operations sabotage, and bodily harm. The current shortfall of professionals who can defend such systems demands development and integration of such a curriculum. This course description includes details about seven self-contained and adaptive modules on "AI security threats against pervasive robotic systems". Topics include: 1) Introduction, examples of attacks, and motivation; 2) - Robotic AI attack surfaces and penetration testing; 3) - Attack patterns and security strategies for input sensors; 4) - Training attacks and associated security strategies; 5) - Inference attacks and associated security strategies; 6) - Actuator attacks and associated security strategies; and 7) - Ethics of AI, robotics, and cybersecurity.

artificial intelligence, machine learning, student, (19 more...)

2302.07953

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Mississippi > Mississippi County > Mississippi State (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Asia > Middle East > Iran > Tehran Province > Tehran (0.04)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)