AITopics

2210.15767

Country:

North America > United States > Texas > Travis County > Austin (0.24)
North America > Canada > Ontario > Toronto (0.14)
Asia > Middle East > UAE (0.14)
(30 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Personal (1.00)
Instructional Material (1.00)

Industry:

Transportation > Ground > Road (1.00)
Media > News (1.00)
Law > Statutes (1.00)
(30 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Networks (1.00)
(10 more...)

Zucchet, Nicolas, Sacramento, João

Beyond backpropagation: bilevel optimization through implicit differentiation and equilibrium propagation

arXiv.org Artificial IntelligenceOct-27-2022

This paper reviews gradient-based techniques to solve bilevel optimization problems. Bilevel optimization is a general way to frame the learning of systems that are implicitly defined through a quantity that they minimize. This characterization can be applied to neural networks, optimizers, algorithmic solvers and even physical systems, and allows for greater modeling flexibility compared to an explicit definition of such systems. Here we focus on gradient-based approaches that solve such problems. We distinguish them in two categories: those rooted in implicit differentiation, and those that leverage the equilibrium propagation theorem. We present the mathematical foundations that are behind such methods, introduce the gradient-estimation algorithms in detail and compare the competitive advantages of the different approaches.

artificial intelligence, machine learning, optimization, (16 more...)

doi: 10.1162/neco_a_01547

2205.03076

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States (0.14)
North America > Canada > Quebec > Montreal (0.04)
(2 more...)

Genre:

Research Report (1.00)
Overview (0.74)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Education (0.68)
Health & Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Waghmare, Govind, Debnath, Ankur, Asthana, Siddhartha, Malhotra, Aakarsh

Modeling Inter-Dependence Between Time and Mark in Multivariate Temporal Point Processes

arXiv.org Artificial IntelligenceOct-27-2022

Temporal Point Processes (TPP) are probabilistic generative frameworks. They model discrete event sequences localized in continuous time. Generally, real-life events reveal descriptive information, known as marks. Marked TPPs model time and marks of the event together for practical relevance. Conditioned on past events, marked TPPs aim to learn the joint distribution of the time and the mark of the next event. For simplicity, conditionally independent TPP models assume time and marks are independent given event history. They factorize the conditional joint distribution of time and mark into the product of individual conditional distributions. This structural limitation in the design of TPP models hurt the predictive performance on entangled time and mark interactions. In this work, we model the conditional inter-dependence of time and mark to overcome the limitations of conditionally independent models. We construct a multivariate TPP conditioning the time distribution on the current event mark in addition to past events. Besides the conventional intensity-based models for conditional joint distribution, we also draw on flexible intensity-free TPP models from the literature. The proposed TPP models outperform conditionally independent and dependent models in standard prediction tasks. Our experimentation on various datasets with multiple evaluation metrics highlights the merit of the proposed approach.

artificial intelligence, dataset, machine learning, (12 more...)

doi: 10.1145/3511808.3557399

2210.15294

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.05)
Asia > India (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre:

Research Report (0.64)
Instructional Material (0.47)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.68)

#artificialintelligenceOct-26-2022, 13:49:08 GMT

Learn Artificial Intelligence In Few Steps

This Artificial Intelligence tutorial provides basic and intermediate information on concepts of Artificial Intelligence. This course is intended for educators, programmers, It technicians and students. This course aims at identifying artificial intelligence and machine learning and their importance especially in the future. It also aims at applying machinelearningforkids website that helps creating such activities. Artificial intelligence makes it possible for machines to learn from experience, adjust to new inputs and perform human like tasks.

artificial intelligence, computer, learn artificial intelligence

Genre: Instructional Material > Course Syllabus & Notes (0.99)

Industry: Education (0.41)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.79)

#artificialintelligenceOct-26-2022, 01:45:32 GMT

What is the best GPU for deep learning?

Artificial intelligence, machine learning and deep learning are an integral part of the modern age, where business and science are finding a way to transform big data into actionable information. In this article, you will learn several things about deep learning: What is the deep learning process? What machines and mechanisms implement the deep learning process?

best gpu, deep learning, learning, (1 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.39)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Lee, Changyoon, Seonwoo, Yeon, Oh, Alice

CS1QA: A Dataset for Assisting Code-based Question Answering in an Introductory Programming Course

arXiv.org Artificial IntelligenceOct-26-2022

We introduce CS1QA, a dataset for code-based question answering in the programming education domain. CS1QA consists of 9,237 question-answer pairs gathered from chat logs in an introductory programming class using Python, and 17,698 unannotated chat data with code. Each question is accompanied with the student's code, and the portion of the code relevant to answering the question. We carefully design the annotation process to construct CS1QA, and analyze the collected dataset in detail. The tasks for CS1QA are to predict the question type, the relevant code snippet given the question and the code and retrieving an answer from the annotated corpus. Results for the experiments on several baseline models are reported and thoroughly analyzed. The tasks for CS1QA challenge models to understand both the code and natural language. This unique dataset can be used as a benchmark for source code comprehension and question answering in the educational setting.

artificial intelligence, natural language, question answering, (19 more...)

doi: 10.18653/v1/2022.naacl-main.148

2210.14494

Country: North America > United States > Texas > Travis County > Austin (0.04)

Genre:

Research Report (0.50)
Instructional Material (0.34)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.93)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.92)

arXiv.org Artificial IntelligenceOct-26-2022

Improving Students' Academic Performance with AI and Semantic Technologies

Cheng, Yixin

Artificial intelligence and semantic technologies are evolving and have been applied in various research areas, including the education domain. Higher Education institutions strive to improve students' academic performance. Early intervention to at-risk students and a reasonable curriculum is vital for students' success. Prior research opted for deploying traditional machine learning models to predict students' performance. In terms of curriculum semantic analysis, after conducting a comprehensive systematic review regarding the use of semantic technologies in the Computer Science curriculum, a major finding of the study is that technologies used to measure similarity have limitations in terms of accuracy and ambiguity in the representation of concepts, courses, etc. To fill these gaps, in this study, three implementations were developed, that is, to predict students' performance using marks from the previous semester, to model a course representation in a semantic way and compute the similarity, and to identify the prerequisite between two similar courses. Regarding performance prediction, we used the combination of Genetic Algorithm and Long-Short Term Memory (LSTM) on a dataset from a Brazilian university containing 248730 records. As for similarity measurement, we deployed BERT to encode the sentences and used cosine similarity to obtain the distance between courses. With respect to prerequisite identification, TextRazor was applied to extract concepts from course description, followed by employing SemRefD to measure the degree of prerequisite between two concepts. The outcomes of this study can be summarized as: (i) a breakthrough result improves Manrique's work by 2.5% in terms of accuracy in dropout prediction; (ii) uncover the similarity between courses based on course description; (iii) identify the prerequisite over three compulsory courses of School of Computing at ANU.

evolutionary algorithm, machine learning, natural language, (20 more...)

2206.03213

Country:

Asia > Thailand (0.04)
Asia > Afghanistan (0.04)
South America > Ecuador (0.04)
(8 more...)

Genre:

Research Report > New Finding (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Curriculum (1.00)
Education > Assessment & Standards > Student Performance (1.00)
Education > Educational Setting > Higher Education (0.88)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

#artificialintelligenceOct-25-2022, 16:56:00 GMT

VSC: VSC School Seminar

Training week for selected students delivered by the SCtrain project.

claudia blaa-schenner, vsc team, vsc training course, (11 more...)

Country:

Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.11)
Europe > Austria > Tyrol > Innsbruck (0.08)
Europe > Austria > Vienna (0.05)
(7 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education > Educational Setting > Online (0.74)

Technology:

Information Technology > Scientific Computing (0.69)
Information Technology > Artificial Intelligence (0.47)

#artificialintelligenceOct-25-2022, 16:55:15 GMT

How to Build a Deep Learning Based Recommender System

Amazon, Netflix, and Indeed don't simply provide more options than traditional retail stores, video rental stores, and newspapers -- they provide so many more options that the human mind can effectively comprehend and parse. Users need to be shown what will most appeal to them. There were recommender systems before deep learning, but until that advancement, technical constraints ensured choice remained tyrannical. Deep learning has become an essential component of recommender systems, and anyone who wants to understand the latter must understand the former. Traditional recommender systems make recommendations to users based on previous user interactions or attributes, depending on whether the recommender system uses content-based filtering, collaborative filtering, or a hybrid of the two. Content-based filtering recommends items with similar features to items a user interacted with in the past.

neural network, recommendation, recommender system, (13 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.70)

Industry:

Retail (0.89)
Information Technology > Services (0.50)
Media > Film (0.36)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceOct-25-2022

Similarity between Units of Natural Language: The Transition from Coarse to Fine Estimation

Mu, Wenchuan

Capturing the similarities between human language units is crucial for explaining how humans associate different objects, and therefore its computation has received extensive attention, research, and applications. With the ever-increasing amount of information around us, calculating similarity becomes increasingly complex, especially in many cases, such as legal or medical affairs, measuring similarity requires extra care and precision, as small acts within a language unit can have significant real-world effects. My research goal in this thesis is to develop regression models that account for similarities between language units in a more refined way. Computation of similarity has come a long way, but approaches to debugging the measures are often based on continually fitting human judgment values. To this end, my goal is to develop an algorithm that precisely catches loopholes in a similarity calculation. Furthermore, most methods have vague definitions of the similarities they compute and are often difficult to interpret. The proposed framework addresses both shortcomings. It constantly improves the model through catching different loopholes. In addition, every refinement of the model provides a reasonable explanation. The regression model introduced in this thesis is called progressively refined similarity computation, which combines attack testing with adversarial training. The similarity regression model of this thesis achieves state-of-the-art performance in handling edge cases.

baseline algorithm and evaluation metric, data mining, machine learning, (24 more...)

2210.14275

Genre:

Research Report > New Finding (1.00)
Personal (1.00)
Overview (1.00)
Instructional Material (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(7 more...)