AITopics

2308.16375

Country:

Asia > China > Hong Kong (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Tennessee > Davidson County > Nashville (0.04)
(10 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)
Personal > Honors (0.45)
Instructional Material > Course Syllabus & Notes (0.45)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (0.92)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Li, Chunyuan, Gan, Zhe, Yang, Zhengyuan, Yang, Jianwei, Li, Linjie, Wang, Lijuan, Gao, Jianfeng

cvf international conference, in-context learning ability, information processing system, (16 more...)

This paper presents a comprehensive survey of the taxonomy and evolution of multimodal foundation models that demonstrate vision and vision-language capabilities, focusing on the transition from specialist models to general-purpose assistants. The research landscape encompasses five core topics, categorized into two classes. (i) We start with a survey of well-established research areas: multimodal foundation models pre-trained for specific purposes, including two topics -- methods of learning vision backbones for visual understanding and text-to-image generation. (ii) Then, we present recent advances in exploratory, open research areas: multimodal foundation models that aim to play the role of general-purpose assistants, including three topics -- unified vision models inspired by large language models (LLMs), end-to-end training of multimodal LLMs, and chaining multimodal tools with LLMs. The target audiences of the paper are researchers, graduate students, and professionals in computer vision and vision-language multimodal communities who are eager to learn the basics and recent advances in multimodal foundation models.

2309.1002

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > Poland (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Instructional Material (1.00)

Industry:

Leisure & Entertainment (1.00)
Education (1.00)
Transportation > Passenger (0.92)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.46)

Khan, Valeriya, Cygert, Sebastian, Deja, Kamil, Trzciński, Tomasz, Twardowski, Bartłomiej

Looking through the past: better knowledge retention for generative replay in continual learning

In this work, we improve the generative replay in a continual learning setting to perform well on challenging scenarios. Current generative rehearsal methods are usually benchmarked on small and simple datasets as they are not powerful enough to generate more complex data with a greater number of classes. We notice that in VAE-based generative replay, this could be attributed to the fact that the generated features are far from the original ones when mapped to the latent space. Therefore, we propose three modifications that allow the model to learn and generate complex data. More specifically, we incorporate the distillation in latent space between the current and previous models to reduce feature drift. Additionally, a latent matching for the reconstruction and original data is proposed to improve generated features alignment. Further, based on the observation that the reconstructions are better for preserving knowledge, we add the cycling of generations through the previously trained model to make them closer to the original data. Our method outperforms other generative replay methods in various scenarios. Code available at https://github.com/valeriya-khan/looking-through-the-past.

accuracy, learning, replay, (14 more...)

2309.10012

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > Poland > Pomerania Province > Gdańsk (0.04)
Europe > Poland > Masovia Province > Warsaw (0.04)
(2 more...)

Genre:

Instructional Material (0.46)
Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Khakhiashvili, Ilya, Dery, Lihi, Grinshpoun, Tal

Distributed course allocation with asymmetric friendships

Students' decisions on whether to take a class are strongly affected by whether their friends plan to take the class with them. A student may prefer to be assigned to a course they likes less, just to be with their friends, rather than taking a more preferred class alone. It has been shown that taking classes with friends positively affects academic performance. Thus, academic institutes should prioritize friendship relations when assigning course seats. The introduction of friendship relations results in several non-trivial changes to current course allocation methods. This paper explores how course allocation mechanisms can account for friendships between students and provide a unique, distributed solution. In particular, we model the problem as an asymmetric distributed constraint optimization problem and develop a new dedicated algorithm. Our extensive evaluation includes both simulated data and data derived from a user study on 177 students' preferences over courses and friends. The results show that our algorithm obtains high utility for the students while keeping the solution fair and observing courses' seat capacity limitations.

agent, algorithm, student, (16 more...)

2309.09684

Country:

Asia > Middle East > Israel (0.05)
North America > United States > Colorado (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Hungary (0.04)

Genre:

Research Report > New Finding (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.89)

Perdomo, Juan C., Britton, Tolani, Hardt, Moritz, Abebe, Rediet

Difficult Lessons on Social Prediction from Wisconsin Public Schools

Early warning systems (EWS) are predictive tools at the center of recent efforts to improve graduation rates in public schools across the United States. These systems assist in targeting interventions to individual students by predicting which students are at risk of dropping out. Despite significant investments in their widespread adoption, there remain large gaps in our understanding of the efficacy of EWS, and the role of statistical risk scores in education. In this work, we draw on nearly a decade's worth of data from a system used throughout Wisconsin to provide the first large-scale evaluation of the long-term impact of EWS on graduation outcomes. We present empirical evidence that the prediction system accurately sorts students by their dropout risk. We also find that it may have caused a single-digit percentage increase in graduation rates, though our empirical analyses cannot reliably rule out that there has been no positive treatment effect. Going beyond a retrospective evaluation of DEWS, we draw attention to a central question at the heart of the use of EWS: Are individual risk scores necessary for effectively targeting interventions? We propose a simple mechanism that only uses information about students' environments -- such as their schools, and districts -- and argue that this mechanism can target interventions just as efficiently as the individual risk score-based mechanism. Our argument holds even if individual predictions are highly accurate and effective interventions exist. In addition to motivating this simple targeting mechanism, our work provides a novel empirical backbone for the robust qualitative understanding among education researchers that dropout is structurally determined. Combined, our insights call into question the marginal value of individual predictions in settings where outcomes are driven by high levels of inequality.

intervention, prediction, student, (16 more...)

2304.06205

Country:

North America > United States > Wisconsin (0.62)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)
(4 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Instructional Material (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Education > Educational Setting > K-12 Education > Secondary School (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.87)
Information Technology > Data Science > Data Mining (0.67)

Leon, Vincent, Etesami, S. Rasoul

Online Reinforcement Learning in Markov Decision Process Using Linear Programming

We consider online reinforcement learning in episodic Markov decision process (MDP) with unknown transition function and stochastic rewards drawn from some fixed but unknown distribution. The learner aims to learn the optimal policy and minimize their regret over a finite time horizon through interacting with the environment. We devise a simple and efficient model-based algorithm that achieves $\widetilde{O}(LX\sqrt{TA})$ regret with high probability, where $L$ is the episode length, $T$ is the number of episodes, and $X$ and $A$ are the cardinalities of the state space and the action space, respectively. The proposed algorithm, which is based on the concept of ``optimism in the face of uncertainty", maintains confidence sets of transition and reward functions and uses occupancy measures to connect the online MDP with linear programming. It achieves a tighter regret bound compared to the existing works that use a similar confidence set framework and improves computational effort compared to those that use a different framework but with a slightly tighter regret bound.

algorithm, occupancy measure, probability, (12 more...)

2304.00155

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Virginia > Arlington County > Arlington (0.04)
North America > United States > Illinois > Champaign County > Urbana (0.04)
(3 more...)

Genre: Instructional Material > Online (0.60)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.72)

Applying Automated Machine Translation to Educational Video Courses

Wang, Linden

We studied the capability of automated machine translation in the online video education space by automatically translating Khan Academy videos with state-of-the-art translation models and applying text-to-speech synthesis and audio/video synchronization to build engaging videos in target languages. We also analyzed and established two reliable translation confidence estimators based on round-trip translations in order to efficiently manage translation quality and reduce human translation effort. Finally, we developed a deployable system to deliver translated videos to end users and collect user corrections for iterative improvement.

threshold, translation, video, (12 more...)

2301.03141

Country:

North America > United States > California > Yolo County > Davis (0.14)
Africa > Sub-Saharan Africa (0.05)
South America > Chile (0.04)
(5 more...)

Genre: Instructional Material > Online (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.37)

Daily Mail - Science & techSep-17-2023, 13:04:24 GMT

EXCLUSIVE Former Google futurist predicts what classrooms of the future will look like - they include virtual reality lessons and downloadable knowledge

By 2050, students will download knowledge directly into their brains from AI tutors, with no human teacher present - and receive lessons tailored to their DNA, a futurist has predicted. Virtual reality will be the main mode of learning for many subjects, and the most important subject students will learn will be how to work as a'co-bot' alongside artificial intelligence, said Tracey Follows, a futurist who has worked with clients including Google, Virgin and Telefonica. Follows, who is listed as one of the top female futurists worldwide in Forbes, said that even classrooms might be a thing of the past as students'self teach' with the help of AI'tutors'. Follows produced a white paper on the subject in collaboration with online tutoring service GoStudent, and said that while her predictions may seem out there, they are'not science fiction'. She predicts that new subjects such as interstellar studies and biotech will become popular as humanity moves towards becoming an interplanetary species.

classroom, rob waugh midjourney, student, (7 more...)

Daily Mail - Science & tech

Country: Asia > China (0.05)

Genre: Instructional Material > Course Syllabus & Notes (0.86)

Industry: Education > Educational Setting > Online (0.85)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.64)

arXiv.org Artificial IntelligenceSep-17-2023

Performance of the Pre-Trained Large Language Model GPT-4 on Automated Short Answer Grading

Kortemeyer, Gerd

Automated Short Answer Grading (ASAG) has been an active area of machine-learning research for over a decade. It promises to let educators grade and give feedback on free-form responses in large-enrollment courses in spite of limited availability of human graders. Over the years, carefully trained models have achieved increasingly higher levels of performance. More recently, pre-trained Large Language Models (LLMs) emerged as a commodity, and an intriguing question is how a general-purpose tool without additional training compares to specialized models. We studied the performance of GPT-4 on the standard benchmark 2-way and 3-way datasets SciEntsBank and Beetle, where in addition to the standard task of grading the alignment of the student answer with a reference answer, we also investigated withholding the reference answer. We found that overall, the performance of the pre-trained general-purpose GPT-4 LLM is comparable to hand-engineered models, but worse than pre-trained LLMs that had specialized training.

gpt-4, reference answer, student answer, (11 more...)

2309.09338

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > District of Columbia > Washington (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Instructional Material (1.00)

Industry:

Education > Educational Setting (0.70)
Education > Educational Technology (0.47)
Information Technology > Security & Privacy (0.46)
Education > Assessment & Standards > Student Performance (0.38)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Hashemi, Navid, Qin, Xin, Lindemann, Lars, Deshmukh, Jyotirmoy V.

Data-Driven Reachability Analysis of Stochastic Dynamical Systems with Conformal Inference

arXiv.org Artificial IntelligenceSep-17-2023

We consider data-driven reachability analysis of discrete-time stochastic dynamical systems using conformal inference. We assume that we are not provided with a symbolic representation of the stochastic system, but instead have access to a dataset of $K$-step trajectories. The reachability problem is to construct a probabilistic flowpipe such that the probability that a $K$-step trajectory can violate the bounds of the flowpipe does not exceed a user-specified failure probability threshold. The key ideas in this paper are: (1) to learn a surrogate predictor model from data, (2) to perform reachability analysis using the surrogate model, and (3) to quantify the surrogate model's incurred error using conformal inference in order to give probabilistic reachability guarantees. We focus on learning-enabled control systems with complex closed-loop dynamics that are difficult to model symbolically, but where state transition pairs can be queried, e.g., using a simulator. We demonstrate the applicability of our method on examples from the domain of learning-enabled cyber-physical systems.

reachability analysis, surrogate model, trajectory, (14 more...)

2309.09187

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > Portugal > Porto > Porto (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(4 more...)

Genre:

Research Report (0.64)
Instructional Material > Course Syllabus & Notes (0.46)

Industry: Energy (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)
Information Technology > Modeling & Simulation (0.68)