AITopics | Instructional Material

Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation

Wei, Xiwen, Li, Guihong, Marculescu, Radu

arXiv.org Artificial Intelligence, Nov-8-2024

Catastrophic forgetting is a significant challenge in online continual learning (OCL), especially for non-stationary data streams that do not have well-defined task boundaries. This challenge is exacerbated by the memory constraints and privacy concerns inherent in rehearsal buffers. To tackle catastrophic forgetting, in this paper, we introduce Online-LoRA, a novel framework for task-free OCL. Online-LoRA fine-tunes pre-trained Vision Transformer (ViT) models in real time, which addresses the limitations of rehearsal buffers and leverages the performance benefits of pre-trained models. As the main contribution, our approach features a novel online weight regularization strategy to identify and consolidate important model parameters. Moreover, Online-LoRA leverages the training dynamics of loss values to automatically recognize data distribution shifts. Extensive experiments across many task-free OCL scenarios and benchmark datasets (including CIFAR-100, ImageNet-R, ImageNet-S, CUB-200 and CORe50) demonstrate that Online-LoRA can be robustly adapted to various ViT architectures, while achieving better performance than SOTA methods. Our code will be publicly available at: https://github.com/Christina200/Online-LoRA-official.git.
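For readers unfamiliar with the low-rank adaptation idea named in the title, the following is a minimal sketch of a generic LoRA-style linear layer: a frozen weight matrix W plus a trainable low-rank update B A. It is not the authors' implementation and does not include their regularization or shift detection; the function name `lora_forward`, the shapes, and the `alpha / r` scaling are illustrative assumptions.

```python
import numpy as np


def lora_forward(x, W, A, B, alpha=1.0):
    """Apply a frozen linear map W plus a low-rank update (alpha / r) * B @ A.

    Hypothetical helper for illustration only.
    x: (batch, d_in), W: (d_out, d_in), A: (r, d_in), B: (d_out, r).
    """
    r = A.shape[0]                      # rank of the adapter
    base = x @ W.T                      # output of the frozen pre-trained weights
    update = (x @ A.T) @ B.T            # low-rank path, never forms the full d_out x d_in matrix
    return base + (alpha / r) * update


rng = np.random.default_rng(0)
d_in, d_out, r = 8, 4, 2                # assumed toy dimensions
W = rng.normal(size=(d_out, d_in))      # frozen weights
A = rng.normal(size=(r, d_in))          # trainable, randomly initialized
B = np.zeros((d_out, r))                # trainable, zero-initialized so the adapter starts as a no-op
x = rng.normal(size=(3, d_in))

y = lora_forward(x, W, A, B)
```

With B initialized to zero, the adapted layer initially reproduces the frozen layer exactly; only the r * (d_in + d_out) adapter parameters would be updated during fine-tuning.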

artificial intelligence, continual learning, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2411.05663

Country:

Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > California (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)

Genre:

Research Report (1.00)
Instructional Material > Online (0.85)

Industry: Education > Educational Setting > Online (0.70)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Data Science (0.87)

Sony PlayStation 5 Pro Review: More Power, More Immersion, More Money

WIRED, Nov-6-2024, 20:41:20 GMT

I remember the first time I watched a tutorial on Blender, a 3D computer graphics program, explaining how metal surfaces have colored reflections, while nonmetal surfaces don't. It was a fascinating art lesson and something I don't think I ever would've noticed if no one had pointed it out. I felt excited to learn about such a cool, if inconsequential, detail about how our world looks. While testing out Sony's PlayStation 5 Pro, I experienced that same feeling over and over again. Generally, video game graphics have reached the coveted point of "good enough."

artificial intelligence, playstation 5, sony playstation 5, (7 more...)

WIRED

Genre: Instructional Material > Course Syllabus & Notes (0.57)

Industry:

Semiconductors & Electronics (0.65)
Education > Curriculum > Subject-Specific Education (0.57)
Leisure & Entertainment > Games > Computer Games (0.39)

Technology: Information Technology > Artificial Intelligence > Games (0.39)

Enhancing classroom teaching with LLMs and RAG

Mullins, Elizabeth A, Portillo, Adrian, Ruiz-Rohena, Kristalys, Piplai, Aritran

arXiv.org Artificial Intelligence, Nov-6-2024

Large Language Models have become a valuable source of information for our daily inquiries. However, after training, their knowledge quickly becomes out of date, making RAG a useful tool for providing more recent or pertinent data. In this work, we investigate how RAG pipelines, with the course materials serving as a data source, might help students in K-12 education. The initial research utilizes Reddit as a data source for up-to-date cybersecurity information. Chunk size is evaluated to determine the optimal amount of context needed to generate accurate answers. After running the experiment for different chunk sizes, answer correctness was evaluated using RAGAs, with average answer correctness not exceeding 50 percent for any chunk size. This suggests that Reddit is not a good source to mine for data for questions about cybersecurity threats. The methodology was successful in evaluating the data source, which has implications for its use to evaluate educational resources for effectiveness.
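The chunk-size parameter mentioned in the abstract can be illustrated with a minimal sketch of fixed-size chunking, the step that splits source text into retrievable pieces before indexing. The abstract does not say how chunks are measured or whether they overlap, so the word-based splitting, the `overlap` parameter, and the function name `chunk_text` are assumptions made for illustration.

```python
def chunk_text(text, chunk_size, overlap=0):
    """Split text into chunks of at most `chunk_size` words.

    Hypothetical helper: word-based chunking with optional overlap is an
    assumption, not the procedure used in the paper.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap          # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break                        # the last window already reached the end
    return chunks


small_chunks = chunk_text("a b c d e", chunk_size=2)
overlapping_chunks = chunk_text("a b c d e", chunk_size=3, overlap=1)
```

Smaller chunks give the retriever more precise but less contextual passages, while larger chunks carry more context per retrieved passage; sweeping `chunk_size` is one way to study that trade-off.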

answer correctness, chunk size, information, (13 more...)

arXiv.org Artificial Intelligence

2411.04341

Country:

North America > United States > Texas > El Paso County > El Paso (0.17)
North America > United States > New York > New York County > New York City (0.05)
Asia > Singapore (0.05)
Asia > Indonesia > Bali (0.05)

Genre: Instructional Material (1.00)

Industry:

Government > Military > Cyberwarfare (0.77)
Education > Educational Setting > K-12 Education (0.70)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.99)

dsld: A Socially Relevant Tool for Teaching Statistics

Abdullah, Taha, Ashok, Arjun, Estrada, Brandon, Matloff, Norman, Mittal, Aditya

arXiv.org Artificial Intelligence, Nov-6-2024

The growing power of data science can play a crucial role in addressing social discrimination, which requires a nuanced understanding of potential biases and effective strategies for mitigating them. Data Science Looks At Discrimination (dsld) is an R and Python package designed to provide users with a comprehensive toolkit of statistical and graphical methods for assessing possible discrimination related to protected groups, such as race, gender, and age. Our software offers techniques for discrimination analysis by identifying and mitigating confounding variables, along with methods for reducing bias in predictive models. In educational settings, dsld offers instructors powerful tools to teach important statistical principles through motivating real-world examples of discrimination analysis. The inclusion of an 80-page Quarto book further supports users, from statistics educators to legal professionals, in effectively applying these analytical tools to real-world scenarios.

dataset, lsat score, student, (17 more...)

arXiv.org Artificial Intelligence

2411.04228

Country:

North America > United States > California > Yolo County > Davis (0.05)
North America > United States > New York (0.04)

Genre:

Instructional Material (1.00)
Research Report > New Finding (0.68)

Industry:

Law (1.00)
Education > Educational Setting > Higher Education (0.93)
Health & Medicine (0.93)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)

SEGMN: A Structure-Enhanced Graph Matching Network for Graph Similarity Learning

Wang, Wenjun, Lu, Jiacheng, Chen, Kejia, Liu, Zheng, Sang, Shilong

arXiv.org Artificial Intelligence, Nov-5-2024

Graph similarity computation (GSC) aims to quantify the similarity score between two graphs. Although recent GSC methods based on graph neural networks (GNNs) take advantage of intra-graph structures in message passing, few of them fully utilize the structures presented by edges to boost the representation of their connected nodes. Moreover, previous cross-graph node embedding matching lacks perception of the overall structure of the graph pair, because the node representations from GNNs are confined to the intra-graph structure, which leads to unreasonable similarity scores. Intuitively, the cross-graph structure represented in the assignment graph helps to rectify inappropriate matching. Therefore, we propose a structure-enhanced graph matching network (SEGMN). Equipped with a dual embedding learning module and a structure perception matching module, SEGMN achieves structure enhancement in both embedding learning and cross-graph matching. The dual embedding learning module incorporates adjacent edge representation into each node to achieve a structure-enhanced representation. The structure perception matching module achieves cross-graph structure enhancement through assignment graph convolution. The similarity score of each cross-graph node pair can be rectified by aggregating messages from structurally relevant node pairs. Experimental results on benchmark datasets demonstrate that SEGMN outperforms the state-of-the-art GSC methods in the GED regression task, and that the structure perception matching module is plug-and-play and can further improve the performance of the baselines by up to 25%.

artificial intelligence, graph, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2411.03624

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
North America > Canada > Quebec > Capitale-Nationale Region > Québec (0.04)
North America > Canada > Quebec > Capitale-Nationale Region > Quebec City (0.04)
(2 more...)

Genre:

Research Report (0.82)
Instructional Material > Course Syllabus & Notes (0.45)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Distributionally Robust Optimization

Kuhn, Daniel, Shafiee, Soroosh, Wiesemann, Wolfram

arXiv.org Machine Learning, Nov-4-2024

With its early roots in the development of calculus by Isaac Newton, Gottfried Wilhelm Leibniz, Pierre de Fermat and others in the late 17th century, mathematical optimization has a rich history that involves contributions from numerous mathematicians, economists, engineers, and scientists. The birth of modern mathematical optimization is commonly credited to George Dantzig, whose simplex algorithm developed in 1947 solves linear optimization problems where ℓ is affine and X is a polyhedron (Dantzig 1956). Subsequent milestones include the development of the rich theory of convex analysis (Rockafellar 1970) as well as the discovery of polynomial-time solution methods for linear (Khachiyan 1979, Karmarkar 1984) and broad classes of nonlinear convex optimization problems (Nesterov and Nemirovskii 1994). Classical optimization problems are deterministic, that is, all problem data are assumed to be known with certainty. However, most decision problems encountered in practice depend on parameters that are corrupted by measurement errors or that are revealed only after a decision must be determined and committed. A naïve approach to model uncertainty-affected decision problems as deterministic optimization problems would be to replace all uncertain parameters with their expected values or with appropriate point predictions. However, it has long been known and well-documented that decision-makers who replace an uncertain parameter of an optimization problem with its mean value fall victim to the 'flaw of averages' (Savage, Scholtes and Zweidler 2006, Savage 2012).
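The 'flaw of averages' can be made concrete with a small worked example: evaluating a decision at the mean of an uncertain parameter generally differs from the expected outcome over the parameter's distribution. The toy ordering problem below (prices, costs, and the two-point demand distribution) is invented for illustration and does not come from the paper.

```python
def profit(order, demand, price=10.0, cost=4.0):
    """Profit when `order` units are bought and at most `demand` can be sold.

    Hypothetical toy model; the price and cost values are assumptions.
    """
    return price * min(order, demand) - cost * order


# Assumed two-point demand distribution: 50 or 150 units, equally likely.
demands = [50, 150]
probs = [0.5, 0.5]

mean_demand = sum(p * d for p, d in zip(probs, demands))

# Deterministic shortcut: plan as if demand always equals its mean.
order = mean_demand
profit_at_mean_demand = profit(order, mean_demand)

# What the same order actually earns on average under the true distribution.
expected_profit = sum(p * profit(order, d) for p, d in zip(probs, demands))

print(mean_demand, profit_at_mean_demand, expected_profit)  # 100.0 600.0 350.0
```

Because the profit function is concave in demand, plugging in the mean overstates the expected profit (600 versus 350 here); the deterministic model promises a result that the uncertain problem does not deliver on average.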

artificial intelligence, machine learning, wasserstein distributionally robust optimization, (20 more...)

arXiv.org Machine Learning

2411.02549

Country:

Europe > United Kingdom > England (0.27)
Asia > Middle East (0.27)
North America > United States > California (0.27)

Genre:

Research Report (1.00)
Overview (1.00)
Instructional Material > Course Syllabus & Notes (0.45)

Industry:

Banking & Finance (1.00)
Energy > Oil & Gas > Upstream (0.67)
Government (0.67)
Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.45)

Optimization Algorithm Design via Electric Circuits

Boyd, Stephen P., Parshakova, Tetiana, Ryu, Ernest K., Suh, Jaewook J.

arXiv.org Artificial Intelligence, Nov-4-2024

We present a novel methodology for convex optimization algorithm design using ideas from electric RLC circuits. Given an optimization problem, the first stage of the methodology is to design an appropriate electric circuit whose continuous-time dynamics converge to the solution of the optimization problem at hand. Then, the second stage is an automated, computer-assisted discretization of the continuous-time dynamics, yielding a provably convergent discrete-time algorithm. Our methodology recovers many classical (distributed) optimization algorithms and enables users to quickly design and explore a wide range of new algorithms with convergence guarantees.
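The general idea of turning continuous-time dynamics into a discrete-time algorithm can be sketched with the simplest textbook case: the gradient flow dx/dt = -∇f(x), discretized with forward Euler, gives plain gradient descent. This is only a stand-in to illustrate the two-stage idea; it is not the paper's circuit construction or its automated discretization, and the objective, step size, and function names are assumptions.

```python
def grad_f(x):
    """Gradient of the assumed toy objective f(x) = 0.5 * (x - 3)^2."""
    return x - 3.0


def euler_gradient_flow(x0, step, num_steps):
    """Forward-Euler discretization of dx/dt = -grad_f(x).

    Each Euler step x <- x - step * grad_f(x) is exactly one
    gradient-descent iteration with learning rate `step`.
    """
    x = x0
    for _ in range(num_steps):
        x = x - step * grad_f(x)
    return x


x_final = euler_gradient_flow(x0=0.0, step=0.1, num_steps=200)
```

For this quadratic, each step contracts the distance to the minimizer x = 3 by a factor of 0.9, so the iterates converge; in general the step size has to be chosen carefully for the discretization to inherit the convergence of the continuous-time dynamics.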

algorithm, artificial intelligence, optimization problem, (14 more...)

arXiv.org Artificial Intelligence

2411.02573

Country:

Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Massachusetts (0.04)
(3 more...)

Genre:

Instructional Material > Course Syllabus & Notes (0.67)
Research Report (0.50)
Overview (0.45)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

Detecting Student Disengagement in Online Classes Using Deep Learning: A Review

Mohamed, Ahmed, Ali, Mostafa, Ahmed, Shahd, Hani, Nouran, Hisham, Mohammed, Mahmoud, Meram

arXiv.org Artificial Intelligence, Nov-4-2024

Student disengagement in online learning has become a critical challenge, particularly post-pandemic. This review explores deep learning techniques used to detect disengagement, emphasizing computer vision and affective computing as effective approaches. We examine recent studies focusing on facial expressions, eye movements, and posture to assess student attention, along with non-face-based indicators like mouse activity. A systematic review of 38 selected studies outlines the indicators, methods, and models employed in this field, providing insights for future research on real-time engagement monitoring in online classrooms.

artificial intelligence, engagement, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2411.10464

Country:

Africa > Middle East > Egypt > Cairo Governorate > Cairo (0.07)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)

Genre:

Research Report (1.00)
Instructional Material > Online (0.83)
Instructional Material > Course Syllabus & Notes (0.50)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

RoboCrowd: Scaling Robot Data Collection through Crowdsourcing

Mirchandani, Suvir, Yuan, David D., Burns, Kaylee, Islam, Md Sazzad, Zhao, Tony Z., Finn, Chelsea, Sadigh, Dorsa

arXiv.org Artificial Intelligence, Nov-4-2024

In recent years, imitation learning from large-scale human demonstrations has emerged as a promising paradigm for training robot policies. However, the burden of collecting large quantities of human demonstrations is significant in terms of collection time and the need for access to expert operators. We introduce a new data collection paradigm, RoboCrowd, which distributes the workload by utilizing crowdsourcing principles and incentive design. RoboCrowd helps enable scalable data collection and facilitates more efficient learning of robot policies. We build RoboCrowd on top of ALOHA (Zhao et al. 2023) -- a bimanual platform that supports data collection via puppeteering -- to explore the design space for crowdsourcing in-person demonstrations in a public environment. We propose three classes of incentive mechanisms to appeal to users' varying sources of motivation for interacting with the system: material rewards, intrinsic interest, and social comparison. We instantiate these incentives through tasks that include physical rewards, engaging or challenging manipulations, as well as gamification elements such as a leaderboard. We conduct a large-scale, two-week field experiment in which the platform is situated in a university cafe. We observe significant engagement with the system -- over 200 individuals independently volunteered to provide a total of over 800 interaction episodes. Our findings validate the proposed incentives as mechanisms for shaping users' data quantity and quality. Further, we demonstrate that the crowdsourced data can serve as useful pre-training data for policies fine-tuned on expert demonstrations -- boosting performance up to 20% compared to when this data is not available. These results suggest the potential for RoboCrowd to reduce the burden of robot data collection by carefully implementing crowdsourcing and incentive design principles.

demonstration, interface, trajectory, (16 more...)

arXiv.org Artificial Intelligence

2411.01915

Country: North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre:

Research Report > New Finding (1.00)
Instructional Material > Course Syllabus & Notes (0.93)

Industry:

Leisure & Entertainment > Games > Computer Games (0.48)
Information Technology > Security & Privacy (0.34)

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Eurekaverse: Environment Curriculum Generation via Large Language Models

Liang, William, Wang, Sam, Wang, Hung-Ju, Bastani, Osbert, Jayaraman, Dinesh, Ma, Yecheng Jason

arXiv.org Artificial Intelligence, Nov-3-2024

Recent work has demonstrated that a promising strategy for teaching robots a wide range of complex skills is to train them on a curriculum of progressively more challenging environments. However, developing an effective curriculum of environment distributions currently requires significant expertise, and the effort must be repeated for every new domain. Our key insight is that environments are often naturally represented as code. Thus, we probe whether effective environment curriculum design can be achieved and automated via code generation by large language models (LLMs). In this paper, we introduce Eurekaverse, an unsupervised environment design algorithm that uses LLMs to sample progressively more challenging, diverse, and learnable environments for skill training. We validate Eurekaverse's effectiveness in the domain of quadrupedal parkour learning, in which a quadruped robot must traverse a variety of obstacle courses. The automatic curriculum designed by Eurekaverse enables gradual learning of complex parkour skills in simulation and can successfully transfer to the real world, outperforming manual training courses designed by humans.

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2411.01775

Country: North America > United States > Pennsylvania (0.04)

Genre: Instructional Material > Course Syllabus & Notes (0.88)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
