jam
Can an AI-Powered Presentation Platform Based On The Game "Just a Minute" Be Used To Improve Students' Public Speaking Skills?
This study explores the effectiveness of applying AI and gamification into a presentation platform aimed at University students wanting to improve their public speaking skills in their native tongue. Specifically, a platform based on the radio show, Just a Minute (JAM), is explored. In this game, players are challenged to speak fluently on a topic for 60 seconds without repeating themselves, hesitating or deviating from the topic. JAM has proposed benefits such as allowing students to improve their spontaneous speaking skills and reduce their use of speech disfluencies ("um", "uh", etc.). Previous research has highlighted the difficulties students face when speaking publicly, the main one being anxiety. AI Powered Presentation Platforms (AI-PPPs), where students can speak with an immersive AI audience and receive real-time feedback, have been explored as a method to improve student's speaking skills and confidence. So far they have shown promising results which this study aims to build upon. A group of students from the University of York are enlisted to evaluate the effectiveness of the JAM platform. They are asked to fill in a questionnaire, play through the game twice and then complete a final questionnaire to discuss their experiences playing the game. Various statistics are gathered during their gameplay such as the number of points they gained and the number of rules they broke. The results showed that students found the game promising and believed that their speaking skills could improve if they played the game for longer. More work will need to be carried out to prove the effectiveness of the game beyond the short term.
- North America > United States (0.04)
- North America > Canada > Manitoba > Winnipeg Metropolitan Region > Winnipeg (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Switzerland (0.04)
- Research Report > New Finding (1.00)
- Questionnaire & Opinion Survey (1.00)
- Education > Educational Setting (1.00)
- Leisure & Entertainment > Games > Computer Games (0.68)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
Liu, Renhang, Hung, Chia-Yu, Majumder, Navonil, Gautreaux, Taylor, Bagherzadeh, Amir Ali, Li, Chuan, Herremans, Dorien, Poria, Soujanya
Diffusion and flow-matching models have revolutionized automatic text-to-audio generation in recent times. These models are increasingly capable of generating high quality and faithful audio outputs capturing to speech and acoustic events. However, there is still much room for improvement in creative audio generation that primarily involves music and songs. Recent open lyrics-to-song models, such as, DiffRhythm, ACE-Step, and LeVo, have set an acceptable standard in automatic song generation for recreational use. However, these models lack fine-grained word-level controllability often desired by musicians in their workflows. To the best of our knowledge, our flow-matching-based JAM is the first effort toward endowing word-level timing and duration control in song generation, allowing fine-grained vocal control. To enhance the quality of generated songs to better align with human preferences, we implement aesthetic alignment through Direct Preference Optimization, which iteratively refines the model using a synthetic dataset, eliminating the need or manual data annotations. Furthermore, we aim to standardize the evaluation of such lyrics-to-song models through our public evaluation dataset JAME. We show that JAM outperforms the existing models in terms of the music-specific attributes.
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
- Europe > United Kingdom > North Sea > Southern North Sea (0.04)
- Asia > Singapore (0.04)
- Media > Music (1.00)
- Leisure & Entertainment (1.00)
JAM: Keypoint-Guided Joint Prediction after Classification-Aware Marginal Proposal for Multi-Agent Interaction
Lin, Fangze, He, Ying, Yu, Fei, Zhang, Hong
Predicting the future motion of road participants is a critical task in autonomous driving. In this work, we address the challenge of low-quality generation of low-probability modes in multi-agent joint prediction. To tackle this issue, we propose a two-stage multi-agent interactive prediction framework named \textit{keypoint-guided joint prediction after classification-aware marginal proposal} (JAM). The first stage is modeled as a marginal prediction process, which classifies queries by trajectory type to encourage the model to learn all categories of trajectories, providing comprehensive mode information for the joint prediction module. The second stage is modeled as a joint prediction process, which takes the scene context and the marginal proposals from the first stage as inputs to learn the final joint distribution. We explicitly introduce key waypoints to guide the joint prediction module in better capturing and leveraging the critical information from the initial predicted trajectories. We conduct extensive experiments on the real-world Waymo Open Motion Dataset interactive prediction benchmark. The results show that our approach achieves competitive performance. In particular, in the framework comparison experiments, the proposed JAM outperforms other prediction frameworks and achieves state-of-the-art performance in interactive trajectory prediction. The code is available at https://github.com/LinFunster/JAM to facilitate future research.
- Asia > China > Guangdong Province > Shenzhen (0.05)
- North America > Canada > Ontario > National Capital Region > Ottawa (0.04)
- Transportation > Ground > Road (0.35)
- Information Technology (0.35)
Just Ask for Music (JAM): Multimodal and Personalized Natural Language Music Recommendation
Melchiorre, Alessandro B., Epure, Elena V., Masoudian, Shahed, Escobedo, Gustavo, Hausberger, Anna, Moussallam, Manuel, Schedl, Markus
Natural language interfaces offer a compelling approach for music recommendation, enabling users to express complex preferences conversationally. While Large Language Models (LLMs) show promise in this direction, their scalability in recommender systems is limited by high costs and latency. Retrieval-based approaches using smaller language models mitigate these issues but often rely on single-modal item representations, overlook long-term user preferences, and require full model retraining, posing challenges for real-world deployment. In this paper, we present JAM (Just Ask for Music), a lightweight and intuitive framework for natural language music recommendation. JAM models user-query-item interactions as vector translations in a shared latent space, inspired by knowledge graph embedding methods like TransE. To capture the complexity of music and user intent, JAM aggregates multimodal item features via cross-attention and sparse mixture-of-experts. We also introduce JAMSessions, a new dataset of over 100k user-query-item triples with anonymized user/item embeddings, uniquely combining conversational queries and user long-term preferences. Our results show that JAM provides accurate recommendations, produces intuitive representations suitable for practical use cases, and can be easily integrated with existing music recommendation stacks.
Noise-Enabled Goal Attainment in Crowded Collectives
Liu, Lucy, Werfel, Justin, Toschi, Federico, Mahadevan, L.
Departments of Physics, and Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138 In crowded environments, individuals must navigate around other occupants to reach their destinations. Understanding and controlling traffic flows in these spaces is relevant to coordinating robot swarms and designing infrastructure for dense populations. Here, we combine simulations, theory, and robotic experiments to study how noisy motion can disrupt traffic jams and enable flow as agents travel to individual goals. Above a critical noise level, large jams do not persist. From this observation, we analytically approximate the goal attainment rate as a function of the noise level, then solve for the optimal agent density and noise level that maximize the swarm's goal attainment rate. We perform robotic experiments to corroborate our simulated and theoretical results. Finally, we compare simple, local navigation approaches with a sophisticated but computationally costly central planner. A simple reactive scheme performs well up to moderate densities and is far more computationally efficient than a planner, suggesting lessons for real-world problems. Consider a robot team with a time-sensitive distributed task such as assembling a machine, fulfilling orders in a warehouse, or cleaning up hazardous debris. Robots must transport items to specific goal locations. When the space is relatively empty, adding robots is advantageous: several robots together work faster than a lone one. However, adding too many robots will lead to traffic that slows the entire team down. Emergent traffic patterns like jam formation, laning, and various transitions between ordered and disordered behavior have been studied in diverse settings spanning car traffic [1, 2], colloids and bacteria [3], robots [4, 5], ants [6, 7], and humans [8, 9]. In these systems, the simple constraint that two agents cannot occupy the same location at the same time, so that agents must stop or slow down in high-traffic regions, produces a set of rich and interesting phenomena. For instance, it is known that collectives of random walkers with exclusion constraints alone and no attraction can self-organize into large jams [3], and that ants or pedestrians following simple rules can self-organize into lanes [7, 9].
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.24)
- Europe > Netherlands > North Brabant > Eindhoven (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- (2 more...)
JAM: Controllable and Responsible Text Generation via Causal Reasoning and Latent Vector Manipulation
Huang, Yingbing, Chen, Deming, Umrawal, Abhishek K.
While large language models (LLMs) have made significant strides in generating coherent and contextually relevant text, they often function as opaque black boxes, trained on vast unlabeled datasets with statistical objectives, lacking an interpretable framework for responsible control. In this paper, we introduce JAM (Just A Move), a novel framework that interprets and controls text generation by integrating cause-effect analysis within the latent space of LLMs. Based on our observations, we uncover the inherent causality in LLM generation, which is critical for producing responsible and realistic outputs. Moreover, we explore latent vectors as fundamental components in LLM architectures, aiming to understand and manipulate them for more effective and efficient controllable text generation. We evaluate our framework using a range of tools, including the HHH criteria, toxicity reduction benchmarks, and GPT-4 alignment measures. Our results show that JAM achieves up to a 22% improvement over previous Controllable Text Generation (CTG) methods across multiple quantitative metrics and human-centric evaluations. Furthermore, JAM demonstrates greater computational efficiency compared to other CTG methods. These results highlight the effectiveness and efficiency of JAM for responsible and realistic text generation, paving the way for more interpretable and controllable models.
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.93)
Single file motion of robot swarms
Alonso-Llanes, Laciel, Garcimartín, Angel, Zuriguel, Iker
We present experimental results on the single file motion of a group of robots interacting with each other through position sensors. We successfully replicate the fundamental diagram typical of these systems, with a transition from free flow to congested traffic as the density of the system increases. In the latter scenario we also observe the characteristic stop-and-go waves. The unique advantages of this novel system, such as experimental stability and repeatability, allow for extended experimental runs, facilitating a comprehensive statistical analysis of the global dynamics. Above a certain density, we observe a divergence of the average jam duration and the average number of robots involved in it. This discovery enables us to precisely identify another transition: from congested intermittent flow (for intermediate densities) to a totally congested scenario for high densities. Beyond this finding, the present work demonstrates the suitability of robot swarms to model complex behaviors in many particle systems.
- Europe > Switzerland (0.04)
- Europe > Spain > Navarre > Pamplona (0.04)
- North America > United States > New York (0.04)
- Europe > France (0.04)
The 20 most puzzling questions in modern life revealed - so do YOU know the answers?
What is an NFT? (34%) Non-fungible tokens (NFTs) are generally digital art pieces or music that can be bought or traded online. These are unique computer files encrypted with an artist's signature. As a result, they cannot be replicated, acting as a digital certificate of ownership and authenticity. In other words, buying an NFT is almost like the more traditional purchasing of fine art - except in a digital form. Artists can sell pieces that may be tricky to advertise otherwise, such as digital stickers.
- North America > United States (0.14)
- Europe > United Kingdom > England (0.05)
- Asia > Japan (0.05)
- (6 more...)
- Banking & Finance (0.96)
- Leisure & Entertainment > Sports > Cricket (0.50)
- Information Technology > Services (0.49)
- (2 more...)
The great British scone debate is SOLVED: ChatGPT reveals whether you should put jam or cream first
With their crumbly texture and smeared with clotted cream and jam, scones are a favourite treat with Brits across the UK. But despite dating back to the early 1500s, one question remains – should you put the cream or jam on first? Now, ChatGPT claims to have settled the debate, just in time for King Charles' coronation. The AI chatbot says it would opt for the'Devon method' of putting the clotted cream on the scone first, followed by the jam on top. Its choice has enraged many scone fans on Twitter, with comedian Dawn French replying: 'You are a robot with no taste (literally & figuratively) & no respect for all that is holy.
Massive traffic experiment pits machine learning against 'phantom' jams
CIRCLES Consortium research is supported by the National Science Foundation, the U.S. Department of Transportation and the U.S. Department of Energy. Additional funding was provided by Nissan, Toyota North America, General Motors, the Federal Highway Administration, the Tennessee Department of Transportation, the California Department of Transportation, the Nashville Department of Transportation, Gresham Smith, Siemens, Deutsches Zentrum für Luft- und Raumfahrt (DLR), Amazon Web Services (AWS), C3.ai Digital Transformation Institute, the UC Berkeley Institute of Transportation Studies, Vanderbilt University, the University of Arizona, Rutgers University, Temple University, Ecole des Ponts ParisTech and the Université Gustave Eiffel.
- North America > United States > Tennessee (0.34)
- North America > United States > California (0.34)
- North America > United States > Arizona (0.34)