AITopics

2501.17099

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
(24 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Education (1.00)
Leisure & Entertainment (0.93)
Health & Medicine > Consumer Health (0.87)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.71)

arXiv.org Artificial IntelligenceJan-28-2025

Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding

Li, Yun, Liu, Zhe, Kong, Yajing, Li, Guangrui, Zhang, Jiyuan, Bian, Chao, Liu, Feng, Yao, Lina, Sun, Zhenbang

Applying Multimodal Large Language Models (MLLMs) to video understanding presents significant challenges due to the need to model temporal relations across frames. Existing approaches adopt either implicit temporal modeling, relying solely on the LLM decoder, or explicit temporal modeling, employing auxiliary temporal encoders. To investigate this debate between the two paradigms, we propose the Stackable Temporal Encoder (STE). STE enables flexible explicit temporal modeling with adjustable temporal receptive fields and token compression ratios. Using STE, we systematically compare implicit and explicit temporal modeling across dimensions such as overall performance, token compression effectiveness, and temporal-specific understanding. We also explore STE's design considerations and broader impacts as a plug-in module and in image modalities. Our findings emphasize the critical role of explicit temporal modeling, providing actionable insights to advance video MLLMs.

large language model, machine learning, temporal modeling, (14 more...)

2501.16786

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Lei, Chunyu, Chen, Guang-Ze, Chen, C. L. Philip, Zhang, Tong

Online-BLS: An Accurate and Efficient Online Broad Learning System for Data Stream Classification

arXiv.org Artificial IntelligenceJan-28-2025

The state-of-the-art online learning models generally conduct a single online gradient descent when a new sample arrives and thus suffer from suboptimal model weights. To this end, we introduce an online broad learning system framework with closed-form solutions for each online update. Different from employing existing incremental broad learning algorithms for online learning tasks, which tend to incur degraded accuracy and expensive online update overhead, we design an effective weight estimation algorithm and an efficient online updating strategy to remedy the above two deficiencies, respectively. Specifically, an effective weight estimation algorithm is first developed by replacing notorious matrix inverse operations with Cholesky decomposition and forward-backward substitution to improve model accuracy. Second, we devise an efficient online updating strategy that dramatically reduces online update time. Theoretical analysis exhibits the splendid error bound and low time complexity of our model. The most popular test-then-training evaluation experiments on various real-world datasets prove its superiority and efficiency. Furthermore, our framework is naturally extended to data stream scenarios with concept drift and exceeds state-of-the-art baselines.

algorithm, dataset, online-bl, (10 more...)

2501.16932

Country:

Oceania > Australia > New South Wales (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
(2 more...)

Genre:

Instructional Material > Online (0.61)
Research Report (0.50)

Industry: Education > Educational Setting > Online (0.55)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)

Daily Mail - Science & techJan-27-2025, 13:36:43 GMT

It pays to be pretty! Attractive people earn up to 11% MORE than their ugly colleagues, study finds

Whether it's taking on more responsibilities or staying late in the office, many employees will go above and beyond to try to get a pay rise. But now a study suggests that if you're not good looking, your efforts may be futile. Researchers from the Institute for Operations Research and the Management Sciences in Baltimore have uncovered a'striking' link between physical attractiveness and career success. In their study, the team analysed the careers of more than 40,000 graduates who had completed MBAs. They found attractive respondents earned up to 11 per cent more than their colleagues who were seen as less good looking.

attractive people, beauty premium, ugly colleague, (10 more...)

Daily Mail - Science & tech

Country: Oceania > Australia > Western Australia (0.05)

Genre: Research Report > New Finding (0.50)

Technology: Information Technology > Artificial Intelligence (0.32)

The New YorkerJan-27-2025, 11:00:00 GMT

Why We're in Love with Apocalypse

It's a mite soon to start grieving, but scientists now project that life on Earth will probably end in about a billion years. A Monday in February, 1,000,002,025, would be my guess. On that inhospitable day, give or take a few million years, the sun will become so hot that the oceans will boil, Earth's oxygen will disappear, and photosynthesis will cease, as will all living things. We should be so lucky. There's a pretty fair chance that life could be wiped out well before then--say, in early June, 2034, or on a cloudy Sunday in November, 3633. Plenty of people do, as it turns out, and, if you want to know who they are, Dorian Lynskey's "Everything Must Go: The Stories We Tell About the End of the World" (Pantheon) is a good place to start. Lynskey, a British journalist and podcaster, has assembled biological, geological, archeological, literary, and cinematic permutations of existential finales, leaving no stone unturned, be it meteor, comet, or asteroid. If a book, a song, a story, a film, a headline, a title, or a study has "world" and "end" in it, Lynskey has unearthed it.

apocalypse, lynskey, revelation, (16 more...)

The New Yorker

Country:

Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.04)
Oceania > Australia (0.04)
North America > United States > New York (0.04)
(6 more...)

Genre: Summary/Review (0.50)

Industry:

Leisure & Entertainment (1.00)
Government > Military (0.47)
Health & Medicine > Therapeutic Area (0.47)
Media > Film (0.46)

Technology: Information Technology > Artificial Intelligence (0.68)

STAR: Stepwise Task Augmentation and Relation Learning for Aspect Sentiment Quad Prediction

Lai, Wenna, Xie, Haoran, Xu, Guandong, Li, Qing

Aspect-based sentiment analysis (ABSA) aims to identify four sentiment elements, including aspect term, aspect category, opinion term, and sentiment polarity. These elements construct the complete picture of sentiments. The most challenging task, aspect sentiment quad prediction (ASQP), predicts these elements simultaneously, hindered by difficulties in accurately coupling different sentiment elements. A key challenge is insufficient annotated data that limits the capability of models in semantic understanding and reasoning about quad prediction. To address this, we propose stepwise task augmentation and relation learning (STAR), a strategy inspired by human reasoning. STAR constructs auxiliary data to learn quadruple relationships incrementally by augmenting with pairwise and overall relation tasks derived from training data. By encouraging the model to infer causal relationships among sentiment elements without requiring additional annotations, STAR effectively enhances quad prediction. Extensive experiments demonstrate the proposed STAR exhibits superior performance on four benchmark datasets.

artificial intelligence, machine learning, natural language, (16 more...)

2501.16093

Country:

Oceania > Australia > New South Wales > Sydney (0.14)
Asia > China > Hong Kong (0.06)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.36)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.36)

Regulatory Science Innovation for Generative AI and Large Language Models in Health and Medicine: A Global Call for Action

Ong, Jasmine Chiat Ling, Ning, Yilin, Liu, Mingxuan, Ma, Yian, Liang, Zhao, Singh, Kuldev, Chang, Robert T, Vogel, Silke, Lim, John CW, Tan, Iris Siu Kwan, Freyer, Oscar, Gilbert, Stephen, Bitterman, Danielle S, Liu, Xiaoxuan, Denniston, Alastair K, Liu, Nan

The integration of generative AI (GenAI) and large language models (LLMs) in healthcare presents both unprecedented opportunities and challenges, necessitating innovative regulatory approaches. GenAI and LLMs offer broad applications, from automating clinical workflows to personalizing diagnostics. However, the non-deterministic outputs, broad functionalities and complex integration of GenAI and LLMs challenge existing medical device regulatory frameworks, including the total product life cycle (TPLC) approach. Here we discuss the constraints of the TPLC approach to GenAI and LLM-based medical device regulation, and advocate for global collaboration in regulatory science research. This serves as the foundation for developing innovative approaches including adaptive policies and regulatory sandboxes, to test and refine governance in real-world settings. International harmonization, as seen with the International Medical Device Regulators Forum, is essential to manage implications of LLM on global health, including risks of widening health inequities driven by inherent model biases. By engaging multidisciplinary expertise, prioritizing iterative, data-driven approaches, and focusing on the needs of diverse populations, global regulatory science research enables the responsible and equitable advancement of LLM innovations in healthcare.

large language model, machine learning, natural language, (16 more...)

2502.07794

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > Singapore > Central Region > Singapore (0.05)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
(8 more...)

Genre: Research Report > Experimental Study (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Technology (1.00)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.85)

Indiana Jones: There Are Always Some Useful Ancient Relics

Ding, Junchen, Zhang, Jiahao, Liu, Yi, Ding, Ziqi, Deng, Gelei, Li, Yuekang

This paper introduces Indiana Jones, an innovative approach to jailbreaking Large Language Models (LLMs) by leveraging inter-model dialogues and keyword-driven prompts. Through orchestrating interactions among three specialised LLMs, the method achieves near-perfect success rates in bypassing content safeguards in both white-box and black-box LLMs. The research exposes systemic vulnerabilities within contemporary models, particularly their susceptibility to producing harmful or unethical outputs when guided by ostensibly innocuous prompts framed in historical or contextual contexts. Experimental evaluations highlight the efficacy and adaptability of Indiana Jones, demonstrating its superiority over existing jailbreak methods. These findings emphasise the urgent need for enhanced ethical safeguards and robust security measures in the development of LLMs. Moreover, this work provides a critical foundation for future studies aimed at fortifying LLMs against adversarial exploitation while preserving their utility and flexibility.

large language model, machine learning, natural language, (19 more...)

2501.18628

Country:

North America > United States > Indiana (0.83)
Europe > United Kingdom (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
(8 more...)

Genre: Research Report (1.00)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Security & Privacy (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Han, Sukjin, Lee, Kyungho

Copyright and Competition: Estimating Supply and Demand with Unstructured Data

arXiv.org Machine LearningJan-27-2025

Copyright policies play a pivotal role in protecting the intellectual property of creators and companies in creative industries. The advent of cost-reducing technologies, such as generative AI, in these industries calls for renewed attention to the role of these policies. This paper studies product positioning and competition in a market of creatively differentiated products and the competitive and welfare effects of copyright protection. A common feature of products with creative elements is that their key attributes (e.g., images and text) are unstructured and thus high-dimensional. We focus on a stylized design product, fonts, and use data from the world's largest online marketplace for fonts. We use neural network embeddings to quantify unstructured attributes and measure the visual similarity. We show that this measure closely aligns with actual human perception. Based on this measure, we empirically find that competitions occur locally in the visual characteristics space. We then develop a structural model for supply and demand that integrate the embeddings. Through counterfactual analyses, we find that local copyright protection can enhance consumer welfare when products are relocated, and the interplay between copyright and cost-reducing technologies is essential in determining an optimal policy for social welfare. We believe that the embedding analysis and empirical models introduced in this paper can be applicable to a range of industries where unstructured data captures essential features of products and markets.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

2501.1612

Country:

Europe > United Kingdom (0.45)
North America > Canada (0.14)
Europe > Italy (0.04)
(10 more...)

Genre:

Research Report > New Finding (0.68)
Research Report > Experimental Study (0.68)

Industry:

Leisure & Entertainment (1.00)
Law > Intellectual Property & Technology Law (1.00)
Government (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Determining Mosaic Resilience in Sugarcane Plants using Hyperspectral Images

Zia, Ali, Zhou, Jun, Olayemi, Muyiwa

Sugarcane mosaic disease poses a serious threat to the Australian sugarcane industry, leading to yield losses of up to 30% in susceptible varieties. Existing manual inspection methods for detecting mosaic resilience are inefficient and impractical for large-scale application. This study introduces a novel approach using hyperspectral imaging and machine learning to detect mosaic resilience by leveraging global feature representation from local spectral patches. Hyperspectral data were collected from eight sugarcane varieties under controlled and field conditions. Local spectral patches were analyzed to capture spatial and spectral variations, which were then aggregated into global feature representations using a ResNet18 deep learning architecture. While classical methods like Support Vector Machines struggled to utilize spatial-spectral relationships effectively, the deep learning model achieved high classification accuracy, demonstrating its capacity to identify mosaic resilience from fine-grained hyperspectral data. This approach enhances early detection capabilities, enabling more efficient management of susceptible strains and contributing to sustainable sugarcane production.

artificial intelligence, detection, machine learning, (17 more...)

2501.167

Country:

Oceania > Australia > Queensland (0.04)
North America (0.04)
Asia (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)