AITopics | Law


Law

ImpliHateVid: A Benchmark Dataset and Two-stage Contrastive Learning Framework for Implicit Hate Speech Detection in Videos

Rehman, Mohammad Zia Ur, Bhatnagar, Anukriti, Kabde, Omkar, Bansal, Shubhi, Kumar, Nagendra

arXiv.org Artificial Intelligence, Aug-18-2025

While existing research has primarily focused on text- and image-based hate speech detection, video-based approaches remain underexplored. In this work, we introduce a novel dataset, ImpliHateVid, specifically curated for implicit hate speech detection in videos. ImpliHateVid consists of 2,009 videos comprising 509 implicit hate videos, 500 explicit hate videos, and 1,000 non-hate videos, making it one of the first large-scale video datasets dedicated to implicit hate detection. We also propose a novel two-stage contrastive learning framework for hate speech detection in videos. In the first stage, we train modality-specific encoders for audio, text, and image using contrastive loss by concatenating features from the three encoders. In the second stage, we train cross-encoders using contrastive learning to refine multimodal representations. Additionally, we incorporate sentiment, emotion, and caption-based features to enhance implicit hate detection. We evaluate our method on two datasets: ImpliHateVid for implicit hate speech detection and HateMM for general hate speech detection in videos. The results demonstrate the effectiveness of the proposed multimodal contrastive learning for hateful content detection in videos and the significance of our dataset.
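To make the first-stage idea concrete, here is a minimal sketch of a contrastive loss applied to concatenated audio, text, and image features. It is not the authors' formulation: the abstract does not specify the loss, so this uses a generic margin-based pairwise contrastive loss, and the function names, the margin value, and the toy feature vectors are all assumptions made for illustration.

```python
import math
from itertools import combinations


def concat_modalities(audio, text, image):
    """Concatenate per-modality feature vectors into one joint vector."""
    return list(audio) + list(text) + list(image)


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pairwise_contrastive_loss(features, labels, margin=1.0):
    """Generic margin-based contrastive loss, averaged over all distinct pairs.

    Same-label pairs are penalized by their squared distance (pulled together);
    different-label pairs are penalized only when closer than `margin`.
    The margin value is an assumption, not taken from the paper.
    """
    total, count = 0.0, 0
    for i, j in combinations(range(len(features)), 2):
        d = euclidean(features[i], features[j])
        if labels[i] == labels[j]:
            total += d ** 2
        else:
            total += max(margin - d, 0.0) ** 2
        count += 1
    return total / count if count else 0.0


# Toy, hand-written features (hypothetical values, one row per video).
videos = [
    concat_modalities([0.1, 0.2], [0.0, 0.3], [0.5]),
    concat_modalities([0.1, 0.1], [0.1, 0.3], [0.4]),
    concat_modalities([0.9, 0.8], [0.7, 0.9], [0.1]),
]
labels = ["implicit", "implicit", "non-hate"]
print(pairwise_contrastive_loss(videos, labels))
```

In a real training loop the feature vectors would come from learned encoders and the loss would be minimized by gradient descent; the sketch only shows how the pairwise objective is computed on joint features.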

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2025.acl-long.842

2508.0657

Country:

Asia > India (0.28)
North America > United States > Minnesota (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Law (0.46)
Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)


MMESGBench: Pioneering Multimodal Understanding and Complex Reasoning Benchmark for ESG Tasks

Zhang, Lei, Zhou, Xin, He, Chaoyue, Wang, Di, Wu, Yi, Xu, Hong, Liu, Wei, Miao, Chunyan

arXiv.org Artificial Intelligence, Aug-18-2025

Environmental, Social, and Governance (ESG) reports are essential for evaluating sustainability practices, ensuring regulatory compliance, and promoting financial transparency. However, these documents are often lengthy, structurally diverse, and multimodal, comprising dense text, structured tables, complex figures, and layout-dependent semantics. Existing AI systems often struggle to perform reliable document-level reasoning in such settings, and no dedicated benchmark currently exists in the ESG domain. To fill the gap, we introduce MMESGBench, a first-of-its-kind benchmark dataset designed to evaluate multimodal understanding and complex reasoning across structurally diverse and multi-source ESG documents. This dataset is constructed via a human-AI collaborative, multi-stage pipeline. First, a multimodal LLM generates candidate question-answer (QA) pairs by jointly interpreting rich textual, tabular, and visual information from layout-aware document pages. Second, an LLM verifies the semantic accuracy, completeness, and reasoning complexity of each QA pair. This automated process is followed by expert-in-the-loop validation, where domain specialists validate and calibrate QA pairs to ensure quality, relevance, and diversity. MMESGBench comprises 933 validated QA pairs derived from 45 ESG documents, spanning seven distinct document types and three major ESG source categories. Questions are categorized as single-page, cross-page, or unanswerable, each accompanied by fine-grained multimodal evidence. Initial experiments validate that multimodal and retrieval-augmented models substantially outperform text-only baselines, particularly on visually grounded and cross-page tasks. MMESGBench is publicly available as an open-source dataset at https://github.com/Zhanglei1103/MMESGBench.

large language model, machine learning, question answering, (18 more...)

arXiv.org Artificial Intelligence

2507.18932

Country: Asia > Singapore (0.17)

Genre: Research Report (0.50)

Industry:

Law (1.00)
Government (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)


LIPS - Learning Industrial Physical Simulation benchmark suite - Appendix

Neural Information Processing Systems, Aug-17-2025, 23:51:07 GMT

As we can see in Figure 7, the speed-up factor depends on the batch size and increases with the batch size.

artificial intelligence, dataset, machine learning, (16 more...)

Neural Information Processing Systems

Country: Europe > France > Île-de-France > Paris > Paris (0.04)

Genre: Research Report (0.67)

Industry:

Energy > Power Industry (1.00)
Law (0.92)
Information Technology (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)


de043a5e421240eb846da8effe472ff1-Paper.pdf

Neural Information Processing Systems, Aug-17-2025, 23:49:43 GMT

explanation, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

Europe (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > Colorado > El Paso County > Colorado Springs (0.04)
Africa (0.04)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.93)

Industry:

Law (0.93)
Health & Medicine > Therapeutic Area (0.46)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
(2 more...)


NAS-Bench-Suite-Zero: Accelerating Research on Zero Cost Proxies

Neural Information Processing Systems, Aug-17-2025, 23:25:16 GMT

Algorithms for neural architecture search (NAS) seek to automate the design of high-performing neural architectures for a given dataset.

artificial intelligence, machine learning, zc proxy, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Germany > Baden-Württemberg > Freiburg (0.04)

Genre: Research Report > New Finding (0.67)

Industry:

Law (0.67)
Government (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)


Overleaf Example

Neural Information Processing Systems, Aug-17-2025, 23:02:19 GMT

artificial intelligence, data quality, machine learning, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > Richmond County > New York City (0.14)
North America > United States > New York > Queens County > New York City (0.14)
North America > United States > New York > New York County > New York City (0.14)
(23 more...)

Genre:

Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law Enforcement & Public Safety > Corrections (1.00)
(8 more...)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Quality (1.00)
Information Technology > Communications (0.67)
Information Technology > Artificial Intelligence > Machine Learning (0.67)


b3640c2d3e58f716c67066046318db0f-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems, Aug-17-2025, 23:02:15 GMT

artificial intelligence, data mining, machine learning, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)
North America > United States > Illinois > Cook County > Chicago (0.05)
North America > United States > Virginia (0.04)
(10 more...)

Genre:

Research Report (1.00)
Questionnaire & Opinion Survey (0.68)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Health & Medicine (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Data Science > Data Mining (0.68)


A Additional prompt data details

Neural Information Processing Systems, Aug-17-2025, 20:43:14 GMT

Destination will be a red barn on the right 1. Use Case Example rewrite Rewrite the following text to be more light-hearted: -- {very formal text} -- chat The following is a conversation with an AI assistant.

completion, large language model, machine learning, (23 more...)

Neural Information Processing Systems

Country:

Europe > Greece (0.04)
Asia > Southeast Asia (0.04)
Oceania > New Zealand (0.04)
North America > United States > District of Columbia > Washington (0.04)

Genre:

Questionnaire & Opinion Survey (0.93)
Personal > Obituary (0.45)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance > Economy (1.00)
(3 more...)

Technology:

Information Technology > Communications (1.00)
Information Technology > Human Computer Interaction (0.92)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.47)
(3 more...)


A Human Evaluation Details A.1 Unlearning Toxicity Human Eval Details

Neural Information Processing Systems, Aug-17-2025, 19:39:47 GMT

In total we have 1200 comparisons, and each comparison is rated by 3 raters. In total we have 2400 comparisons, and each comparison is rated by 3 raters. These were: 1. Coherence: Is the system's generation aligned in meaning and topic with the prompt? We sampled 100 prompts randomly from the corpus, and then evaluated 19 different algorithms. HITs was 2.2K, and the total number of ratings was 6.6K.

artificial intelligence, lieutenant colonel, machine learning, (12 more...)

Neural Information Processing Systems

Country: