Goto

Collaborating Authors

 Law


AssurAI: Experience with Constructing Korean Socio-cultural Datasets to Discover Potential Risks of Generative AI

arXiv.org Artificial Intelligence

The rapid evolution of generative AI necessitates robust safety evaluations. However, current safety datasets are predominantly English-centric, failing to capture specific risks in non-English, socio-cultural contexts such as Korean, and are often limited to the text modality. To address this gap, we introduce AssurAI, a new quality-controlled Korean multimodal dataset for evaluating the safety of generative AI. First, we define a taxonomy of 35 distinct AI risk factors, adapted from established frameworks by a multidisciplinary expert group to cover both universal harms and relevance to the Korean socio-cultural context. Second, leveraging this taxonomy, we construct and release AssurAI, a large-scale Korean multimodal dataset comprising 11,480 instances across text, image, video, and audio. Third, we apply the rigorous quality control process used to ensure data integrity, featuring a two-phase construction (i.e., expert-led seeding and crowdsourced scaling), triple independent annotation, and an iterative expert red-teaming loop. Our pilot study validates AssurAI's effectiveness in assessing the safety of recent LLMs. We release AssurAI to the public to facilitate the development of safer and more reliable generative AI systems for the Korean community.


Structured Definitions and Segmentations for Legal Reasoning in LLMs: A Study on Indian Legal Data

arXiv.org Artificial Intelligence

Large Language Models (LLMs), trained on extensive datasets from the web, exhibit remarkable general reasoning skills. Despite this, they often struggle in specialized areas like law, mainly because they lack domain-specific pretraining. The legal field presents unique challenges, as legal documents are generally long and intricate, making it hard for models to process the full text efficiently. Previous studies have examined in-context approaches to address the knowledge gap, boosting model performance in new domains without full domain alignment. In our paper, we analyze model behavior on legal tasks by conducting experiments in three areas: (i) reorganizing documents based on rhetorical roles to assess how structured information affects long context processing and model decisions, (ii) defining rhetorical roles to familiarize the model with legal terminology, and (iii) emulating the step-by-step reasoning of courts regarding rhetorical roles to enhance model reasoning. These experiments are conducted in a zero-shot setting across three Indian legal judgment prediction datasets. Our results reveal that organizing data or explaining key legal terms significantly boosts model performance, with a minimum increase of ~1.5% and a maximum improvement of 4.36% in F1 score compared to the baseline.


ConceptGuard: Proactive Safety in Text-and-Image-to-Video Generation through Multimodal Risk Detection

arXiv.org Artificial Intelligence

Recent progress in video generative models has enabled the creation of high-quality videos from multimodal prompts that combine text and images. While these systems offer enhanced controllability, they also introduce new safety risks, as harmful content can emerge from individual modalities or their interaction. Existing safety methods are often text-only, require prior knowledge of the risk category, or operate as post-generation auditors, struggling to proactively mitigate such compositional, multimodal risks. To address this challenge, we present ConceptGuard, a unified safeguard framework for proactively detecting and mitigating unsafe semantics in multimodal video generation. ConceptGuard operates in two stages: First, a contrastive detection module identifies latent safety risks by projecting fused image-text inputs into a structured concept space; Second, a semantic suppression mechanism steers the generative process away from unsafe concepts by intervening in the prompt's multimodal conditioning. To support the development and rigorous evaluation of this framework, we introduce two novel benchmarks: ConceptRisk, a large-scale dataset for training on multimodal risks, and T2VSafetyBench-TI2V, the first benchmark adapted from T2VSafetyBench for the Text-and-Image-to-Video (TI2V) safety setting. Comprehensive experiments on both benchmarks show that ConceptGuard consistently outperforms existing baselines, achieving state-of-the-art results in both risk detection and safe video generation. Our code is available at https://github.com/Ruize-Ma/ConceptGuard.


Rigor in AI: Doing Rigorous AI Work Requires a Broader, Responsible AI-Informed Conception of Rigor

arXiv.org Artificial Intelligence

In AI research and practice, rigor remains largely understood in terms of methodological rigor -- such as whether mathematical, statistical, or computational methods are correctly applied. We argue that this narrow conception of rigor has contributed to the concerns raised by the responsible AI community, including overblown claims about the capabilities of AI systems. Our position is that a broader conception of what rigorous AI research and practice should entail is needed. We believe such a conception -- in addition to a more expansive understanding of (1) methodological rigor -- should include aspects related to (2) what background knowledge informs what to work on (epistemic rigor); (3) how disciplinary, community, or personal norms, standards, or beliefs influence the work (normative rigor); (4) how clearly articulated the theoretical constructs under use are (conceptual rigor); (5) what is reported and how (reporting rigor); and (6) how well-supported the inferences from existing evidence are (interpretative rigor). In doing so, we also provide useful language and a framework for much-needed dialogue about the AI community's work by researchers, policymakers, journalists, and other stakeholders.


LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?

arXiv.org Artificial Intelligence

Recent advances in Large Multimodal Models (LMMs) have revolutionized their reasoning and Optical Character Recognition (OCR) capabilities. However, their complex logical reasoning performance on text-rich images remains underexplored. To bridge this gap, we introduce LogicOCR, a benchmark comprising 2780 questions with two subsets, i.e., LogicOCR-Gen with 1100 multi-choice questions on generated images, and LogicOCR-Real with 1680 meticulously designed free-form questions on real-world images. For constructing LogicOCR-Gen, we first curate a text corpus from the Chinese National Civil Servant Examination, and customize an automatic pipeline to steer GPT-Image-1 to generate images with varied layouts and fonts, ensuring contextual relevance and visual realism. Then, the generated images are manually verified. We evaluate a range of representative LMMs under Chain-of-Thought (CoT) and direct-answer settings. Our multi-dimensional analysis reveals key insights, such as the impact of test-time scaling, input modality differences, and sensitivity to visual-text orientation. Notably, LMMs still lag in multimodal reasoning compared to text-only inputs, indicating that they have not fully bridged visual reading with reasoning. Moreover, we propose TextCue, a training-free method that enhances LMMs' perception of image regions containing important text cues for solving questions. We leverage LMMs' attention maps and an off-the-shelf text segmentation specialist to determine the region, which is then cropped and enlarged to augment the original image. Experiments show its effectiveness, e.g., a 1.8% accuracy gain over LLaVA-OV-1.5-8B under the CoT setting. Our benchmark is available at https://github.com/MiliLab/LogicOCR.


How Do Companies Manage the Environmental Sustainability of AI? An Interview Study About Green AI Efforts and Regulations

arXiv.org Artificial Intelligence

With the ever-growing adoption of artificial intelligence (AI), AI-based software and its negative impact on the environment are no longer negligible, and studying and mitigating this impact has become a critical area of research. However, it is currently unclear which role environmental sustainability plays during AI adoption in industry and how AI regulations influence Green AI practices and decision-making in industry. We therefore aim to investigate the Green AI perception and management of industry practitioners. To this end, we conducted a total of 11 interviews with participants from 10 different organizations that adopted AI-based software. The interviews explored three main themes: AI adoption, current efforts in mitigating the negative environmental impact of AI, and the influence of the EU AI Act and the Corporate Sustainability Reporting Directive (CSRD). Our findings indicate that 9 of 11 participants prioritized business efficiency during AI adoption, with minimal consideration of environmental sustainability. Monitoring and mitigation of AI's environmental impact were very limited. Only one participant monitored negative environmental effects. Regarding applied mitigation practices, six participants reported no actions, with the others sporadically mentioning techniques like prompt engineering, relying on smaller models, or not overusing AI. Awareness and compliance with the EU AI Act are low, with only one participant reporting on its influence, while the CSRD drove sustainability reporting efforts primarily in larger companies. All in all, our findings reflect a lack of urgency and priority for sustainable AI among these companies. We suggest that current regulations are not very effective, which has implications for policymakers. Additionally, there is a need to raise industry awareness, but also to provide user-friendly techniques and tools for Green AI practices.


A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

arXiv.org Artificial Intelligence

Large language models (LLMs) are widely applied in chatbots, code generators, and search engines. Workload such as chain-of-throught, complex reasoning, agent services significantly increase the inference cost by invoke the model repeatedly. Optimization methods such as parallelism, compression, and caching have been adopted to reduce costs, but the diverse service requirements make it hard to select the right method. Recently, specialized LLM inference engines have emerged as a key component for integrating the optimization methods into service-oriented infrastructures. However, a systematic study on inference engines is still lacking.This paper provides a comprehensive evaluation of 25 open-source and commercial inference engines. We examine each inference engine in terms of ease-of-use, ease-of-deployment, general-purpose support, scalability, and suitability for throughput- and latency-aware computation. Furthermore, we explore the design goals of each inference engine by investigating the optimization techniques it supports. In addition, we assess the ecosystem maturity of open source inference engines and handle the performance and cost policy of commercial solutions.We outline future research directions that include support for complex LLM-based services, support of various hardware, and enhanced security, offering practical guidance to researchers and developers in selecting and designing optimized LLM inference engines. We also provide a public repository to continually track developments in this fast-evolving field: \href{https://github.com/sihyeong/Awesome-LLM-Inference-Engine}{https://github.com/sihyeong/Awesome-LLM-Inference-Engine}.


Campbell's FIRES executive secretly recorded saying its soups are full of 'bioengineered meat' and made for 'poor people'

Daily Mail - Science & tech

Karoline Leavitt's family member was swarmed by ICE agents while picking up son from school as child's father tell her to'self deport' Deaths from highly infectious virus are growing... as states brace for widespread outbreaks My book on the Kennedys was used as a'mistress manual' by Olivia Nuzzi... then this wannabe Carolyn Bessette had the nerve to hound me with these outrageous texts: MAUREEN CALLAHAN Katy Perry's legal victory as judge orders disabled veteran to pay singer nearly $2m over Montecito mansion Trump reveals next DC renovation project to remove'Biden filth' after White House ballroom Cracker Barrel CEO whines that she got'fired by America' for woke redesign Kroger employee reveals shocking amount laundry products have increased by... 'biggest price jump I've seen in a single week' Hollywood heir, 23, whose mom Anne Heche died in horror car fireball has secret LOVE CHILD with 43-year-old... now she's telling all Missing Melodee Buzzard's mom'left her daughter with strangers she met at the zoo' Rachel Zoe reveals why she dumped husband of 26 years... and if she has started dating again Horrific moment cops found body of Cowboys star Marshawn Kneeland after he shot himself at end of 145 mph chase'This is pretty lurid' Jenny McCarthy, 53, reveals health emergency that involved NINE surgeries, her'teeth falling out' and'growth' on her eyeballs Maryland grandma, 58, dragged across floor after being deported to country she'has never even visited' Campbell's FIRES executive secretly recorded saying its soups are full of'bioengineered meat' and made for'poor people' Campbell's Soup has fired the executive caught in a secret recording insulting customers and claiming the company's products were filled with bioengineered meat. Vice President and Chief Information Security Officer Martin Bally was originally placed on administrative leave after a lawsuit against Campbell's was filed last week and the audio recording was released. In the audio, a speaker identified as Bally was heard saying: 'We have s**t for f***king poor people. It's not healthy now that I know what the f**'s in it.' The voice, alleged to be Bally, also claimed that the chicken used in the brand's soups'came from a 3D printer.' Campbell's revealed on Wednesday that their investigation concluded that the voice on the secret recording was Bally and the executive was removed from the company on Tuesday.


Easter Island mystery is SOLVED: Scientists finally pinpoint who built the iconic stone heads 900 years ago

Daily Mail - Science & tech

Karoline Leavitt's family member was swarmed by ICE agents while picking up son from school as child's father tell her to'self deport' Deaths from highly infectious virus are growing... as states brace for widespread outbreaks My book on the Kennedys was used as a'mistress manual' by Olivia Nuzzi... then this wannabe Carolyn Bessette had the nerve to hound me with these outrageous texts: MAUREEN CALLAHAN Katy Perry's legal victory as judge orders disabled veteran to pay singer nearly $2m over Montecito mansion Trump reveals next DC renovation project to remove'Biden filth' after White House ballroom Cracker Barrel CEO whines that she got'fired by America' for woke redesign Kroger employee reveals shocking amount laundry products have increased by... 'biggest price jump I've seen in a single week' Hollywood heir, 23, whose mom Anne Heche died in horror car fireball has secret LOVE CHILD with 43-year-old... now she's telling all Missing Melodee Buzzard's mom'left her daughter with strangers she met at the zoo' Rachel Zoe reveals why she dumped husband of 26 years... and if she has started dating again Horrific moment cops found body of Cowboys star Marshawn Kneeland after he shot himself at end of 145 mph chase'This is pretty lurid' Jenny McCarthy, 53, reveals health emergency that involved NINE surgeries, her'teeth falling out' and'growth' on her eyeballs Maryland grandma, 58, dragged across floor after being deported to country she'has never even visited' READ MORE: New'stone head' statue mysteriously appears on Easter Island One of the biggest mysteries surrounding Easter Island may finally be solved - as scientists pinpoint who built the iconic stone heads over 900 years ago. In the past, researchers assumed that the 12 to 80-ton statues would have required the combined efforts of hundreds of labourers to build and move. However, new archaeological evidence shows that the statues, known as moai, were not carved by a single powerful chiefdom. Instead, each moai was carved by a small clan or by an individual family, with as few as four to six people working on a single statue. Using a new 3D model of the island's main moai quarry, which you can explore below, archaeologists identified 30 unique'workshops' where the statues were produced.


New scam sends fake Microsoft 365 login pages

FOX News

This material may not be published, broadcast, rewritten, or redistributed. Quotes displayed in real-time or delayed by at least 15 minutes. Market data provided by Factset . Powered and implemented by FactSet Digital Solutions . Mutual Fund and ETF data provided by Refinitiv Lipper .