 Media


Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

arXiv.org Artificial Intelligence

We propose Ming-Flash-Omni, an upgraded version of Ming-Omni, built upon a sparser Mixture-of-Experts (MoE) variant of Ling-Flash-2.0 with 100 billion total parameters, of which only 6.1 billion are active per token. This architecture enables highly efficient scaling (dramatically improving computational efficiency while significantly expanding model capacity) and empowers stronger unified multimodal intelligence across vision, speech, and language, representing a key step toward Artificial General Intelligence (AGI). Compared to its predecessor, the upgraded version exhibits substantial improvements across multimodal understanding and generation. We significantly advance speech recognition capabilities, achieving state-of-the-art performance in contextual ASR and highly competitive results in dialect-aware ASR. In image generation, Ming-Flash-Omni introduces high-fidelity text rendering and demonstrates marked gains in scene consistency and identity preservation during image editing. Furthermore, Ming-Flash-Omni introduces generative segmentation, a capability that not only achieves strong standalone segmentation performance but also enhances spatial control in image generation and improves editing consistency. Notably, Ming-Flash-Omni achieves state-of-the-art results in text-to-image generation and generative segmentation, and sets new records on all 12 contextual ASR benchmarks, all within a single unified architecture.


FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

arXiv.org Artificial Intelligence

We conduct a moderate-scale contamination-free (to some extent) evaluation of current large reasoning models (LRMs) with some preliminary findings. We also release ROME, our evaluation benchmark for vision language models intended to test reasoning from visual clues. We attach links to the benchmark, evaluation data, and other updates on this website: https://flageval-baai.github.io/LRM-Eval/


Unified Text-Image-to-Video Generation: A Training-Free Approach to Flexible Visual Conditioning

arXiv.org Artificial Intelligence

Text-image-to-video (TI2V) generation is a critical problem for controllable video generation using both semantic and visual conditions. Most existing methods add visual conditions to text-to-video (T2V) foundation models by finetuning, which is costly in resources and limited to a few pre-defined conditioning settings. To tackle these constraints, we introduce a unified formulation for TI2V generation with flexible visual conditioning. Furthermore, we propose a training-free approach, dubbed FlexTI2V, that can condition T2V foundation models on an arbitrary number of images at arbitrary positions. Specifically, we first invert the condition images to a noisy representation in a latent space. Then, in the denoising process of T2V models, our method uses a novel random patch swapping strategy to incorporate visual features into video representations through local image patches. To balance creativity and fidelity, we use a dynamic control mechanism to adjust the strength of visual conditioning for each video frame. Extensive experiments validate that our method surpasses previous training-free image conditioning methods by a notable margin. Our method also generalizes to both UNet-based and transformer-based architectures.


From Generation to Detection: A Multimodal Multi-Task Dataset for Benchmarking Health Misinformation

arXiv.org Artificial Intelligence

Infodemics and health misinformation have a significant negative impact on individuals and society, exacerbating confusion and increasing hesitancy in adopting recommended health measures. Recent advancements in generative AI, capable of producing realistic, human-like text and images, have significantly accelerated the spread and expanded the reach of health misinformation, resulting in an alarming surge in its dissemination. To combat infodemics, most existing work has focused on developing misinformation datasets from social media and fact-checking platforms, but has faced limitations in topical coverage, inclusion of AI-generated content, and accessibility of raw content. To address these issues, we present MM Health, a large-scale multimodal misinformation dataset in the health domain consisting of 34,746 news articles encompassing both textual and visual information. MM Health includes human-generated multimodal information (5,776 articles) and AI-generated multimodal information (28,880 articles) from various SOTA generative AI models. Additionally, we benchmarked our dataset on three tasks (reliability checks, originality checks, and fine-grained AI detection), demonstrating that existing SOTA models struggle to accurately distinguish the reliability and origin of information. Our dataset aims to support the development of misinformation detection across various health scenarios, facilitating the detection of human- and machine-generated content at multimodal levels.


MA-COIR: Leveraging Semantic Search Index and Generative Models for Ontology-Driven Biomedical Concept Recognition

arXiv.org Artificial Intelligence

Recognizing biomedical concepts in text is vital for ontology refinement, knowledge graph construction, and concept relationship discovery. However, traditional concept recognition methods, which rely on explicit mention identification, often fail to capture complex concepts not explicitly stated in the text. To overcome this limitation, we introduce MA-COIR, a framework that reformulates concept recognition as an indexing-recognition task. By assigning semantic search indexes (ssIDs) to concepts, MA-COIR resolves ambiguities in ontology entries and enhances recognition efficiency. Using a pretrained BART-based model fine-tuned on small datasets, our approach reduces computational requirements to facilitate adoption by domain experts. Furthermore, we incorporate queries generated by large language models (LLMs) and synthetic data to improve recognition in low-resource settings. Experimental results on three scenarios (CDR, HPO, and HOIP) highlight the effectiveness of MA-COIR in recognizing both explicit and implicit concepts without the need for mention-level annotations during inference, advancing ontology-driven concept recognition in biomedical domain applications. Our code and constructed data are available at https://github.com/sl-633/macoir-master.


Dying for fame: Singers die 4 YEARS earlier than non-famous people on average - and their celebrity status is to blame, scientists say

Daily Mail - Science & tech

Celebrities are known for living life in the fast lane - but being famous really can prove deadly, according to a new study. Researchers have discovered that being in the limelight comes with a higher mortality risk compared to those who never quite 'make it'. It could explain why some singers such as Janis Joplin, Whitney Houston and Jimi Hendrix died so young. And it suggests that fame comes with 'unique psychosocial stress' that leads to 'harmful coping behaviours' like substance abuse, they said.


Dark matter is seen for the first time: Eerie image shows first direct evidence of the elusive substance that makes up 25% of the universe

Daily Mail - Science & tech

Scientists have captured the first-ever direct evidence for dark matter, the elusive substance that makes up more than a quarter of the universe. Using NASA's Fermi telescope, researchers have detected powerful gamma-ray radiation emerging from a 'halo-like' structure surrounding the Milky Way. Its frequency and intensity suggest that this could be dark matter. According to the study's author, Professor Tomonori Totani of the University of Tokyo, this eerie image is the first time that humanity has been able to 'see' the mysterious substance.


Religious leader issues doomsday warning for the end of 2025: 'The last day of this world'

Daily Mail - Science & tech

A comet has been predicted to strike the Earth by the end of the year, on what a controversial religious leader called 'the last day of this world.' The doomsday warning came from the writings of Riaz Ahmed Gohar Shahi, a Pakistani spiritual leader and mystic, who claimed that God was sending a comet to collide with Earth because humanity had strayed too far from spiritual truths.
He founded several organizations to spread his teachings of 'divine love,' including the spiritual movement called Anjuman Serfaroshan-e-Islam and the Messiah Foundation International (MFI).


Director James Cameron says he can still work with Elon Musk despite political differences

FOX News

Hollywood director James Cameron said he can remain friends with Tesla owner Elon Musk, despite their political differences, over shared goals in AI and space travel.


Gruesome death ordered for 172 bears as hunt ritual is approved for first time in more than a decade

Daily Mail - Science & tech

As many as 172 black bears are at risk of death in Florida after a judge approved the first hunt in a decade. Leon County Circuit Judge Angela Dempsey rejected a request from Bear Warriors United, a Central Florida-based nonprofit, to halt this year's hunt, saying the group had failed to show a 'substantial likelihood of success on the merits' in its lawsuit.
The hunt is scheduled for December 6 through 28 on lands outside the wildlife management area system.