AITopics | sheldon

Collaborating Authors

sheldon

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

Li, Wenyu, Jiao, Xiaoqi, Chang, Yi, Zhang, Guangyan, Guo, Yiwen

arXiv.org Artificial IntelligenceSep-30-2025

The creation of high-quality multimodal datasets remains fundamental for advancing role-playing capabilities in large language models (LLMs). While existing works predominantly focus on text-based persona simulation, Audio Role-Playing (ARP) presents unique challenges due to the need for synchronized alignment of semantic content and vocal characteristics. To address this gap, we propose AudioRole, a meticulously curated dataset from 13 TV series spanning 1K+ hours with 1M+ character-grounded dialogues, providing synchronized audio-text pairs annotated with speaker identities and contextual metadata. In addition, to demonstrate the effectiveness of the dataset, we introduced ARP-Eval, a dual-aspect evaluation framework that assesses both response quality and role fidelity. Empirical validation showing GLM-4-Voice trained on AudioRole (which we called ARP-Model) achieve an average Acoustic Personalization score of 0.31, significantly outperforming the original GLM-4-voice and the more powerful model MiniCPM-O-2.6, which specifically supports role-playing in one-shot scenarios. The ARP-Model also achieves a Content Personalization score of 0.36, surpassing the untrained original model by about 38% and maintaining the same level as MiniCPM-O-2.6. AudioRole features dialogues from over 115 main characters, 6 trained ARP-Models that role-play different characters, and evaluation protocols. Together, they provide an essential resource for advancing audio-grounded role-playing research.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2509.23435

Genre: Research Report (1.00)

Industry:

Media > Television (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Amazon, Google and Meta are 'pillaging culture, data and creativity' to train AI, Australian inquiry finds

The GuardianNov-27-2024, 06:25:23 GMT

Tech companies Amazon, Google and Meta have been criticised by a Senate select committee inquiry for being especially vague over how they used Australian data to train their powerful artificial intelligence products. Labor senator Tony Sheldon, the inquiry's chair, was frustrated by the multinationals' refusal to answer direct questions about their use of Australians' private and personal information. "Watching Amazon, Meta, and Google dodge questions during the hearings was like sitting through a cheap magic trick – plenty of hand-waving, a puff of smoke, and nothing to show for it in the end," Sheldon said in a statement, after releasing the final report of the inquiry on Tuesday. He called the tech companies "pirates" that were "pillaging our culture, data, and creativity for their gain while leaving Australians empty-handed." The report found some general-purpose AI models – such as OpenAI's GPT, Meta's Llama and Google's Gemini – should automatically default to a "high risk" category, and be subjected to mandated transparency and accountability requirements.

amazon, australian inquiry find, google and meta, (9 more...)

The Guardian

Country:

North America > United States > California (0.16)
Oceania > Australia (0.10)
Europe (0.05)

Industry: Information Technology > Security & Privacy (0.52)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.92)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)

Add feedback

Reviews: Tomography of the London Underground: a Scalable Model for Origin-Destination Data

Neural Information Processing SystemsOct-8-2024, 07:13:08 GMT

I thank the authors for the clarification in their rebuttal. It is even more clear that the authors should better contrast their work with aggregate approaches such as Dan Sheldon's collective graphical models (e.g., Sheldon and Dietterich (2011), Kumar et al. 2013, Bernstein and Sheldon 2016). Part of the confusion came from some of the modeling choices: In equation (1) the travel times added by one station is Poisson distributed?! Poisson is often used for link loads (how many people there are in a given station), not to model time. Is the quantization of time too coarse for a continuous-time model? Wouldn't a phase-type distribution(e.g., Erlang) be a better choice for time? Such modeling choices must be explained.

london underground, origin-destination data, scalable model, (7 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Greater London > London (0.40)

Industry:

Transportation > Passenger (0.40)
Transportation > Ground > Rail (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Meta's AI is scraping users' photos and posts. Europeans can opt out, but Australians cannot

The GuardianSep-11-2024, 01:34:56 GMT

Meta is using the public Facebook and Instagram photos and posts of its users to train artificial intelligence and, while European users have been allowed to opt out of the mass-scraping of their content, Australian users do not have that option, a parliamentary committee has heard. The parent company of Facebook and Instagram paused the launch of its AI product in Europe in July due to the General Data Protection Regulation (GDPR) privacy rules, and as a result of GDPR law. Meta was ordered to stop training its large language model on data from European users on privacy concerns, and Meta has given European users an opt-out option. Labor's chair of the inquiry examining AI adoption in Australia, senator Tony Sheldon, questioned Meta executives on Tuesday why that option had not been extended to Australian users. "I'll be very frank with you. I'd like to opt out in Australia … and I'd like to have the options similar to Europe, for all Australians, including for myself personally. Why can't I have that option?"

european user, meta, photo and post, (6 more...)

The Guardian

Country:

Oceania > Australia (0.50)
Europe (0.50)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Hi Sheldon! Creating Deep Personalized Characters from TV Shows

Xuanyuan, Meidai, Wang, Yuwang, Guo, Honglei, Ma, Xiao, Guo, Yuchen, Yu, Tao, Dai, Qionghai

arXiv.org Artificial IntelligenceApr-8-2023

Imagine an interesting multimodal interactive scenario that you can see, hear, and chat with an AI-generated digital character, who is capable of behaving like Sheldon from The Big Bang Theory, as a DEEP copy from appearance to personality. Towards this fantastic multimodal chatting scenario, we propose a novel task, named Deep Personalized Character Creation (DPCC): creating multimodal chat personalized characters from multimodal data such as TV shows. Specifically, given a single- or multi-modality input (text, audio, video), the goal of DPCC is to generate a multi-modality (text, audio, video) response, which should be well-matched the personality of a specific character such as Sheldon, and of high quality as well. To support this novel task, we further collect a character centric multimodal dialogue dataset, named Deep Personalized Character Dataset (DPCD), from TV shows. DPCD contains character-specific multimodal dialogue data of ~10k utterances and ~6 hours of audio/video per character, which is around 10 times larger compared to existing related datasets.On DPCD, we present a baseline method for the DPCC task and create 5 Deep personalized digital Characters (DeepCharacters) from Big Bang TV Shows. We conduct both subjective and objective experiments to evaluate the multimodal response from DeepCharacters in terms of characterization and quality. The results demonstrates that, on our collected DPCD dataset, the proposed baseline can create personalized digital characters for generating multimodal response.Our collected DPCD dataset, the code of data collection and our baseline will be published soon.

deepcharacter, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2304.11093

Country:

North America > Canada > Quebec > Montreal (0.04)
Europe > Italy > Tuscany > Florence (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(11 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Media > Television (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Speech (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.46)

Add feedback

Voila raises $6M for its A.I.-powered storefronts for online creators – TechCrunch

#artificialintelligenceFeb-16-2022, 14:50:07 GMT

Voila, a startup building infrastructure for social commerce, is bringing concepts from China's e-commerce market to the U.S. The company offers an alternative to the "link in bio" solutions used today by creators, like Linktree and Beacons, which direct followers to creators' social profiles, personal websites, and other recommendations. Instead of a link list or landing page, Voila creates A.I.-powered customizable, shoppable storefronts by automatically detecting items in the creators' online content then generating shoppable links. With now over 10,000 creators signed up for the service, Voila is today announcing the close of its $6 million Series A led by Sinnovation Ventures and joined by Fosun Rz Capital. To date, Voila has raised $7.5 million, including from investors SOSV and Artesian. Voila founder Ke Shang first moved from China to the U.S. to attend college.

creator, image credit, voilà, (13 more...)

#artificialintelligence

Country:

Asia > China (0.50)
North America > United States > California (0.05)
Europe (0.05)

Industry:

Information Technology > Services (0.51)
Banking & Finance > Capital Markets (0.36)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.75)

Add feedback

SCROLLS: Standardized CompaRison Over Long Language Sequences

Shaham, Uri, Segal, Elad, Ivgi, Maor, Efrat, Avia, Yoran, Ori, Haviv, Adi, Gupta, Ankit, Xiong, Wenhan, Geva, Mor, Berant, Jonathan, Levy, Omer

arXiv.org Artificial IntelligenceJan-10-2022

NLP benchmarks have largely focused on short texts, such as sentences and paragraphs, even though long texts comprise a considerable amount of natural language in the wild. We introduce SCROLLS, a suite of tasks that require reasoning over long texts. We examine existing long-text datasets, and handpick ones where the text is naturally long, while prioritizing tasks that involve synthesizing information across the input. SCROLLS contains summarization, question answering, and natural language inference tasks, covering multiple domains, including literature, science, business, and entertainment. Initial baselines, including Longformer Encoder-Decoder, indicate that there is ample room for improvement on SCROLLS. We make all datasets available in a unified text-to-text format and host a live leaderboard to facilitate research on model architecture and pretraining methods.

computational linguistic, dataset, linguistic, (17 more...)

arXiv.org Artificial Intelligence

2201.03533

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > Dominican Republic (0.04)
(11 more...)

Genre: Research Report (0.50)

Industry:

Law (0.68)
Government (0.68)
Leisure & Entertainment (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Using Artificial Intelligence to Track Birds' Dark-of-Night Migrations - insideBIGDATA

#artificialintelligenceSep-7-2019, 20:07:28 GMT

On many evenings during spring and fall migration, tens of millions of birds take flight at sunset and pass over our heads, unseen in the night sky. Though these flights have been recorded for decades by the National Weather Services' network of constantly scanning weather radars, until recently these data have been mostly out of reach for bird researchers. That's because the sheer magnitude of information and lack of tools to analyze it made only limited studies possible, says artificial intelligence (AI) researcher Dan Sheldon at the University of Massachusetts Amherst. Ornithologists and ecologists with the time and expertise to analyze individual radar images could clearly see patterns that allowed them to discriminate precipitation from birds and study migration, he adds. But the massive amount of information – over 200 million images and hundreds of terabytes of data – significantly limited their ability to sample enough nights, over enough years and in enough locations to be useful in characterizing, let alone tracking, seasonal, continent-wide migrations, he explains.

migration, mistnet, sheldon, (9 more...)

#artificialintelligence

Country: North America > United States > Massachusetts > Hampshire County > Amherst (0.25)

Industry: Government > Regional Government > North America Government > United States Government (0.72)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.75)

Add feedback

AI tracks migratory birds using weather radar

#artificialintelligenceAug-28-2019, 17:18:25 GMT

Tens of millions of birds make migratory flights for the winter each year, often flying during nighttime. They're frequently spotted by the National Weather Services' network of 159 ground-based radars, which scan the skies every 4 to 10 minutes by emitting pulses of microwaves and measuring their reflections. However, ecologists have historically struggled to make use of the resulting data sets because of their sheer magnitude, which can range up to hundreds of millions of images and hundreds of terabytes over decades. In an effort to lighten the workload, scientists at Cornell's Lab of Ornithology and the University of Massachusetts' College of Information and Computer Sciences recently investigated an AI system capable of distinguishing birds in radar images from precipitation. They say that their tool, dubbed MistNet after the fine nets ornithologists use to capture migratory songbirds, not only aids with classification tasks, but can be used to estimate birds' flying velocity and traffic rates.

ai track migratory bird, artificial intelligence, machine learning, (6 more...)

#artificialintelligence

Country: North America > United States > Massachusetts (0.26)

Genre: Research Report > New Finding (0.33)

Industry: Government > Regional Government > North America Government > United States Government (0.57)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.33)

Add feedback

Artificial intelligence helps scientists track birds migrating at night

#artificialintelligenceAug-28-2019, 13:47:01 GMT

It's difficult to track birds flying across the sky in the dark of night, but every fall and spring, millions of birds migrate through the night. Weather radar can offer a spotty view of the phenomenon, but to track nighttime migrations with greater accuracy and reliability, a group of researchers at the University of Massachusetts at Amherst turned to artificial intelligence. Scientists designed a machine-learning algorithm to analyze weather radar images and differentiate migrating birds from precipitation. The algorithm replicates the power of neural networks to analyze and classify radar images. Researchers used the new artificial intelligence program to survey decades-long radar data sets, revealing seasonal and continent-wide migration patterns.

artificial intelligence, intelligence help scientist track bird, machine learning, (3 more...)

#artificialintelligence

Country: North America > United States > Massachusetts (0.27)

Genre: Research Report (0.81)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback