AITopics | caesar

Collaborating Authors

caesar

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

This is the most underrated sci-fi film franchise of the 21st century

New ScientistMay-20-2026, 18:00:00 GMT

AS A sci-fi fan, you learn not to dwell on the films that could have been. Whether it's Alejandro Jodorowsky's unmade Dune, Guillermo del Toro's cancelled take on At the Mountains of Madness, or the versions of Return of the Jedi that Davids Lynch and Cronenberg could have made, it's best not to torture yourself over cinematic what-ifs. That's why I had given up hope of there being a new instalment of the most underrated sci-fi film franchise of the 21st century so far. Though well received by critics and audiences alike, none of the four films have won Oscars or seem to have made much of an impact on pop culture. But then, earlier this month, we got confirmation that a fifth movie was on the way.

artificial intelligence, science fiction, social media, (14 more...)

New Scientist

Industry:

Leisure & Entertainment (0.71)
Media > Film (0.52)
Health & Medicine > Therapeutic Area (0.34)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Science Fiction (0.73)

Add feedback

844f722dbbcb27933ff5baf58a1f00c8-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsFeb-10-2026, 09:57:36 GMT

dataset, expression, representation, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia > Albemarle County > Charlottesville (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Add feedback

CAESAR: An Embodied Simulator for Generating Multimodal Referring Expression Datasets

Neural Information Processing SystemsDec-24-2025, 16:01:42 GMT

Humans naturally use verbal utterances and nonverbal gestures to refer to various objects (known as $\textit{referring expressions}$) in different interactional scenarios. As collecting real human interaction datasets are costly and laborious, synthetic datasets are often used to train models to unambiguously detect relationships among objects. However, existing synthetic data generation tools that provide referring expressions generally neglect nonverbal gestures. Additionally, while a few small-scale datasets contain multimodal cues (verbal and nonverbal), these datasets only capture the nonverbal gestures from an exo-centric perspective (observer). As models can use complementary information from multimodal cues to recognize referring expressions, generating multimodal data from multiple views can help to develop robust models.

embodied simulator, generating multimodal, name change, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.39)

Add feedback

Is a robot programmed to prank you annoying? Yes

New ScientistNov-5-2025, 18:00:00 GMT

Is a robot programmed to prank you annoying? Feedback discovers a robot that can mimic Turkish ice cream vendors, who are known for playing tricks on their customers. Researchers concluded that customers, perhaps predictably, don't trust it Feedback is a grumpy sort, so we run a mile when faced with any kind of enforced fun. It is possible, therefore, that we would struggle to buy an ice cream in Turkey, because doing so requires enjoying, or at least tolerating, an extended prank. Turkish ice cream vendors are prone to playing tricks on their customers, like handing them a cone full of ice cream only to whisk it out of their grasp using sleight of hand.

robot, shakespeare, turkish ice cream vendor, (12 more...)

New Scientist

Country:

Asia > Middle East > Republic of Türkiye (0.25)
Indian Ocean (0.05)
Europe > United Kingdom > Scotland (0.05)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Watermarking Diffusion Language Models

Gloaguen, Thibaud, Staab, Robin, Jovanović, Nikola, Vechev, Martin

arXiv.org Artificial IntelligenceSep-30-2025

We introduce the first watermark tailored for diffusion language models (DLMs), an emergent LLM paradigm able to generate tokens in arbitrary order, in contrast to standard autoregressive language models (ARLMs) which generate tokens sequentially. While there has been much work in ARLM watermarking, a key challenge when attempting to apply these schemes directly to the DLM setting is that they rely on previously generated tokens, which are not always available with DLM generation. In this work we address this challenge by: (i) applying the watermark in expectation over the context even when some context tokens are yet to be determined, and (ii) promoting tokens which increase the watermark strength when used as context for other tokens. This is accomplished while keeping the watermark detector unchanged. Our experimental evaluation demonstrates that the DLM watermark leads to a >99% true positive rate with minimal quality impact and achieves similar robustness to existing ARLM watermarks, enabling for the first time reliable DLM watermarking.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2509.24368

Country:

Europe (1.00)
North America > United States (0.46)
Asia > Japan (0.28)

Genre: Research Report > New Finding (0.67)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)
Information Technology > Security & Privacy (1.00)
Government > Military (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Add feedback

CAESAR: An Embodied Simulator for Generating Multimodal Referring Expression Datasets

Neural Information Processing SystemsAug-16-2025, 14:12:19 GMT

However, existing synthetic data generation tools that provide referring expressions generally neglect nonverbal gestures.

machine learning, natural language, object-oriented architecture, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia > Albemarle County > Charlottesville (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Add feedback

Defense against Prompt Injection Attacks via Mixture of Encodings

Zhang, Ruiyi, Sullivan, David, Jackson, Kyle, Xie, Pengtao, Chen, Mei

arXiv.org Artificial IntelligenceApr-11-2025

Large Language Models (LLMs) have emerged as a dominant approach for a wide range of NLP tasks, with their access to external information further enhancing their capabilities. However, this introduces new vulnerabilities, known as prompt injection attacks, where external content embeds malicious instructions that manipulate the LLM's output. Recently, the Base64 defense has been recognized as one of the most effective methods for reducing success rate of prompt injection attacks. Despite its efficacy, this method can degrade LLM performance on certain NLP tasks. To address this challenge, we propose a novel defense mechanism: mixture of encodings, which utilizes multiple character encodings, including Base64. Extensive experimental results show that our method achieves one of the lowest attack success rates under prompt injection attacks, while maintaining high performance across all NLP tasks, outperforming existing character encoding-based defense methods. This underscores the effectiveness of our mixture of encodings strategy for both safety and task performance metrics.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2504.07467

Country:

North America > United States (1.00)
Asia (0.94)

Genre: Research Report > New Finding (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)

Add feedback

CAESAR: An Embodied Simulator for Generating Multimodal Referring Expression Datasets

Neural Information Processing SystemsJan-16-2025, 18:49:54 GMT

Humans naturally use verbal utterances and nonverbal gestures to refer to various objects (known as \textit{referring expressions}) in different interactional scenarios. As collecting real human interaction datasets are costly and laborious, synthetic datasets are often used to train models to unambiguously detect relationships among objects. However, existing synthetic data generation tools that provide referring expressions generally neglect nonverbal gestures. Additionally, while a few small-scale datasets contain multimodal cues (verbal and nonverbal), these datasets only capture the nonverbal gestures from an exo-centric perspective (observer). As models can use complementary information from multimodal cues to recognize referring expressions, generating multimodal data from multiple views can help to develop robust models.

embodied simulator, expression dataset, generating multimodal, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.42)

Add feedback

Caesar: A Low-deviation Compression Approach for Efficient Federated Learning

Yan, Jiaming, Liu, Jianchun, Xu, Hongli, Huang, Liusheng, Gong, Jiantao, Liu, Xudong, Hou, Kun

arXiv.org Artificial IntelligenceDec-27-2024

Compression is an efficient way to relieve the tremendous communication overhead of federated learning (FL) systems. However, for the existing works, the information loss under compression will lead to unexpected model/gradient deviation for the FL training, significantly degrading the training performance, especially under the challenges of data heterogeneity and model obsolescence. To strike a delicate trade-off between model accuracy and traffic cost, we propose Caesar, a novel FL framework with a low-deviation compression approach. For the global model download, we design a greedy method to optimize the compression ratio for each device based on the staleness of the local model, ensuring a precise initial model for local training. Regarding the local gradient upload, we utilize the device's local data properties (\ie, sample volume and label distribution) to quantify its local gradient's importance, which then guides the determination of the gradient compression ratio. Besides, with the fine-grained batch size optimization, Caesar can significantly diminish the devices' idle waiting time under the synchronized barrier. We have implemented Caesar on two physical platforms with 40 smartphones and 80 NVIDIA Jetson devices. Extensive results show that Caesar can reduce the traffic costs by about 25.54%$\thicksim$37.88% compared to the compression-based baselines with the same target accuracy, while incurring only a 0.68% degradation in final test accuracy relative to the full-precision communication.

artificial intelligence, compression ratio, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2412.19989

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Multiple-policy Evaluation via Density Estimation

Chen, Yilei, Pacchiano, Aldo, Paschalidis, Ioannis Ch.

arXiv.org Artificial IntelligenceMay-27-2024

We study the multiple-policy evaluation problem where we are given a set of $K$ policies and the goal is to evaluate their performance (expected total reward over a fixed horizon) to an accuracy $\epsilon$ with probability at least $1-\delta$. We propose an algorithm named $\mathrm{CAESAR}$ for this problem. Our approach is based on computing an approximate optimal offline sampling distribution and using the data sampled from it to perform the simultaneous estimation of the policy values. $\mathrm{CAESAR}$ has two phases. In the first we produce coarse estimates of the visitation distributions of the target policies at a low order sample complexity rate that scales with $\tilde{O}(\frac{1}{\epsilon})$. In the second phase, we approximate the optimal offline sampling distribution and compute the importance weighting ratios for all target policies by minimizing a step-wise quadratic loss function inspired by the DualDICE \cite{nachum2019dualdice} objective. Up to low order and logarithmic terms $\mathrm{CAESAR}$ achieves a sample complexity $\tilde{O}\left(\frac{H^4}{\epsilon^2}\sum_{h=1}^H\max_{k\in[K]}\sum_{s,a}\frac{(d_h^{\pi^k}(s,a))^2}{\mu^*_h(s,a)}\right)$, where $d^{\pi}$ is the visitation distribution of policy $\pi$, $\mu^*$ is the optimal sampling distribution, and $H$ is the horizon.

estimator, probability, sample complexity, (14 more...)

arXiv.org Artificial Intelligence

2404.00195

Country: Asia > China > Hunan Province > Changsha (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.50)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback