calendar
- Africa > Ghana (0.05)
- North America > United States > Pennsylvania > Lackawanna County > Scranton (0.04)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- Europe > United Kingdom (0.04)
- Leisure & Entertainment (0.68)
- Health & Medicine (0.68)
- Government > Regional Government > North America Government > United States Government (0.47)
- Media > Music (0.46)
Apple's Most Overlooked App Just Got a Lot Better
Apple Shortcuts, which lets users write custom automations, recently earned some new capabilities thanks to Apple Intelligence. Here's how to make the most of this upgrade. As sentences go, "Apple Intelligence now works in Apple Shortcuts" isn't the most likely to inspire a lot of people to click a link. And that's too bad: This change, one of the more overlooked new features in macOS 26, means you can use Apple's on-board AI to do all kinds of things while designing shortcuts. Look, I get it: Apple Intelligence makes AI a feature, not a product, and features are generally less interesting to read about than full-blown products.
- Asia > Nepal (0.15)
- North America > United States > California (0.05)
- Europe > Slovakia (0.05)
- Europe > Czechia (0.05)
- Information Technology > Communications > Mobile (0.70)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.31)
Can Language Models Handle a Non-Gregorian Calendar? The Case of the Japanese wareki
Sasaki, Mutsumi, Kamoda, Go, Takahashi, Ryosuke, Sato, Kosuke, Inui, Kentaro, Sakaguchi, Keisuke, Heinzerling, Benjamin
Temporal reasoning and knowledge are essential capabilities for language models (LMs). While much prior work has analyzed and improved temporal reasoning in LMs, most studies have focused solely on the Gregorian calendar. However, many non-Gregorian systems, such as the Japanese, Hijri, and Hebrew calendars, are in active use and reflect culturally grounded conceptions of time. If and how well current LMs can accurately handle such non-Gregorian calendars has not been evaluated so far. Here, we present a systematic evaluation of how well language models handle one such non-Gregorian system: the Japanese wareki. We create datasets that require temporal knowledge and reasoning in using wareki dates. Evaluating open and closed LMs, we find that some models can perform calendar conversions, but GPT-4o, Deepseek V3, and even Japanese-centric models struggle with Japanese calendar arithmetic and knowledge involving wareki dates. Error analysis suggests corpus frequency of Japanese calendar expressions and a Gregorian bias in the model's knowledge as possible explanations. Our results show the importance of developing LMs that are better equipped for culture-specific tasks such as calendar understanding.
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
A API Details
API calls for each position identified in a piece of text. Question Answering We use the Atlas model of Izacard et al. (2022) finetuned on Natural Questions Calculator Our calculator is based on a simple Python script and only supports the operators " It does not return any result for syntactically invalid equations. "=", "equals", "equal to", "total of", "average of" followed by a number, or (iii) contain at least three English text before generating API calls. Below, we list the prompts used to sample API calls for each tool considered. Your task is to add calls to a Question Answering API to a piece of text. Input: Joe Biden was born in Scranton, Pennsylvania. Output: Joe Biden was born in [QA("Where was Joe Biden born?")] Scranton, [QA("In Output: Coca-Cola, or [QA("What other name is Coca-Cola known by?")] Coke, is Your task is to add calls to a Calculator API to a piece of text.
- North America > United States > Pennsylvania > Lackawanna County > Scranton (0.24)
- Africa > Ghana (0.05)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- Europe > United Kingdom (0.04)
- Government > Regional Government > North America Government > United States Government (1.00)
- Leisure & Entertainment (0.68)
- Health & Medicine (0.68)
Better Privilege Separation for Agents by Restricting Data Types
Jacob, Dennis, Alghamdi, Emad, Hu, Zhanhao, Alomair, Basel, Wagner, David
Large language models (LLMs) have become increasingly popular due to their ability to interact with unstructured content. As such, LLMs are now a key driver behind the automation of language processing systems, such as AI agents. Unfortunately, these advantages have come with a vulnerability to prompt injections, an attack where an adversary subverts the LLM's intended functionality with an injected task. Past approaches have proposed detectors and finetuning to provide robustness, but these techniques are vulnerable to adaptive attacks or cannot be used with state-of-the-art models. To this end we propose type-directed privilege separation for LLMs, a method that systematically prevents prompt injections. We restrict the ability of an LLM to interact with third-party data by converting untrusted content to a curated set of data types; unlike raw strings, each data type is limited in scope and content, eliminating the possibility for prompt injections. We evaluate our method across several case studies and find that designs leveraging our principles can systematically prevent prompt injection attacks while maintaining high utility.
- North America > United States > California > Alameda County > Berkeley (0.14)
- Asia > Middle East > Saudi Arabia > Riyadh Province > Riyadh (0.04)
- Europe > Switzerland > Basel-City > Basel (0.04)
- Information Technology > Security & Privacy (1.00)
- Government (0.93)
OpenID Connect for Agents (OIDC-A) 1.0: A Standard Extension for LLM-Based Agent Identity and Authorization
Nagabhushanaradhya, Subramanya
OpenID Connect for Agents (OIDC-A) 1.0 is an extension to OpenID Connect Core 1.0 that provides a comprehensive framework for representing, authenticating, and authorizing LLM-based agents within the OAuth 2.0 ecosystem. As autonomous AI agents become increasingly prevalent in digital systems, there is a critical need for standardized protocols to establish agent identity, verify agent attestation, represent delegation chains, and enable fine-grained authorization based on agent attributes. This specification defines standard claims, endpoints, and protocols that address these requirements while maintaining compatibility with existing OAuth 2.0 and OpenID Connect infrastructure. The proposed framework introduces mechanisms for agent identity representation, delegation chain validation, attestation verification, and capability-based authorization, providing a foundation for secure and trustworthy agent-to-service interactions in modern distributed systems.
Can You Really Live One Day at a Time?
Productivity culture encourages us to live inside our tasks and projects. But nature offers its own organizational system. This summer, I reread the novel " Aurora," by Kim Stanley Robinson, a science-fiction writer whom I profiled a few years ago. Robinson has an ecological orientation, and "Aurora" is basically a book about how we fit into nature. It ends on a beach, with an extended description of swimming in big waves. It's early morning, and the waves, as they rise, "turn a deep translucent green."
- North America > United States > New York (0.05)
- Europe > Netherlands > North Holland > Amsterdam (0.05)
- North America > United States > Michigan (0.04)
- (2 more...)
- Leisure & Entertainment (0.69)
- Education (0.69)
ASPERA: A Simulated Environment to Evaluate Planning for Complex Action Execution
Coca, Alexandru, Gaynor, Mark, Zhang, Zhenxing, Cheng, Jianpeng, Tseng, Bo-Hsiang, Boothroyd, Pete, Alonso, Héctor Martinez, Séaghdha, Diarmuid Ó, Johannsen, Anders
This work evaluates the potential of large language models (LLMs) to power digital assistants capable of complex action execution. These assistants rely on pre-trained programming knowledge to execute multi-step goals by composing objects and functions defined in assistant libraries into action execution programs. To achieve this, we develop ASPERA, a framework comprising an assistant library simulation and a human-assisted LLM data generation engine. Our engine allows developers to guide LLM generation of high-quality tasks consisting of complex user queries, simulation state and corresponding validation programs, tackling data availability and evaluation robustness challenges. Alongside the framework we release Asper-Bench, an evaluation dataset of 250 challenging tasks generated using ASPERA, which we use to show that program generation grounded in custom assistant libraries is a significant challenge to LLMs compared to dependency-free code generation.
- Europe > Austria > Vienna (0.14)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- Asia > Thailand > Bangkok > Bangkok (0.04)
- (21 more...)
- Workflow (1.00)
- Research Report (0.81)
- Instructional Material (0.67)