AITopics | pedestal

Collaborating Authors

pedestal

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Towards Understanding Camera Motions in Any Video

Neural Information Processing SystemsJun-22-2026, 16:28:17 GMT

We introduce CameraBench, a large-scale dataset and benchmark designed to assess and improve camera motion understanding. CameraBench consists of 3,000 diverse internet videos, annotated by experts through a rigorous multi-stage quality control process. One of our core contributions is a taxonomy or "language" of camera motion primitives, designed in collaboration with cinematographers. We find, for example, that some primitives like "follow" (or tracking) require understanding scene content like moving subjects. We conduct a large-scale human study to quantify human annotation performance, revealing that domain expertise and tutorial-based training can significantly enhance accuracy. For example, a novice may confuse zoom-in(a change of intrinsics) with translating forward (a change of extrinsics), but can be trained to differentiate the two. Using CameraBench, we evaluate Structure-from-Motion (SfM) and Video-Language Models (VLMs), finding that SfM models struggle to capture semantic primitives that depend on scene content, while VLMs struggle to capture geometric primitives that require precise estimation of trajectories. We then fine-tune a generative VLM on CameraBench to achieve the best of both worlds and showcase its applications, including motion-augmented captioning, video question answering, and video-text retrieval. We hope our taxonomy, benchmark, and tutorials will drive future efforts towards the ultimate goal of understanding camera motions in any video.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Overview (0.92)
Instructional Material (0.67)

Industry:

Media > Photography (1.00)
Media > Film (1.00)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

Towards Understanding Camera Motions in Any Video

Neural Information Processing SystemsJun-22-2026, 16:28:13 GMT

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Overview (0.92)
Instructional Material (0.67)

Industry:

Media > Photography (1.00)
Media > Film (1.00)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

Finding Local Minima Efficiently in Decentralized Optimization

Neural Information Processing SystemsApr-29-2026, 17:19:50 GMT

In this paper we study the second-order optimality of decentralized stochastic algorithm that escapes saddle point efficiently for nonconvex optimization problems. We propose a new pure gradient-based decentralized stochastic algorithm PEDESTAL with a novel convergence analysis framework to address the technical challenges unique to the decentralized stochastic setting. Our method is the first decentralized stochastic algorithm to achieve second-order optimality with non-asymptotic analysis. We provide theoretical guarantees with the gradient complexity of O(ϵ 3)to find O(ϵ, ϵ)-second-order stationary point, which matches state-of-the-art results of centralized counterparts or decentralized methods to find first-order stationary point. We also conduct two decentralized tasks in our experiments, a matrix sensing task with synthetic data and a matrix factorization task with a real-world dataset to validate the performance of our method.

algorithm, artificial intelligence, machine learning, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.48)

Add feedback

Towards Understanding Camera Motions in Any Video

Lin, Zhiqiu, Cen, Siyuan, Jiang, Daniel, Karhade, Jay, Wang, Hewei, Mitra, Chancharik, Ling, Tiffany, Huang, Yuhan, Liu, Sifan, Chen, Mingyu, Zawar, Rushikesh, Bai, Xue, Du, Yilun, Gan, Chuang, Ramanan, Deva

arXiv.org Artificial IntelligenceSep-1-2025

We introduce CameraBench, a large-scale dataset and benchmark designed to assess and improve camera motion understanding. CameraBench consists of ~3,000 diverse internet videos, annotated by experts through a rigorous multi-stage quality control process. One of our contributions is a taxonomy of camera motion primitives, designed in collaboration with cinematographers. We find, for example, that some motions like "follow" (or tracking) require understanding scene content like moving subjects. We conduct a large-scale human study to quantify human annotation performance, revealing that domain expertise and tutorial-based training can significantly enhance accuracy. For example, a novice may confuse zoom-in (a change of intrinsics) with translating forward (a change of extrinsics), but can be trained to differentiate the two. Using CameraBench, we evaluate Structure-from-Motion (SfM) and Video-Language Models (VLMs), finding that SfM models struggle to capture semantic primitives that depend on scene content, while VLMs struggle to capture geometric primitives that require precise estimation of trajectories. We then fine-tune a generative VLM on CameraBench to achieve the best of both worlds and showcase its applications, including motion-augmented captioning, video question answering, and video-text retrieval. We hope our taxonomy, benchmark, and tutorials will drive future efforts towards the ultimate goal of understanding camera motions in any video.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2504.15376

Genre: Research Report (1.00)

Industry:

Media > Photography (1.00)
Media > Film (1.00)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

Ye, Xi, Yin, Fangcong, He, Yinghui, Zhang, Joie, Yen, Howard, Gao, Tianyu, Durrett, Greg, Chen, Danqi

arXiv.org Artificial IntelligenceJan-9-2025

Existing benchmarks for evaluating long-context language models (LCLMs) primarily focus on long-context recall, requiring models to produce short responses based on a few critical snippets while processing thousands of irrelevant tokens. We introduce LongProc (Long Procedural Generation), a new benchmark that requires both the integration of highly dispersed information and long-form generation. LongProc consists of six diverse procedural generation tasks, such as extracting structured information from HTML pages into a TSV format and executing complex search procedures to create travel plans. These tasks challenge LCLMs by testing their ability to follow detailed procedural instructions, synthesize and reason over dispersed information, and generate structured, long-form outputs (up to 8K tokens). Furthermore, as these tasks adhere to deterministic procedures and yield structured outputs, they enable reliable rule-based evaluation. We evaluate 17 LCLMs on LongProc across three difficulty levels, with maximum numbers of output tokens set at 500, 2K, and 8K. Notably, while all tested models claim a context window size above 32K tokens, open-weight models typically falter on 2K-token tasks, and closed-source models like GPT-4o show significant degradation on 8K-token tasks. Further analysis reveals that LCLMs struggle to maintain long-range coherence in long-form generations. These findings highlight critical limitations in current LCLMs and suggest substantial room for improvement. Data and code available at: https://princeton-pli.github.io/LongProc

benchmarking long-context language model, carol location, tweezers location, (13 more...)

arXiv.org Artificial Intelligence

2501.05414

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Europe > Sweden > Stockholm > Stockholm (0.05)
Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.05)
(33 more...)

Genre: Research Report (1.00)

Industry: Consumer Products & Services > Travel (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.92)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Drones examine Japan's damaged Fukushima nuclear reactor for the first time

FOX NewsMar-19-2024, 16:36:54 GMT

U.S. Ambassador to Japan Rahm Emanuel visited a Fukushima coastal city to support the local fishing industry after China and South Korea raised the alarm over water discharge began from the Fukushima Daiichi nuclear plant. Images taken by miniature drones from deep inside a badly damaged reactor at the Fukushima nuclear plant show displaced control equipment and misshapen materials but leave many questions unanswered, underscoring the daunting task of decommissioning the plant. The 12 photos released by the plant's operator are the first from inside the main structural support called the pedestal in the hardest-hit No. 1 reactor's primary containment vessel, an area directly under the reactor's core. Officials had long hoped to reach the area to examine the core and melted nuclear fuel which dripped there when the plant's cooling systems were damaged by a massive earthquake and tsunami in 2011. Earlier attempts with robots were unable to reach the area.

drone examine japan, fukushima nuclear reactor, reactor, (6 more...)

FOX News

Country:

Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (1.00)
Asia > South Korea (0.26)
Asia > China (0.26)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.06)

Industry: Energy > Power Industry > Utilities > Nuclear (1.00)

Technology: Information Technology > Artificial Intelligence > Robots (0.42)

Add feedback

EuroPED-NN: Uncertainty aware surrogate model

Alvarez, A. Panera, Ho, A., Jarvinen, A., Saarelma, S., Wiesen, S., Contributors, JET

arXiv.org Artificial IntelligenceFeb-1-2024

This work successfully generates uncertainty aware surrogate models, via the Bayesian neural network with noise contrastive prior (BNN-NCP) technique, of the EuroPED plasma pedestal model using data from the JET-ILW pedestal database and subsequent model evaluations. All this conform EuroPED-NN. The BNN-NCP technique is proven to be a good fit for uncertainty aware surrogate models, matching the output results as a regular neural network, providing prediction's confidence as uncertainties, and highlighting the out of distribution (OOD) regions using surrogate model uncertainties. This provides critical insights into model robustness and reliability. EuroPED-NN has been physically validated, first, analyzing electron density $n_e\!\left(\psi_{\text{pol}}=0.94\right)$ with respect to increasing plasma current, $I_p$, and second, validating the $\Delta-\beta_{p,ped}$ relation associated with the EuroPED model. Affirming the robustness of the underlying physics learned by the surrogate model.

europed-nn, prediction, surrogate model, (15 more...)

arXiv.org Artificial Intelligence

2402.0076

Country:

Europe > United Kingdom (0.14)
Europe > Finland (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Energy (0.46)
Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

TCL 6-Series 2022 Model R655 Review: The Best Value TV Right Now

WIREDJan-23-2023, 12:00:00 GMT

There's nothing like something you can easily unbox, plug in, and turn on without cracking a manual or downloading a PDF. So why am I still so obsessed with TCL's 6-Series? In the past I've said that the 6-Series was the best TV for most people based largely on how well the screen looks for your dollars. It wasn't for TCL's looks or sleek interfaces and apps--there were, and are, plenty of other mid-tier options from Vizio, Hisense, and others that get that job done right. But the latest 6-Series now wins out for sheer physical simplicity. It comes with a center pedestal stand, and you barely have to touch a settings menu.

artificial intelligence, interface, soundbar, (7 more...)

WIRED

Industry: Leisure & Entertainment > Games > Computer Games (0.31)

Technology: Information Technology > Artificial Intelligence (0.31)

Add feedback

22 Best Cyber Monday Soundbar and TV Deals (2022): Samsung, Vizio, LG, and More

WIREDNov-28-2022, 12:45:00 GMT

It's a great time to upgrade your home theater thanks to some excellent Cyber Monday TV and soundbar deals. If you've yet to take the plunge to a modern 4K TV, or you are still listening to your favorite shows and movies through those tinny built-in TV speakers, there are massive reasons to upgrade. Modern home theater technology now has better backlighting, sharper resolution, and immersive surround sound for less money required than ever before. Go on, convert your living room into a mini cinema. Updated Monday, November 28: We've added two new TV deals on sets from LG and Sony and moved a group of dead TV deals to the bottom of the article, just in case you want to check if they're back in stock. We've also updated prices and retailers throughout.

monday soundbar and tv deal, soundbar, wired recommend, (13 more...)

WIRED

Country: Europe > Monaco (0.04)

Industry:

Appliances & Durable Goods (1.00)
Semiconductors & Electronics (0.85)
Retail > Online (0.61)

Technology: Information Technology > Artificial Intelligence (0.71)

Add feedback

Reusable neural skill embeddings for vision-guided whole body movement and object manipulation

Merel, Josh, Tunyasuvunakool, Saran, Ahuja, Arun, Tassa, Yuval, Hasenclever, Leonard, Pham, Vu, Erez, Tom, Wayne, Greg, Heess, Nicolas

arXiv.org Artificial IntelligenceNov-15-2019

Both in simulation settings and robotics, there is an ambition to produce flexible control systems that can enable complex bodies to perform dynamic locomotion and natural object manipulation. In previous work, we developed a framework to train locomotor skills and reuse these skills for whole-body visuomotor tasks. Here, we extend this line of work to tasks involving whole body movement as well as visually guided manipulation of objects. This setting poses novel challenges in terms of task specification, exploration, and generalization. We develop an integrated approach consisting of a flexible motor primitive module, demonstrations, an instructed training regime as well as curricula in the form of task variations. We demonstrate the utility of our approach for solving challenging whole body tasks that require joint locomotion and manipulation, and characterize its behavioral robustness. We also provide a high-level overview video, see https://youtu.be/t0RDGSnE3cM .

demonstration, pedestal, warehouse task, (15 more...)

arXiv.org Artificial Intelligence

1911.06636

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback