Robot Photographer


PhotoBot: Reference-Guided Interactive Photography via Natural Language

Limoyo, Oliver, Li, Jimmy, Rivkin, Dmitriy, Kelly, Jonathan, Dudek, Gregory

arXiv.org Artificial Intelligence

We introduce PhotoBot, a framework for automated photo acquisition based on an interplay between high-level human language guidance and a robot photographer. We propose to communicate photography suggestions to the user via a reference picture that is retrieved from a curated gallery. We exploit a visual language model (VLM) and an object detector to characterize reference pictures via textual descriptions, and use a large language model (LLM) to retrieve relevant reference pictures based on a user's language query through text-based reasoning. To establish correspondences between the reference picture and the observed scene, we exploit pre-trained features from a vision transformer capable of capturing semantic similarity across significantly varying images. Using these features, we compute pose adjustments for an RGB-D camera by solving a Perspective-n-Point (PnP) problem. We demonstrate our approach on a real-world manipulator equipped with a wrist camera. Our user studies show that photos taken by PhotoBot are often more aesthetically pleasing than those taken by users themselves, as measured by human feedback.
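The pose-adjustment step solves a Perspective-n-Point problem from 2D-3D matches. The abstract does not say which solver is used; as an illustrative, self-contained sketch (NumPy only, all data synthetic and hypothetical), a basic Direct Linear Transform estimate of the camera projection matrix from matched points looks like:

```python
import numpy as np

def dlt_projection(points_3d, points_2d):
    """Estimate a 3x4 projection matrix from >= 6 non-coplanar
    2D-3D correspondences via the Direct Linear Transform (DLT)."""
    A = []
    for (X, Y, Z), (u, v) in zip(points_3d, points_2d):
        A.append([0, 0, 0, 0, -X, -Y, -Z, -1, v * X, v * Y, v * Z, v])
        A.append([X, Y, Z, 1, 0, 0, 0, 0, -u * X, -u * Y, -u * Z, -u])
    # The solution is the right singular vector of A with the smallest
    # singular value, reshaped into a 3x4 matrix (up to scale).
    _, _, Vt = np.linalg.svd(np.asarray(A, dtype=float))
    return Vt[-1].reshape(3, 4)

def reproject(P, points_3d):
    """Project 3D points with P and dehomogenize to pixel coordinates."""
    X = np.hstack([points_3d, np.ones((len(points_3d), 1))])
    x = X @ P.T
    return x[:, :2] / x[:, 2:3]
```

In practice one would use a calibrated, robust solver (e.g., OpenCV's `cv2.solvePnP`, typically with RANSAC over the feature matches), since plain DLT ignores known camera intrinsics and is sensitive to noise.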


ANSEL Photobot: A Robot Event Photographer with Semantic Intelligence

Rivkin, Dmitriy, Dudek, Gregory, Kakodkar, Nikhil, Meger, David, Limoyo, Oliver, Liu, Xue, Hogan, Francois

arXiv.org Artificial Intelligence

Our work examines the way in which large language models can be used for robotic planning and sampling, specifically in the context of automated photographic documentation. In particular, we illustrate how to produce a photo-taking robot with an exceptional level of semantic awareness by leveraging recent advances in general-purpose language models (LMs) and vision-language models (VLMs). Given a high-level description of an event, we use an LM to generate a natural-language list of photo descriptions that one would expect a photographer to capture at the event. We then use a VLM to identify the best matches to these descriptions in the robot's video stream. The photo portfolios generated by our method are consistently rated as more appropriate to the event by human evaluators than those generated by existing methods.
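The abstract does not name the embedding models, but assuming a CLIP-style joint embedding space for text and images (the vectors below are synthetic stand-ins, not ANSEL's actual features), the "best match per description" step reduces to a cosine-similarity argmax over frames:

```python
import numpy as np

def cosine_sim(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def assign_frames(description_embs, frame_embs):
    """For each LM-generated photo description, pick the index of the
    video frame whose embedding is most similar to it."""
    picks = []
    for d in description_embs:
        scores = [cosine_sim(d, f) for f in frame_embs]
        picks.append(int(np.argmax(scores)))
    return picks
```

A real system would also need to deduplicate frames and threshold low-scoring matches so that descriptions with no good counterpart in the stream are skipped rather than force-matched.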


Shutter, the Robot Photographer: Leveraging Behavior Trees for Public, In-the-Wild Human-Robot Interactions

Lew, Alexander, Thompson, Sydney, Tsoi, Nathan, Vázquez, Marynel

arXiv.org Artificial Intelligence

Deploying interactive systems in-the-wild requires adaptability to situations not encountered in lab environments. Our work details our experience about the impact of architecture choice on behavior reusability and reactivity while deploying a public interactive system. In particular, we introduce Shutter, a robot photographer and a platform for public interaction. In designing Shutter's architecture, we focused on adaptability for in-the-wild deployment, while developing a reusable platform to facilitate future research in public human-robot interaction. We find that behavior trees allow reactivity, especially in group settings, and encourage designing reusable behaviors.
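The abstract's claim about behavior trees can be made concrete with a minimal sketch (node names and structure are illustrative, not Shutter's actual implementation): because the whole tree is re-ticked every control cycle, a higher-priority branch can preempt a lower one, which is the reactivity the authors describe, and subtrees are reusable behaviors.

```python
from enum import Enum

class Status(Enum):
    SUCCESS = 1
    FAILURE = 2
    RUNNING = 3

class Action:
    """Leaf node wrapping a callable that returns a Status."""
    def __init__(self, fn):
        self.fn = fn
    def tick(self):
        return self.fn()

class Sequence:
    """Ticks children in order; stops at the first child
    that is not SUCCESS and propagates its status."""
    def __init__(self, *children):
        self.children = children
    def tick(self):
        for child in self.children:
            status = child.tick()
            if status is not Status.SUCCESS:
                return status
        return Status.SUCCESS

class Fallback:
    """Ticks children in order; stops at the first child that is not
    FAILURE (a 'try alternatives' node)."""
    def __init__(self, *children):
        self.children = children
    def tick(self):
        for child in self.children:
            status = child.tick()
            if status is not Status.FAILURE:
                return status
        return Status.FAILURE
```

A hypothetical photographer tree might be `Fallback(Sequence(person_detected, take_photo), idle_behavior)`: each cycle the robot tries to photograph a detected person and otherwise falls back to idling.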


Pixy drone hands-on: A flying robot photographer for Snapchat users

Engadget

Drones are everywhere these days, filming dramatic reveals and awe-inspiring scenery for social media platforms. The problem is, they're not exactly approachable for beginners who have only ever used a smartphone. Last month, Snap debuted the $230 Pixy drone exactly for those people. It requires very little skill and acts like a personal robot photographer to help you produce nifty aerial shots. You don't need to pilot the Pixy.




Say Cheese! Experiences with a Robot Photographer

Byers, Zachary, Dixon, Michael, Smart, William D., Grimm, Cindy M.

AI Magazine

We have developed an autonomous robot system that takes well-composed photographs of people at social events, such as weddings and conference receptions. In this article, we outline the overall architecture of the system and describe how the various components interrelate. We also describe our experiences deploying the robot photographer at a number of real-world events.


Say Cheese! Experiences with a Robot Photographer

Byers, Zachary, Dixon, Michael, Smart, William D., Grimm, Cindy M.

AI Magazine

We introduced a sensor abstraction layer to separate the task layer from concerns about physical sensing devices. We process the sensor information (from the laser rangefinder in this application) into distance measurements from the center of the robot, thus allowing consideration of sensor error models and performance. This model makes system debugging significantly easier, because we know exactly what each sensor reading is at every point in the computation; something that would not be the case if we were reading from the sensors every time a reading was used in a calculation. This model also allows us to inject modified sensor readings into the system, as described in the next section.
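A minimal sketch of such a sensor abstraction layer (class and method names are illustrative, not from the paper): readings are snapshotted once per control cycle so every computation in that cycle sees identical values, and synthetic readings can be injected in place of the physical sensor.

```python
class SensorLayer:
    """Snapshots raw sensor readings once per control cycle and
    supports injecting modified readings for testing."""

    def __init__(self, read_fn):
        self._read_fn = read_fn   # callable returning raw readings
        self._snapshot = None
        self._injected = None

    def refresh(self):
        """Call once at the start of each control cycle; all later
        queries in the cycle see this single snapshot."""
        raw = self._injected if self._injected is not None else self._read_fn()
        self._snapshot = list(raw)

    def inject(self, readings):
        """Override the physical sensor with synthetic readings."""
        self._injected = list(readings)

    def readings(self):
        """Processed readings (e.g., distances from the robot center)."""
        return self._snapshot
```

The per-cycle snapshot is what makes debugging tractable: a logged cycle can be replayed exactly by injecting its recorded readings.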