AITopics | akira

Collaborating Authors

akira

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

AKiRa: Augmentation Kit on Rays for optical video generation

Wang, Xi, Courant, Robin, Christie, Marc, Kalogeiton, Vicky

arXiv.org Artificial IntelligenceDec-29-2024

Recent advances in text-conditioned video diffusion have greatly improved video quality. However, these methods offer limited or sometimes no control to users on camera aspects, including dynamic camera motion, zoom, distorted lens and focus shifts. These motion and optical aspects are crucial for adding controllability and cinematic elements to generation frameworks, ultimately resulting in visual content that draws focus, enhances mood, and guides emotions according to filmmakers' controls. In this paper, we aim to close the gap between controllable video generation and camera optics. To achieve this, we propose AKiRa (Augmentation Kit on Rays), a novel augmentation framework that builds and trains a camera adapter with a complex camera model over an existing video generation backbone. It enables fine-tuned control over camera motion as well as complex optical parameters (focal length, distortion, aperture) to achieve cinematic effects such as zoom, fisheye effect, and bokeh. Extensive experiments demonstrate AKiRa's effectiveness in combining and composing camera optics while outperforming all state-of-the-art methods. This work sets a new landmark in controlled and optically enhanced video generation, paving the way for future optical video generation methods.

akira, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2412.14158

Genre: Research Report (1.00)

Industry:

Media > Photography (1.00)
Media > Film (1.00)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Natural Language (0.68)

Add feedback

Akira's Machine Learning News -- Issue #38

#artificialintelligenceDec-24-2021, 01:05:50 GMT

In the following sections, I will introduce various articles and papers not only on the above contents but also on the following five topics. Since molecules have different interatomic distances depending on the nature of the target atoms, they proposed multi-scale Self-Attention, which adjusts the application of Attention according to the distance, and AFPS, which downsamples according to the Attention score. It showed good performance on quantum chemical molecular data sets.

artificial intelligence, machine learning news, model applicable, (2 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Akira's Machine Learning news -- #26

#artificialintelligenceSep-1-2021, 05:46:31 GMT

In the following sections, I will introduce various articles and papers not only on the above contents but also on the following five topics. MERLOT: Multimodal Neural Script Knowledge Models Using as much as 6 million video data and accompanying subtitles, MERIOT is proposed to perform self-supervised learning on both temporal and spatial tasks. It does not use any label information but can achieve SotA performance. Also, the accuracy of the pre-training continues to increase even with 6 million data, which is considered a promising research direction for the future.

akira, machine learning news

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Akira's Machine Learning news -- #21

#artificialintelligenceJul-9-2021, 07:10:30 GMT

In the following sections, I will introduce various articles and papers not only on the above contents but also on the following five topics. Winning tickets in pre-training are transferable -- arxiv.org The results show that Winning Ticket is present regardless of whether pre-training is supervised or unsupervised.

akira, hypothesis, machine learning news, (1 more...)

#artificialintelligence

Genre:

Contests & Prizes (1.00)
Research Report (0.74)

Industry: Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Classifying old Japanese characters using CNN

#artificialintelligenceJul-2-2017, 06:55:10 GMT

Jiro's pick this week is CNN for Old Japanese Character Classification by one of my colleagues Akira Agata. Nowadays, I probably go many days without seeing a handwritten document. From computers, to smartphones, to TVs, to books, almost every character I see is a printed character. So it's refreshing to see a handwritten document from time to time. This demo by Akira uses deep learning (convolutional neural networks) to classify various handwritten Japanese characters.

artificial intelligence, japanese character, machine learning, (8 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)

Add feedback