AITopics | Birch, Alex

Collaborating Authors

Birch, Alex

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Improvements to SDXL in NovelAI Diffusion V3

Ossa, Juan, Doğan, Eren, Birch, Alex, Johnson, F.

arXiv.org Artificial IntelligenceSep-26-2024

This technical report is structured as follows. In Section 2, we describe our enhancements in detail. Following that, we evaluate our contributions in Section 5. Finally, we draw conclusions in Section 6.

artificial intelligence, diffusion model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2409.15997

Genre: Research Report (0.53)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision (0.95)
Information Technology > Sensing and Signal Processing > Image Processing (0.70)

Add feedback

Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers

Crowson, Katherine, Baumann, Stefan Andreas, Birch, Alex, Abraham, Tanishq Mathew, Kaplan, Daniel Z., Shippole, Enrico

arXiv.org Artificial IntelligenceJan-21-2024

We present the Hourglass Diffusion Transformer (HDiT), an image generative model that exhibits linear scaling with pixel count, supporting training at high-resolution (e.g. $1024 \times 1024$) directly in pixel-space. Building on the Transformer architecture, which is known to scale to billions of parameters, it bridges the gap between the efficiency of convolutional U-Nets and the scalability of Transformers. HDiT trains successfully without typical high-resolution training techniques such as multiscale architectures, latent autoencoders or self-conditioning. We demonstrate that HDiT performs competitively with existing models on ImageNet $256^2$, and sets a new state-of-the-art for diffusion models on FFHQ-$1024^2$.

large language model, machine learning, resolution, (17 more...)

arXiv.org Artificial Intelligence

2401.11605

Country: Europe (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.46)

Add feedback