AITopics | salle

Collaborating Authors

salle

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

'I sent AI to art school!' The postmodern master who taught a machine to beef up his old work

The GuardianApr-15-2025, 15:51:27 GMT

By the time you read this article, there's a good chance it will have already been scanned by an artificially intelligent machine. If asked about the artist David Salle, large language models such as ChatGPT or Gemini may repurpose some of the words below to come up with their answer. The bigger the data set, the more convincing the response – and Salle has been written about exhaustively since he first rose to art world stardom in the 1980s. The question is whether AI can ever say anything new about the artist and his work, or if it's for ever condemned to generate more of the same. A similar question lingers beneath the surface of the paintings that Salle has been making since 2023, a new series of which he has just unveiled at Thaddaeus Ropac in London.

art school, pastoral, salle, (15 more...)

The Guardian

Country:

North America > United States > Oklahoma (0.05)
North America > United States > New York (0.05)
North America > United States > Kansas > Sedgwick County > Wichita (0.05)

Industry: Education > Curriculum > Subject-Specific Education (0.51)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models

Ji, Shengpeng, Zuo, Jialong, Fang, Minghui, Jiang, Ziyue, Chen, Feiyang, Duan, Xinyu, Huai, Baoxing, Zhao, Zhou

arXiv.org Artificial IntelligenceAug-28-2023

Recently, there has been a growing interest in the field of controllable Text-to-Speech (TTS). While previous studies have relied on users providing specific style factor values based on acoustic knowledge or selecting reference speeches that meet certain requirements, generating speech solely from natural text prompts has emerged as a new challenge for researchers. This challenge arises due to the scarcity of high-quality speech datasets with natural text style prompt and the absence of advanced text-controllable TTS models. In light of this, 1) we propose TextrolSpeech, which is the first large-scale speech emotion dataset annotated with rich text attributes. The dataset comprises 236,220 pairs of style prompt in natural text descriptions with five style factors and corresponding speech samples. Through iterative experimentation, we introduce a multi-stage prompt programming approach that effectively utilizes the GPT model for generating natural style descriptions in large volumes. 2) Furthermore, to address the need for generating audio with greater style diversity, we propose an efficient architecture called Salle. This architecture treats text controllable TTS as a language model task, utilizing audio codec codes as an intermediate representation to replace the conventional mel-spectrogram. Finally, we successfully demonstrate the ability of the proposed model by showing a comparable performance in the controllable TTS task. Audio samples are available at https://sall-e.github.io/

dataset, style prompt, textrolspeech, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ICASSP48485.2024.10445879

2308.1443

Country: North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
Information Technology > Artificial Intelligence > Speech > Speech Synthesis (0.88)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback