Goto

Collaborating Authors: Delp, Edward J.


FairSSD: Understanding Bias in Synthetic Speech Detectors

arXiv.org Artificial Intelligence

Methods that generate synthetic speech perceptually indistinguishable from speech recorded by a human speaker are readily available. Several incidents of synthetic speech generated by these methods being misused to commit fraud have been reported. To counter such misuse, many methods have been proposed to detect synthetic speech. Some of these detectors are more interpretable, can generalize to detect synthetic speech in the wild, and are robust to noise. However, limited work has been done on understanding bias in these detectors. In this work, we examine bias in existing synthetic speech detectors to determine whether they unfairly target a particular gender, age, or accent group. We also inspect whether these detectors have a higher misclassification rate for bona fide speech from speech-impaired speakers than from fluent speakers. Extensive experiments on 6 existing synthetic speech detectors using more than 0.9 million speech signals demonstrate that most detectors are gender, age, and accent biased, and future work is needed to ensure fairness. To support future research, we release our evaluation dataset, the models used in our study, and source code at https://gitlab.com/viper-purdue/fairssd.
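The core of such a bias audit is comparing error rates across demographic groups. Below is a minimal sketch of that kind of group-wise comparison, assuming a table of per-utterance detector scores with hypothetical group, label, and score columns and a 0.5 decision threshold; this is an illustration, not the released FairSSD code.

# Minimal sketch of a group-wise fairness audit for a synthetic speech
# detector. Column names ("group", "label", "score") and the 0.5 threshold
# are illustrative assumptions, not the FairSSD release.
import pandas as pd

def groupwise_error_rates(df: pd.DataFrame, threshold: float = 0.5) -> pd.DataFrame:
    """Per-group false positive rate (bona fide flagged as synthetic)
    and false negative rate (synthetic speech missed)."""
    df = df.assign(pred=(df["score"] >= threshold).astype(int))
    rows = []
    for group, g in df.groupby("group"):
        bona = g[g["label"] == 0]   # label 0 = bona fide speech
        synth = g[g["label"] == 1]  # label 1 = synthetic speech
        rows.append({
            "group": group,
            "fpr": (bona["pred"] == 1).mean() if len(bona) else float("nan"),
            "fnr": (synth["pred"] == 0).mean() if len(synth) else float("nan"),
            "n": len(g),
        })
    return pd.DataFrame(rows)

# A detector would be considered biased if, e.g., max(fpr) - min(fpr)
# across gender, age, or accent groups is large.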


Compression Robust Synthetic Speech Detection Using Patched Spectrogram Transformer

arXiv.org Artificial Intelligence

Many deep learning synthetic speech generation tools are readily available. The misuse of synthetic speech has led to financial fraud, impersonation of people, and the spread of misinformation. For this reason, forensic methods that can detect synthetic speech have been proposed. Existing methods often overfit on one dataset, and their performance drops substantially in practical scenarios such as detecting synthetic speech shared on social platforms. In this paper we propose the Patched Spectrogram Synthetic Speech Detection Transformer (PS3DT), a synthetic speech detector that converts a time-domain speech signal to a mel-spectrogram and processes it in patches using a transformer neural network. We evaluate the detection performance of PS3DT on the ASVspoof2019 dataset. Our experiments show that PS3DT performs well on ASVspoof2019 compared to other approaches that use spectrograms for synthetic speech detection. We also investigate the generalization performance of PS3DT on the In-the-Wild dataset; PS3DT generalizes better than several existing methods when detecting synthetic speech from an out-of-distribution dataset. Finally, we evaluate the robustness of PS3DT in detecting telephone-quality synthetic speech and synthetic speech shared on social platforms (compressed speech). PS3DT is robust to compression and detects telephone-quality synthetic speech better than several existing methods.
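The abstract describes a waveform-to-mel-spectrogram-to-patch-tokens-to-transformer pipeline. A minimal sketch of that front end follows, assuming PyTorch/torchaudio and illustrative sizes (80 mel bands, 16x16 patches, a 256-dimensional encoder); these numbers are guesses, not the PS3DT configuration.

# Sketch of a patched-spectrogram front end: waveform -> mel-spectrogram ->
# non-overlapping tiles, each flattened into a token for a transformer.
import torch
import torchaudio

mel = torchaudio.transforms.MelSpectrogram(sample_rate=16000, n_mels=80)

def to_patches(wave: torch.Tensor, patch: int = 16) -> torch.Tensor:
    spec = mel(wave).log1p()                             # (1, n_mels, frames)
    spec = spec[..., : spec.shape[-1] // patch * patch]  # trim to a multiple
    # split into (patch x patch) tiles, flatten each tile into a token
    tiles = spec.unfold(1, patch, patch).unfold(2, patch, patch)
    return tiles.reshape(-1, patch * patch)              # (tokens, patch*patch)

proj = torch.nn.Linear(16 * 16, 256)
encoder = torch.nn.TransformerEncoder(
    torch.nn.TransformerEncoderLayer(d_model=256, nhead=8, batch_first=True),
    num_layers=4,
)
tokens = proj(to_patches(torch.randn(1, 16000 * 4)))  # 4 s of dummy audio
features = encoder(tokens.unsqueeze(0))               # (1, tokens, 256)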


Illumination Variation Correction Using Image Synthesis For Unsupervised Domain Adaptive Person Re-Identification

arXiv.org Artificial Intelligence

Unsupervised domain adaptive (UDA) person re-identification (re-ID) aims to learn identity information from labeled images in source domains and apply it to unlabeled images in a target domain. One major issue with many unsupervised re-identification methods is that they do not perform well under large domain variations such as illumination, viewpoint, and occlusion. In this paper, we propose a Synthesis Model Bank (SMB) to handle illumination variation in unsupervised person re-ID. The proposed SMB consists of several convolutional neural networks (CNNs) for feature extraction and Mahalanobis matrices for distance metrics. They are trained on synthetic data with different illumination conditions such that their synergistic effect makes the SMB robust to illumination variation. To better quantify illumination intensity and improve the quality of synthetic images, we introduce a new 3D virtual-human dataset for GAN-based image synthesis. In our experiments, the proposed SMB outperforms other synthesis methods on several re-ID benchmarks.
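The distance-metric half of the SMB pairs CNN features with Mahalanobis matrices. A minimal sketch of fitting and applying such a metric, with random placeholder embeddings standing in for the trained CNN features (not the authors' models):

# Sketch of a Mahalanobis distance metric over CNN feature vectors.
# The precision matrix here is estimated from data rather than learned,
# which is a simplifying assumption.
import numpy as np

def fit_mahalanobis(features: np.ndarray, eps: float = 1e-6) -> np.ndarray:
    """Return the precision matrix M so that d(x, y) = sqrt((x-y)^T M (x-y))."""
    cov = np.cov(features, rowvar=False)
    cov += eps * np.eye(cov.shape[0])   # regularize for invertibility
    return np.linalg.inv(cov)

def mahalanobis(x: np.ndarray, y: np.ndarray, M: np.ndarray) -> float:
    d = x - y
    return float(np.sqrt(d @ M @ d))

gallery = np.random.randn(500, 128)     # placeholder CNN embeddings
M = fit_mahalanobis(gallery)
print(mahalanobis(gallery[0], gallery[1], M))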


Diffusion Model with Clustering-based Conditioning for Food Image Generation

arXiv.org Artificial Intelligence

Image-based dietary assessment serves as an efficient and accurate solution for recording and analyzing nutrition intake using eating occasion images as input. Deep learning-based techniques are commonly used to perform image analysis such as food classification, segmentation, and portion size estimation, and they rely on large amounts of annotated food images for training. However, this data dependency poses a significant barrier to real-world applications, because acquiring a substantial, diverse, and balanced set of food images can be challenging. One potential solution is to use synthetic food images for data augmentation. Although existing work has explored generative adversarial network (GAN)-based structures for generation, the quality of synthetic food images remains subpar. In addition, while diffusion-based generative models have shown promising results for general image generation tasks, generating food images is challenging due to the substantial intra-class variance. In this paper, we investigate the generation of synthetic food images based on the conditional diffusion model and propose an effective clustering-based training framework, named ClusDiff, for generating high-quality and representative food images. The proposed method is evaluated on the Food-101 dataset and shows improved performance compared with existing image generation works. We also demonstrate that the synthetic food images generated by ClusDiff can help address the severe class imbalance issue in long-tailed food classification on the VFN-LT dataset.
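A minimal sketch of the clustering-based conditioning idea: within each food class, cluster image embeddings and use the (class, cluster) pair as the condition the diffusion model is trained on, so generation can target a visually coherent sub-mode of a high-variance class. The embedding source and cluster count below are assumptions, not the ClusDiff implementation.

# Sketch of deriving condition ids from per-class clustering of embeddings.
import numpy as np
from sklearn.cluster import KMeans

def cluster_conditions(embeddings: np.ndarray, labels: np.ndarray,
                       k: int = 4) -> np.ndarray:
    """Return one condition id per image, unique per (class, cluster)."""
    cond = np.zeros(len(labels), dtype=int)
    for c in np.unique(labels):
        idx = np.where(labels == c)[0]
        km = KMeans(n_clusters=min(k, len(idx)), n_init=10).fit(embeddings[idx])
        cond[idx] = c * k + km.labels_   # unique id per (class, cluster)
    return cond

emb = np.random.randn(1000, 512)           # placeholder image embeddings
lab = np.random.randint(0, 10, size=1000)  # placeholder class labels
conditions = cluster_conditions(emb, lab)  # fed to the conditional model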


Forensic Analysis of Synthetically Generated Scientific Images

arXiv.org Artificial Intelligence

The widespread diffusion of synthetically generated content is a serious threat that needs urgent countermeasures. The generation of synthetic content is not restricted to multimedia data like videos, photographs, or audio sequences; it covers a significantly broader area that includes biological images as well, such as western-blot and microscopic images. In this paper, we focus on the detection of synthetically generated western-blot images. Western-blot images are widely used in the biomedical literature, and it has already been shown that these images can be easily counterfeited, with little hope of spotting manipulations by visual inspection or with standard forensic detectors. To overcome the absence of a publicly available dataset, we create a new dataset comprising more than 14K original western-blot images and 18K synthetic western-blot images generated by three different state-of-the-art generation methods. Then, we investigate different strategies to detect synthetic western blots, exploring binary classification methods as well as one-class detectors. In both scenarios, we never use synthetic western-blot images at the training stage. The achieved results show that synthetically generated western-blot images can be spotted with good accuracy, even though the detectors are not optimized for synthetic versions of these scientific images.
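The one-class setting described above amounts to fitting a detector on features of real western blots only and flagging outliers as synthetic at test time. A minimal sketch, with placeholder features and illustrative OneClassSVM hyperparameters rather than the paper's detectors:

# Sketch of one-class synthetic-image detection: train on real images only.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import OneClassSVM

real_train = np.random.rand(1000, 64)  # placeholder features of real blots
test = np.random.rand(10, 64)          # mixed real/synthetic at test time

detector = make_pipeline(StandardScaler(), OneClassSVM(nu=0.05, kernel="rbf"))
detector.fit(real_train)               # never sees synthetic images

pred = detector.predict(test)          # +1 = looks real, -1 = outlier
print((pred == -1).nonzero()[0])       # indices flagged as synthetic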


Image-Based Plant Wilting Estimation

arXiv.org Artificial Intelligence

Many plants become limp or droop because of heat, loss of water, or disease; this is known as wilting. In this paper, we examine plant wilting caused by bacterial infection. In particular, we want to design a metric for wilting based on images acquired of the plant. A quantifiable wilting metric will be useful in studying bacterial wilt and identifying resistance genes. Since there is no standard way to estimate wilting, it is common to use ad hoc visual scores, which is highly subjective and requires expert knowledge of the plants and the disease mechanism. Our solution uses several wilting metrics computed from RGB images of the plants. We also designed several experiments to demonstrate that our metrics are effective at estimating wilting in plants.
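One plausible instance of such an image-based metric is to segment the plant by color and measure how low its mass sits in the frame, since drooping shifts foliage downward. The HSV threshold and centroid-height score below are illustrative assumptions (using OpenCV), not the paper's exact metrics.

# Sketch of a simple RGB-image wilting proxy via green-pixel segmentation.
import cv2
import numpy as np

def wilting_score(bgr: np.ndarray) -> float:
    """Higher score = plant mass concentrated lower in the image (droop)."""
    hsv = cv2.cvtColor(bgr, cv2.COLOR_BGR2HSV)
    mask = cv2.inRange(hsv, (35, 40, 40), (85, 255, 255))  # green pixels
    ys = np.nonzero(mask)[0]
    if ys.size == 0:
        return float("nan")                  # no plant found
    return float(ys.mean() / mask.shape[0])  # normalized centroid height

img = cv2.imread("plant.png")                # hypothetical input image
if img is not None:
    print(wilting_score(img))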


We Need No Pixels: Video Manipulation Detection Using Stream Descriptors

arXiv.org Machine Learning

Manipulating video content is easier than ever. Due to the potential for misuse of manipulated content, multiple detection techniques that analyze the pixel data of videos have been proposed. However, clever manipulators must also carefully forge the metadata and auxiliary header information, which is harder to do for videos than for images. In this paper, we propose to identify forged videos by analyzing their multimedia stream descriptors with simple binary classifiers, completely avoiding the pixel space. Using well-known datasets, our results show that this scalable approach achieves a high manipulation detection score provided the manipulators have not carefully sanitized the multimedia stream descriptors.
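A minimal sketch of a descriptor-only pipeline: pull container and stream metadata with ffprobe (never decoding pixels) and feed a few numeric fields to an off-the-shelf classifier. The chosen fields and the random-forest model are assumptions, not the paper's exact feature set.

# Sketch of extracting stream-descriptor features without touching pixels.
import json
import subprocess
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def eval_fps(ratio: str) -> float:
    num, den = ratio.split("/")
    return float(num) / float(den) if float(den) else 0.0

def stream_features(path: str) -> list:
    out = subprocess.run(
        ["ffprobe", "-v", "quiet", "-print_format", "json",
         "-show_format", "-show_streams", path],
        capture_output=True, check=True,
    )
    info = json.loads(out.stdout)
    video = next(s for s in info["streams"] if s["codec_type"] == "video")
    return [float(info["format"]["bit_rate"]),    # container-level fields
            float(info["format"]["duration"]),
            float(video.get("nb_frames", 0)),     # stream-level fields
            eval_fps(video["avg_frame_rate"])]

# With features X from known original/manipulated videos and 0/1 labels y:
# clf = RandomForestClassifier(n_estimators=200).fit(X, y)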