AITopics | local discriminator

Collaborating Authors

local discriminator

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Seonghyeon Nam, Yunji Kim, Seon Joo Kim

Neural Information Processing SystemsMar-16-2026, 09:42:13 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country: North America (0.28)

Genre: Research Report (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Seonghyeon Nam, Yunji Kim, Seon Joo Kim

Neural Information Processing SystemsFeb-14-2026, 19:00:30 GMT

However,most existing studies concentrate onthetext-to-image synthesis [11-14], which generates images from text descriptions without the original image.

artificial intelligence, arxivpreprintarxiv, discriminator, (18 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (0.50)
Information Technology > Sensing and Signal Processing > Image Processing (0.47)

Add feedback

Domain-Adaptive Diagnosis of Lewy Body Disease with Transferability Aware Transformer

Yu, Xiaowei, Zhang, Jing, Chen, Tong, Zhuang, Yan, Chen, Minheng, Cao, Chao, Lyu, Yanjun, Zhang, Lu, Su, Li, Liu, Tianming, Zhu, Dajiang

arXiv.org Artificial IntelligenceJul-15-2025

Lewy Body Disease (LBD) is a common yet understudied form of dementia that imposes a significant burden on public health. It shares clinical similarities with Alzheimer's disease (AD), as both progress through stages of normal cognition, mild cognitive impairment, and dementia. A major obstacle in LBD diagnosis is data scarcity, which limits the effectiveness of deep learning. In contrast, AD datasets are more abundant, offering potential for knowledge transfer. However, LBD and AD data are typically collected from different sites using different machines and protocols, resulting in a distinct domain shift. To effectively leverage AD data while mitigating domain shift, we propose a Transferability Aware Transformer (TAT) that adapts knowledge from AD to enhance LBD diagnosis. Our method utilizes structural connectivity (SC) derived from structural MRI as training data. Built on the attention mechanism, TAT adaptively assigns greater weights to disease-transferable features while suppressing domain-specific ones, thereby reducing domain shift and improving diagnostic accuracy with limited LBD data. The experimental results demonstrate the effectiveness of TAT. To the best of our knowledge, this is the first study to explore domain adaptation from AD to LBD under conditions of data scarcity and domain shift, providing a promising framework for domain-adaptive diagnosis of rare diseases.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2507.08839

Country:

North America > United States (0.68)
Europe > United Kingdom > England (0.28)

Genre: Research Report (0.84)

Industry:

Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (0.72)
Health & Medicine > Therapeutic Area > Neurology > Parkinson's Disease (0.62)
Health & Medicine > Therapeutic Area > Neurology > Dementia (0.55)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Reviews: Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

Neural Information Processing SystemsOct-8-2024, 05:41:18 GMT

After rebuttal comments: * readability: I trust the authors to update the paper based on my suggestions (as they agreed to in their rebuttal). For AttrGAN, they did change the weight sweep and for SISGAN they used the same hyperparameters as they used in their method (which I would object to in general, but given that the authors took most of their hyperparameters from DCGAN, does not create an unfair advantage). I expect the additional details of the experimental results to be added in the paper (as supplementary material). Ensure that content that is not relevant to the text does not change. Method: to avoid changing too much of the image, use local discriminators that learn the presence of individual visual attributes.

artificial intelligence, discriminator, machine learning, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Add feedback

An Efficient Illumination Invariant Tiger Detection Framework for Wildlife Surveillance

Pendharkar, Gaurav, Micheal, A. Ancy, Misquitta, Jason, Kaippada, Ranjeesh

arXiv.org Artificial IntelligenceJan-5-2024

With the advent of artificial intelligence, tiger surveillance can be automated using object detection. In this paper, an accurate illumination invariant framework is proposed based on EnlightenGAN and YOLOv8 for tiger detection. The fine-tuned YOLOv8 model achieves a mAP score of 61% without illumination enhancement. The illumination enhancement improves the mAP by 0.7%.

detection, surveillance, tiger detection, (10 more...)

arXiv.org Artificial Intelligence

2311.17552

Country:

Asia > Singapore (0.05)
Asia > Thailand (0.04)
Asia > Indonesia (0.04)
Asia > India > Tamil Nadu > Chennai (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

GAS-NeXt: Few-Shot Cross-Lingual Font Generator

He, Haoyang, Jin, Xin, Chen, Angela

arXiv.org Artificial IntelligenceDec-15-2022

Generating new fonts is a time-consuming and labor-intensive task, especially in a language with a huge amount of characters like Chinese. Various deep learning models have demonstrated the ability to efficiently generate new fonts with a few reference characters of that style, but few models support cross-lingual font generation. This paper presents GAS-NeXt, a novel few-shot cross-lingual font generator based on AGIS-Net and Font Translator GAN, and improve the performance metrics such as Fr\'echet Inception Distance (FID), Structural Similarity Index Measure(SSIM), and Pixel-level Accuracy (pix-acc). Our approaches include replacing the original encoder and decoder with the idea of layer attention and context-aware attention from Font Translator GAN, while utilizing the shape, texture, and local discriminators of AGIS-Net. In our experiment on English-to-Chinese font translation, we observed better results in fonts with distinct local features than conventional Chinese fonts compared to results obtained from Font Translator GAN. We also validate our method on multiple languages and datasets.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2212.02886

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.15)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.96)

Add feedback

Document Image Binarization in JPEG Compressed Domain using Dual Discriminator Generative Adversarial Networks

Rajesh, Bulla, Agrawal, Manav Kamlesh, Bhuva, Milan, Kishore, Kisalaya, Javed, Mohammed

arXiv.org Artificial IntelligenceSep-13-2022

Image binarization techniques are being popularly used in enhancement of noisy and/or degraded images catering different Document Image Anlaysis (DIA) applications like word spotting, document retrieval, and OCR. Most of the existing techniques focus on feeding pixel images into the Convolution Neural Networks to accomplish document binarization, which may not produce effective results when working with compressed images that need to be processed without full decompression. Therefore in this research paper, the idea of document image binarization directly using JPEG compressed stream of document images is proposed by employing Dual Discriminator Generative Adversarial Networks (DD-GANs). Here the two discriminator networks - Global and Local work on different image ratios and use focal loss as generator loss. The proposed model has been thoroughly tested with different versions of DIBCO dataset having challenges like holes, erased or smudged ink, dust, and misplaced fibres. The model proved to be highly robust, efficient both in terms of time and space complexities, and also resulted in state-of-the-art performance in JPEG compressed domain.

artificial intelligence, deep learning, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2209.05921

Country:

Asia > India (0.04)
Africa > Sudan (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Training Federated GANs with Theoretical Guarantees: A Universal Aggregation Approach

Zhang, Yikai, Qu, Hui, Chang, Qi, Liu, Huidong, Metaxas, Dimitris, Chen, Chao

arXiv.org Artificial IntelligenceFeb-9-2021

Recently, Generative Adversarial Networks (GANs) have demonstrated their potential in federated learning, i.e., learning a centralized model from data privately hosted by multiple sites. A federatedGAN jointly trains a centralized generator and multiple private discriminators hosted at different sites. A major theoretical challenge for the federated GAN is the heterogeneity of the local data distributions. Traditional approaches cannot guarantee to learn the target distribution, which isa mixture of the highly different local distributions. This paper tackles this theoretical challenge, and for the first time, provides a provably correct framework for federated GAN. We propose a new approach called Universal Aggregation, which simulates a centralized discriminator via carefully aggregating the mixture of all private discriminators. We prove that a generator trained with this simulated centralized discriminator can learn the desired target distribution. Through synthetic and real datasets, we show that our method can learn the mixture of largely different distributions where existing federated GAN methods fail.

dataset, discriminator, local discriminator, (14 more...)

arXiv.org Artificial Intelligence

2102.04655

Country:

North America > United States > New York > Suffolk County > Stony Brook (0.04)
North America > United States > Virginia (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Health Care Technology (1.00)
Education (0.68)
Information Technology > Security & Privacy (0.68)
Law > Statutes (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.34)

Add feedback

Landmark Assisted CycleGAN: Draw Me Like One of Your Cartoon Girls

#artificialintelligenceFeb-8-2020, 13:40:28 GMT

In an iconic scene from the 1997 film "Titanic," Kate Winslet's oceangoing character Rose asks charming artist Jack Dawson (Leonardo DiCaprio) to "draw me like one of your French girls" -- that is, reclining nude on a chaise lounge. A flustered Jack obliges and this kindles a romance, but -- spoiler alert -- the ship hits an iceberg and Jack perishes protecting Rose from the icy North Atlantic waters. On a more robust vessel who knows what additional portrait styles the young lovebirds might have explored. For example, with the help of a new AI algorithm, Jack could have drawn Rose as a cute cartoon character. A group of researchers from the Chinese University of Hong Kong, Harbin Institute of Technology and Tencent have proposed a method to create such cartoon faces from photos of human faces via a novel CycleGAN model informed by facial landmarks.

assisted cyclegan, cyclegan, landmark assisted cyclegan, (10 more...)

#artificialintelligence

Country:

Asia > China > Hong Kong (0.26)
Asia > China > Heilongjiang Province > Harbin (0.26)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.73)

Add feedback

LADN: Local Adversarial Disentangling Network for Facial Makeup and De-Makeup

Gu, Qiao, Wang, Guanzhi, Chiu, Mang Tik, Tai, Yu-Wing, Tang, Chi-Keung

arXiv.org Artificial IntelligenceApr-25-2019

We propose a local adversarial disentangling network (LADN) for facial makeup and de-makeup. Central to our method are multiple and overlapping local adversarial discriminators in a content-style disentangling network for achieving local detail transfer between facial images, with the use of asymmetric loss functions for dramatic makeup styles with high-frequency details. Existing techniques do not demonstrate or fail to transfer high-frequency details in a global adversarial setting, or train a single local discriminator only to ensure image structure consistency and thus work only for relatively simple styles. Unlike others, our proposed local adversarial discriminators can distinguish whether the generated local image details are consistent with the corresponding regions in the given reference image in cross-image style transfer in an unsupervised setting. Incorporating these technical contributions, we achieve not only state-of-the-art results on conventional styles but also novel results involving complex and dramatic styles with high-frequency details covering large areas across multiple facial features. A carefully designed dataset of unpaired before and after makeup images will be released.

artificial intelligence, discriminator, machine learning, (19 more...)

arXiv.org Artificial Intelligence

1904.11272

Country: North America > United States (0.46)

Genre: Research Report (0.50)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision > Face Recognition (0.87)

Add feedback