local discriminator
- North America > United States > California (0.04)
- North America > Canada > Quebec > Montreal (0.04)
Domain-Adaptive Diagnosis of Lewy Body Disease with Transferability Aware Transformer
Yu, Xiaowei, Zhang, Jing, Chen, Tong, Zhuang, Yan, Chen, Minheng, Cao, Chao, Lyu, Yanjun, Zhang, Lu, Su, Li, Liu, Tianming, Zhu, Dajiang
Lewy Body Disease (LBD) is a common yet understudied form of dementia that imposes a significant burden on public health. It shares clinical similarities with Alzheimer's disease (AD), as both progress through stages of normal cognition, mild cognitive impairment, and dementia. A major obstacle in LBD diagnosis is data scarcity, which limits the effectiveness of deep learning. In contrast, AD datasets are more abundant, offering potential for knowledge transfer. However, LBD and AD data are typically collected from different sites using different machines and protocols, resulting in a distinct domain shift. To effectively leverage AD data while mitigating domain shift, we propose a Transferability Aware Transformer (TAT) that adapts knowledge from AD to enhance LBD diagnosis. Our method utilizes structural connectivity (SC) derived from structural MRI as training data. Built on the attention mechanism, TAT adaptively assigns greater weights to disease-transferable features while suppressing domain-specific ones, thereby reducing domain shift and improving diagnostic accuracy with limited LBD data. The experimental results demonstrate the effectiveness of TAT. To the best of our knowledge, this is the first study to explore domain adaptation from AD to LBD under conditions of data scarcity and domain shift, providing a promising framework for domain-adaptive diagnosis of rare diseases.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- North America > United States > Texas (0.04)
- North America > United States > Indiana > Marion County > Indianapolis (0.04)
- Europe > United Kingdom > England > South Yorkshire > Sheffield (0.04)
- Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (0.72)
- Health & Medicine > Therapeutic Area > Neurology > Parkinson's Disease (0.62)
- Health & Medicine > Therapeutic Area > Neurology > Dementia (0.55)
Reviews: Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language
After rebuttal comments: * readability: I trust the authors to update the paper based on my suggestions (as they agreed to in their rebuttal). For AttrGAN, they did change the weight sweep and for SISGAN they used the same hyperparameters as they used in their method (which I would object to in general, but given that the authors took most of their hyperparameters from DCGAN, does not create an unfair advantage). I expect the additional details of the experimental results to be added in the paper (as supplementary material). Ensure that content that is not relevant to the text does not change. Method: to avoid changing too much of the image, use local discriminators that learn the presence of individual visual attributes.
An Efficient Illumination Invariant Tiger Detection Framework for Wildlife Surveillance
Pendharkar, Gaurav, Micheal, A. Ancy, Misquitta, Jason, Kaippada, Ranjeesh
With the advent of artificial intelligence, tiger surveillance can be automated using object detection. In this paper, an accurate illumination invariant framework is proposed based on EnlightenGAN and YOLOv8 for tiger detection. The fine-tuned YOLOv8 model achieves a mAP score of 61% without illumination enhancement. The illumination enhancement improves the mAP by 0.7%.
GAS-NeXt: Few-Shot Cross-Lingual Font Generator
He, Haoyang, Jin, Xin, Chen, Angela
Generating new fonts is a time-consuming and labor-intensive task, especially in a language with a huge amount of characters like Chinese. Various deep learning models have demonstrated the ability to efficiently generate new fonts with a few reference characters of that style, but few models support cross-lingual font generation. This paper presents GAS-NeXt, a novel few-shot cross-lingual font generator based on AGIS-Net and Font Translator GAN, and improve the performance metrics such as Fr\'echet Inception Distance (FID), Structural Similarity Index Measure(SSIM), and Pixel-level Accuracy (pix-acc). Our approaches include replacing the original encoder and decoder with the idea of layer attention and context-aware attention from Font Translator GAN, while utilizing the shape, texture, and local discriminators of AGIS-Net. In our experiment on English-to-Chinese font translation, we observed better results in fonts with distinct local features than conventional Chinese fonts compared to results obtained from Font Translator GAN. We also validate our method on multiple languages and datasets.
Document Image Binarization in JPEG Compressed Domain using Dual Discriminator Generative Adversarial Networks
Rajesh, Bulla, Agrawal, Manav Kamlesh, Bhuva, Milan, Kishore, Kisalaya, Javed, Mohammed
Image binarization techniques are being popularly used in enhancement of noisy and/or degraded images catering different Document Image Anlaysis (DIA) applications like word spotting, document retrieval, and OCR. Most of the existing techniques focus on feeding pixel images into the Convolution Neural Networks to accomplish document binarization, which may not produce effective results when working with compressed images that need to be processed without full decompression. Therefore in this research paper, the idea of document image binarization directly using JPEG compressed stream of document images is proposed by employing Dual Discriminator Generative Adversarial Networks (DD-GANs). Here the two discriminator networks - Global and Local work on different image ratios and use focal loss as generator loss. The proposed model has been thoroughly tested with different versions of DIBCO dataset having challenges like holes, erased or smudged ink, dust, and misplaced fibres. The model proved to be highly robust, efficient both in terms of time and space complexities, and also resulted in state-of-the-art performance in JPEG compressed domain.
Training Federated GANs with Theoretical Guarantees: A Universal Aggregation Approach
Zhang, Yikai, Qu, Hui, Chang, Qi, Liu, Huidong, Metaxas, Dimitris, Chen, Chao
Recently, Generative Adversarial Networks (GANs) have demonstrated their potential in federated learning, i.e., learning a centralized model from data privately hosted by multiple sites. A federatedGAN jointly trains a centralized generator and multiple private discriminators hosted at different sites. A major theoretical challenge for the federated GAN is the heterogeneity of the local data distributions. Traditional approaches cannot guarantee to learn the target distribution, which isa mixture of the highly different local distributions. This paper tackles this theoretical challenge, and for the first time, provides a provably correct framework for federated GAN. We propose a new approach called Universal Aggregation, which simulates a centralized discriminator via carefully aggregating the mixture of all private discriminators. We prove that a generator trained with this simulated centralized discriminator can learn the desired target distribution. Through synthetic and real datasets, we show that our method can learn the mixture of largely different distributions where existing federated GAN methods fail.
- North America > United States > New York > Suffolk County > Stony Brook (0.04)
- North America > United States > Virginia (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Health & Medicine > Health Care Technology (1.00)
- Education (0.68)
- Information Technology > Security & Privacy (0.68)
- Law > Statutes (0.67)
Landmark Assisted CycleGAN: Draw Me Like One of Your Cartoon Girls
In an iconic scene from the 1997 film "Titanic," Kate Winslet's oceangoing character Rose asks charming artist Jack Dawson (Leonardo DiCaprio) to "draw me like one of your French girls" -- that is, reclining nude on a chaise lounge. A flustered Jack obliges and this kindles a romance, but -- spoiler alert -- the ship hits an iceberg and Jack perishes protecting Rose from the icy North Atlantic waters. On a more robust vessel who knows what additional portrait styles the young lovebirds might have explored. For example, with the help of a new AI algorithm, Jack could have drawn Rose as a cute cartoon character. A group of researchers from the Chinese University of Hong Kong, Harbin Institute of Technology and Tencent have proposed a method to create such cartoon faces from photos of human faces via a novel CycleGAN model informed by facial landmarks.
LADN: Local Adversarial Disentangling Network for Facial Makeup and De-Makeup
Gu, Qiao, Wang, Guanzhi, Chiu, Mang Tik, Tai, Yu-Wing, Tang, Chi-Keung
We propose a local adversarial disentangling network (LADN) for facial makeup and de-makeup. Central to our method are multiple and overlapping local adversarial discriminators in a content-style disentangling network for achieving local detail transfer between facial images, with the use of asymmetric loss functions for dramatic makeup styles with high-frequency details. Existing techniques do not demonstrate or fail to transfer high-frequency details in a global adversarial setting, or train a single local discriminator only to ensure image structure consistency and thus work only for relatively simple styles. Unlike others, our proposed local adversarial discriminators can distinguish whether the generated local image details are consistent with the corresponding regions in the given reference image in cross-image style transfer in an unsupervised setting. Incorporating these technical contributions, we achieve not only state-of-the-art results on conventional styles but also novel results involving complex and dramatic styles with high-frequency details covering large areas across multiple facial features. A carefully designed dataset of unpaired before and after makeup images will be released.