Generative AI
Data-Free Model-Related Attacks: Unleashing the Potential of Generative AI
Ye, Dayong, Zhu, Tianqing, Wang, Shang, Liu, Bo, Zhang, Leo Yu, Zhou, Wanlei, Zhang, Yang
Generative AI technology has become increasingly integrated into our daily lives, offering powerful capabilities to enhance productivity. However, these same capabilities can be exploited by adversaries for malicious purposes. While existing research on adversarial applications of generative AI predominantly focuses on cyberattacks, less attention has been given to attacks targeting deep learning models. In this paper, we introduce the use of generative AI for facilitating model-related attacks, including model extraction, membership inference, and model inversion. Our study reveals that adversaries can launch a variety of model-related attacks against both image and text models in a data-free and black-box manner, achieving comparable performance to baseline methods that have access to the target models' training data and parameters in a white-box manner. This research serves as an important early warning to the community about the potential risks associated with generative AI-powered attacks on deep learning models.
The A.I. Will See You Now
Artificial intelligence is coming to a doctor's office near you--if it isn't already there, working in an administrative role. Are you ready for generative A.I. to help your doctor diagnose you? Is your doctor ready to listen--with the necessary mix of humility and skepticism? Subscribe to Slate Plus to access ad-free listening to the whole What Next family and all your favorite Slate podcasts. Subscribe today on Apple Podcasts by clicking "Try Free" at the top of our show page.
Review for NeurIPS paper: Sample-Efficient Optimization in the Latent Space of Deep Generative Models via Weighted Retraining
Weaknesses: Cons: In general I found the method section ok, however some important parts are missing and need to be addressed. "Fit objective model h" (pseudo algo line 6) What is h and how is it fitted. You mention a gaussian process for the Zinc dataset - why is that model appropriate and how well does it actually fit the true objective function? "suggest new latent z based on h" (pseudo algo line 6) How do you find new latent space samples? Some of this information can likely be found in the refs or in the appendix however this information (in my opinion) really needs to be explained and self-contained in the main paper It would strengthen the paper a lot if one more real world example were included in the experimental results (currently two toy tasks, one real world dataset).
Review for NeurIPS paper: Sample-Efficient Optimization in the Latent Space of Deep Generative Models via Weighted Retraining
This paper had 4 qualified reviewers, 3 of whom recommended acceptance and one who gave a 4 (updated from a 3 post-rebuttal). I think some of the complaints raised by the low review are technically correct, but I also don't feel that they are super-relevant to evaluating the scientific significance of this work. I think the numerical score they gave was too low given the text of their review). Given all of that, I am recommending acceptance for this paper.
Review for NeurIPS paper: VAEM: a Deep Generative Model for Heterogeneous Mixed Type Data
This naturally brings up the question of whether careful tuning of the scaling coefficient for the likelihood function of each dimension could ease the aforementioned optimization difficulties. The "VAE-adaptive" baseline seems to be a data-dependent attempt at this, but I'm not convinced that a single minibatch is sufficient for computing the coefficients for each data type (as described in Appendix C.1.2). In particular, it'd be interesting to see if VAEM would outperform a (possibly hierarchical) VAE with more carefully tuned scaling factors for each dimension to rule out the possibility that the poor performance of vanilla VAE baselines is simply due to hyperparameter tuning.
Review for NeurIPS paper: VAEM: a Deep Generative Model for Heterogeneous Mixed Type Data
The paper proposes modelling vectors with dimensions having different types (real-valued and categorical) using a two-stage VAE approach. First, a VAE with a 1D latent is trained once for each input dimension to standardize the data. Then a "dependency" VAE is trained on top of the resulting latents to capture the dependence between them. Pros: -The approach is interesting and novel -The idea is simple and seems effective, so might be widely adopted -The paper is well written -VAEM outperforms sensible baselines at generative modelling and a sequential information acquisition task Cons: -It is not explained why the two-stage training approach is a good idea. The fact that joint training tends to perform less well than two-stage training, as reported in the rebuttal, is an important observation that should be discussed and, ideally, explained in the paper.
Kernels of Selfhood: GPT-4o shows humanlike patterns of cognitive consistency moderated by free choice
Lehr, Steven A., Saichandran, Ketan S., Harmon-Jones, Eddie, Vitali, Nykko, Banaji, Mahzarin R.
Large Language Models (LLMs) have surprised the scientific community and even their creators by exhibiting emergent abilities once thought to be uniquely human, such as advanced cognition and reasoning (1-6), although the full extent of these accomplishments is debated (3, 7-10). These capabilities align with the rational and deliberative aspects of human nature, but humans are not purely rational creatures, and it is unclear whether LLMs will mimic a broader spectrum of human psychological tendencies. Here we test whether OpenAI's GPT-4o replicates behaviors associated with the human tendency toward cognitive consistency as well as human sensitivity to choice, characterized by greater attitude shifts when the behaviors inducing these changes are freely chosen. Decades of research demonstrate that humans will irrationally twist their attitudes to align with behaviors they were induced to perform. For example, consider an individual who opposes single-payer healthcare, but volunteers, in response to a request for help, to craft an argument in favor of the policy. Rationally, this individual's attitude toward single-payer healthcare should not move in a more supportive direction; they should be able to discriminate between their genuine attitude and the opposing one that they have articulated only to be helpful.
Visual Generation Without Guidance
Chen, Huayu, Jiang, Kai, Zheng, Kaiwen, Chen, Jianfei, Su, Hang, Zhu, Jun
Classifier-Free Guidance (CFG) has been a default technique in various visual generative models, yet it requires inference from both conditional and unconditional models during sampling. We propose to build visual models that are free from guided sampling. The resulting algorithm, Guidance-Free Training (GFT), matches the performance of CFG while reducing sampling to a single model, halving the computational cost. Unlike previous distillation-based approaches that rely on pretrained CFG networks, GFT enables training directly from scratch. GFT is simple to implement. It retains the same maximum likelihood objective as CFG and differs mainly in the parameterization of conditional models. Implementing GFT requires only minimal modifications to existing codebases, as most design choices and hyperparameters are directly inherited from CFG. Our extensive experiments across five distinct visual models demonstrate the effectiveness and versatility of GFT. Across domains of diffusion, autoregressive, and masked-prediction modeling, GFT consistently achieves comparable or even lower FID scores, with similar diversity-fidelity trade-offs compared with CFG baselines, all while being guidance-free. Code will be available at https://github.com/thu-ml/GFT.
Blissful (A)Ignorance: People form overly positive impressions of others based on their written messages, despite wide-scale adoption of Generative AI
As the use of Generative AI (GenAI) tools becomes more prevalent in interpersonal communication, understanding their impact on social perceptions is crucial. According to signaling theory, GenAI may undermine the credibility of social signals conveyed in writing, since it reduces the cost of writing and makes it hard to verify the authenticity of messages. Using a pre-registered large-scale online experiment (N = 647; Prolific), featuring scenarios in a range of communication contexts (personal vs. professional; close others vs. strangers), we explored how senders' use of GenAI influenced recipients' impressions of senders, both when GenAI use was known or uncertain. Consistent with past work, we found strong negative effects on social impressions when disclosing that a message was AI-generated, compared to when the same message was human-written. However, under the more realistic condition when potential GenAI use was not explicitly highlighted, recipients did not exhibit any skepticism towards senders, and these "uninformed" impressions were virtually indistinguishable from those of fully human-written messages. Even when we highlighted the potential (but uncertain) use of GenAI, recipients formed overly positive impressions. These results are especially striking given that 46% of our sample admitted having used such tools for writing messages, just within the past two weeks. Our findings put past work in a new light: While social judgments can be substantially affected when GenAI use is explicitly disclosed, this information may not be readily available in more realistic communication settings, making recipients blissfully ignorant about others' potential use of GenAI.