Goto

Collaborating Authors

 recipe


Gemini for Google Home will no longer freak out if you ask it how to make a margarita

Engadget

Google has updated Gemini for Home so that it no longer acts like a strict parent when you ask it for cocktail recipes. In the past, you may have encountered a message that says I cannot provide recipes for alcoholic beverages when you ask the AI assistant for a margarita recipe on Google smart home devices, such as the Nest Hub . Now, Google has updated its safeguards to prevent adult users from encountering filters meant for younger ones. Adults will now experience improved availability for general queries, including recipes for age-gated beverages, the company said in the Google Home support page . If Gemini still isn't responding when you ask it for instructions on how to make a cocktail, you may have to check you Parental Control settings and your Gemini for Home response filter settings in the Google Home app.


ProteinJEPA: Latent prediction complements protein language models

arXiv.org Machine Learning

Protein language models are trained primarily with masked language modeling (MLM), which predicts amino-acid identities at masked positions. We ask whether latent-space prediction can complement these token-level objectives under matched wall-clock budget. Across pretrained and random-init protein sequence encoders at 35--150M parameters, we find that the best protein-JEPA design is not all-position latent prediction but a variant: predicting latent targets only at masked positions, and retaining the MLM cross-entropy. We call this recipe masked-position MLM+JEPA. On a 16-task downstream suite (15 frozen linear probes plus SCOPe-40 zero-shot fold retrieval), under matched wall-clock budgets, this recipe wins more tasks than it loses against MLM-only continuation: 10 wins / 3 losses / 3 ties (hereafter W/L/T) on pretrained ESM2-35M, 11/2/3 on ESM2-150M while results in pretraining from scratch are mixed (6/8/2). Gains are seen for multiple models on 11 of 16 tasks, including stability, \b{eta}β\b{eta}-lactamase fitness, variant effect, intrinsic disorder, remote homology, enzyme classification, and SCOPe-40 fold retrieval. Tasks with more losses than wins are Fluorescence (TAPE) and Peptide-HLA Binding. All-position MLM+JEPA matches MLM-only overall but does not reproduce the masked-position gains. JEPA-only (no MLM) collapses in nearly every experiment. We conclude that JEPA, when combined with MLM, is competitive and can outperform pure MLM in pretraining and continued training, even under matched wall-clock budgets.


Unleashing the Power of Randomization in Auditing Differentially Private ML

Neural Information Processing Systems

We present a rigorous methodology for auditing differentially private machine learning algorithms by adding multiple carefully designed examples called canaries. We take a first principles approach based on three key components. First, we introduce Lifted Differential Privacy (LiDP) which expands the definition of differential privacy to handle randomized datasets. This gives us the freedom to design randomized canaries. Second, we audit LiDP by trying to distinguish between the model trained with K canaries versus K 1 canaries in the dataset, leaving one canary out. By drawing the canaries i.i.d., LiDP can leverage the symmetry in the design and reuse each privately trained model to run multiple statistical tests, one for each canary. Third, we introduce novel confidence intervals that take advantage of the multiple test statistics by adapting to the empirical higher-order correlations. Together, this new recipe demonstrates significant improvements in sample complexity, both theoretically and empirically, using synthetic and real data. Further, recent advances in designing stronger canaries can be readily incorporated into the new framework.


The science of hosting the perfect dinner party

Popular Science

You may be using the wrong plates. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. The ideal dinner party size is somewhere between five and eight guests. Breakthroughs, discoveries, and DIY tips sent six days a week. You don't have to be Martha Stewart to host a successful dinner party .


Equilibrium and non-Equilibrium regimes in the learning of Restricted Boltzmann Machines

Neural Information Processing Systems

Training Restricted Boltzmann Machines (RBMs) has been challenging for a long time due to the difficulty of computing precisely the log-likelihood gradient. Over the past decades, many works have proposed more or less successful training recipes but without studying the crucial quantity of the problem: the mixing time, i.e. the number of Monte Carlo iterations needed to sample new configurations from a model. In this work, we show that this mixing time plays a crucial role in the dynamics and stability of the trained model, and that RBMs operate in two well-defined regimes, namely equilibrium and out-of-equilibrium, depending on the interplay between this mixing time of the model and the number of steps, k, used to approximate the gradient. We further show empirically that this mixing time increases with the learning, which often implies a transition from one regime to another as soon as kbecomes smaller than this time. In particular, we show that using the popular k (persistent) contrastive divergence approaches, with k small, the dynamics of the learned model are extremely slow and often dominated by strong out-of-equilibrium effects. On the contrary, RBMs trained in equilibrium display faster dynamics, and a smooth convergence to dataset-like configurations during the sampling. Finally we discuss how to exploit in practice both regimes depending on the task one aims to fulfill: (i) short k can be used to generate convincing samples in short learning times, (ii) large k (or increasingly large) is needed to learn the correct equilibrium distribution of the RBM. Finally, the existence of these two operational regimes seems to be a general property of energy based models trained via likelihood maximization.


I Used to Love Turning to My Dad for Advice. Guess Who He Wants Me to Ask Now.

Slate

Life My Dad Used to Have All the Answers. It feels like he adopted a robot child, and that child will stop at nothing to wedge a divide between us. Like many twentysomethings, I ask my dad a lot of questions. How do I fix the leak under my sink? What does "federal withholding" mean?


Candy now tastes different. It's not just you.

Popular Science

From recipe changes to aging taste buds, here's why those peanut butter cups don't hit like they used to. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. There's a reason you might remember Hershey's chocolate differently. Breakthroughs, discoveries, and DIY tips sent six days a week. Brad Reese, grandson of Reese's Peanut Butter Cups inventor H.B. Reese, caused a stir this year with his claims that The Hershey Company had changed his grandfather's recipes beyond recognition .


fastml: Guarded Resampling Workflows for Safer Automated Machine Learning in R

arXiv.org Machine Learning

Preprocessing leakage arises when scaling, imputation, or other data-dependent transformations are estimated before resampling, inflating apparent performance while remaining hard to detect. We present fastml, an R package that provides a single-call interface for leakage-aware machine learning through guarded resampling, where preprocessing is re-estimated inside each resample and applied to the corresponding assessment data. The package supports grouped and time-ordered resampling, blocks high-risk configurations, audits recipes for external dependencies, and includes sandboxed execution and integrated model explanation. We evaluate fastml with a Monte Carlo simulation contrasting global and fold-local normalization, a usability comparison with tidymodels under matched specifications, and survival benchmarks across datasets of different sizes. The simulation demonstrates that global preprocessing substantially inflates apparent performance relative to guarded resampling. fastml matched held-out performance obtained with tidymodels while reducing workflow orchestration, and it supported consistent benchmarking of multiple survival model classes through a unified interface.