password
Are bank text codes enough to protect you?
This material may not be published, broadcast, rewritten, or redistributed. Quotes displayed in real-time or delayed by at least 15 minutes. Market data provided by Factset . Powered and implemented by FactSet Digital Solutions . Mutual Fund and ETF data provided by LSEG . You have a credit freeze; it still isn't enough Turning 65? Month-by-month plan to protect yourself China's AI growth is about'economic and political leverage,' Rep Hinson says Expert warns'red-green-green alliance' helping China gain AI edge AI's impact on jobs, economy debated as youth express growing fears Jury dismisses Elon Musk's lawsuit against OpenAI and Sam Altman China does not'innovate,' they'replicate': Former DHS spokeswoman Trump to press Xi to'open up' China as tech CEOs join key summit Smart and Safe Tech Are bank text codes enough to protect you?
Stress-Testing Capability Elicitation With Password-Locked Models
To determine the safety of large language models (LLMs), AI developers must be able to assess their dangerous capabilities. But simple prompting strategies often fail to elicit an LLM's full capabilities. One way to elicit capabilities more robustly is to fine-tune the LLM to complete the task. In this paper, we investigate the conditions under which fine-tuning-based elicitation suffices to elicit capabilities. To do this, we introduce password-locked models, LLMs fine-tuned such that some of their capabilities are deliberately hidden.
Give Your Problems (and Passwords) to Moltbot, Then Watch It Go
A viral new virtual assistant formerly known as Clawdbot is complex and brings security risks--but some early adopters say it feels like the future. Dan Peguine, a tech entrepreneur and marketing consultant based in Lisbon, lets a precocious, lobster-themed AI assistant called Moltbot run much of his life. Peguine, a self-professed early adopter and trendspotter, discovered Moltbot several weeks ago--back then it was Clawdbot--after discussing a vibe-coding side project with friends on WhatsApp. He installed it on his computer, connected it to numerous apps and online accounts, including Google Apps, and was astonished by how capable it was. "I tried it, got interested, then got really obsessed," Peguine says.