browse
I Let Google's 'Auto Browse' AI Agent Take Over Chrome. It Didn't Quite Click
I Let Google's'Auto Browse' AI Agent Take Over Chrome. Auto Browse can shop for clothes, plan a trip, and buy tickets for you. So, while testing Google's new "Auto Browse" feature for Chrome, I was filled with a strange sense of loss as I watched the AI agent open browser tabs and attempt to complete digital tasks with automated clicks. Sure, I felt some loss of control as the bot tapped away on my laptop screen. But also a kind of preemptive nostalgia for how the internet currently works, flaws and all, considering Google's plans to fundamentally alter the user experience.
- North America > United States > California > San Francisco County > San Francisco (0.04)
- Europe > Slovakia (0.04)
- Europe > Czechia (0.04)
- Information Technology > Communications (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.71)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Google's New Chrome 'Auto Browse' Agent Attempts to Roam the Web Without You
Google's latest addition to its Chrome browser puts generative AI behind the wheel and you in the passenger seat. Google debuted a new "Auto Browse" feature for Chrome on Wednesday. The tool, powered by Google's current Gemini 3 generative AI model, is an AI agent designed to take over your Chrome browser to help complete online tasks like booking flights, finding apartments, and filing expenses. The release of Auto Browse is part of Google's continued integration of AI features into Chrome. Last year, Google dropped the "Gemini in Chrome" mode to answer questions about what's on web pages and synthesize details from multiple open tabs.
- North America > United States > California (0.15)
- South America > Venezuela (0.05)
- Europe > Slovakia (0.05)
- Europe > Czechia (0.05)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.57)
Browse a 3D map of the world's 2.75 billion buildings
GlobalBuildingAtlas includes almost every habitable structure on Earth. Breakthroughs, discoveries, and DIY tips sent every weekday. Researchers in Germany recently accomplished a truly audacious feat of cartography . Using a diverse array of datasets, a team at the Technical University of Munich released GlobalBuildingAtlas, the first high-resolution mapping model featuring every structure in the world at a given point in time. With over 2.75 billion buildings detailed in the map, the endeavor will help create accurate analyses of urban structures, volume calculations, and infrastructure planning around the planet.
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
Liu, Junteng, Li, Yunji, Zhang, Chi, Li, Jingyang, Chen, Aili, Ji, Ke, Cheng, Weiyu, Wu, Zijia, Du, Chengyu, Xu, Qidi, Song, Jiayuan, Zhu, Zhengmao, Chen, Wenhu, Zhao, Pengyu, He, Junxian
The paradigm of Large Language Models (LLMs) has increasingly shifted toward agentic applications, where web browsing capabilities are fundamental for retrieving information from diverse online sources. However, existing open-source web agents either demonstrate limited information-seeking abilities on complex tasks or lack transparent implementations. In this work, we identify that the key challenge lies in the scarcity of challenging data for information seeking. To address this limitation, we introduce WebExplorer: a systematic data generation approach using model-based exploration and iterative, long-to-short query evolution. This method creates challenging query-answer pairs that require multi-step reasoning and complex web navigation. By leveraging our curated high-quality dataset, we successfully develop advanced web agent WebExplorer-8B through supervised fine-tuning followed by reinforcement learning. Our model supports 128K context length and up to 100 tool calling turns, enabling long-horizon problem solving. Across diverse information-seeking benchmarks, WebExplorer-8B achieves the state-of-the-art performance at its scale. Notably, as an 8B-sized model, WebExplorer-8B is able to effectively search over an average of 16 turns after RL training, achieving higher accuracy than WebSailor-72B on BrowseComp-en/zh and attaining the best performance among models up to 100B parameters on WebWalkerQA and FRAMES. Beyond these information-seeking tasks, our model also achieves strong generalization on the HLE benchmark even though it is only trained on knowledge-intensive QA data. These results highlight our approach as a practical path toward long-horizon web agents.
- Europe > Austria > Vienna (0.14)
- South America > Brazil (0.04)
- North America > United States > Maryland (0.04)
- North America > United States > District of Columbia > Washington (0.04)
Netflix overhauls its TV app with a fresh UI and responsive recommendations
Netflix is giving its TV user interface a major overhaul. Alongside a fresh, cleaner look, you'll see recommendations that adapt to your activity as Netflix tries to better gauge what you might be in the mood to watch. The company plans to roll out the update over the coming weeks and months. It seems that the aim here, as has long been a goal for Netflix, is to help you find something you want to watch faster. That way, you won't spend an eternity scrolling through the various options while struggling to figure out what to start streaming.
- Oceania > New Zealand (0.05)
- Oceania > Australia (0.05)
- Media > Television (1.00)
- Media > Film (1.00)
- Information Technology > Services (1.00)
This new Windows feature offers gaming advice while you play
If you've ever become "stuck" in a game, the new Microsoft Edge Game Assist for Windows offers a pretty convenient way to help yourself out: Simply look up a hint while playing. Microsoft debuted this new system as a beta during the holidays last year. Now, it's part of Microsoft Edge itself, provided that you have the current version, Edge 132 or higher. For those of you who want a "pure" gaming experience, don't worry. Edge Game Assist doesn't just launch itself, and it actually seems rather helpful when you invoke it.
- North America > United States > Indiana (0.07)
- Europe > Ukraine > Kyiv Oblast > Chernobyl (0.06)
Microsoft's Copilot AI gets a voice and the ability to see websites you browse
Beyond debuting new features for Copilot AI PCs and Windows 11's 2024 update, Microsoft is also giving its Copilot AI a makeover on the web, mobile and desktop. That includes a slightly friendlier interface wherever you access it, along with new capabilities like Copilot Voice, which allows you to talk conversationally with the AI assistant. Ultimately, Microsoft is aiming for Copilot to be seen as more than just a party trick for generative AI search and image creation -- it's trying to make it a core part of your daily workflow. That starts with a cleaner and simpler UI that makes Copilot look different than a boring old search engine. You'll also be able to access Copilot from within Whatsapp, which could be useful if you want to avoid Meta's AI assistant.
- North America > United States (0.21)
- Oceania > New Zealand (0.07)
- Oceania > Australia (0.07)
- North America > Canada (0.07)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.36)
Who makes money when AI reads the internet for us?
Last week, The Browser Company, a startup that makes the Arc web browser, released a slick new iPhone app called Arc Search. Instead of displaying links, its brand new "Browse for Me" feature reads the first handful of pages and summarizes them into a single, custom-built, Arc-formatted web page using large language models from OpenAI and others. If a user does click through to any of the actual pages, Arc Search blocks ads, cookies and trackers by default. Arc's efforts to reimagine web browsing have received near-universal acclaim. But over the last few days, "Browse for Me" earned The Browser Company its first online backlash.
- Information Technology > Communications (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.35)
The Arc Browser is getting new AI-powered features that try to browse the web for you
Earlier this week, the team behind the Arc browser for Mac (and recently Windows) released a brand-new iPhone app called Arc Search. As you might expect, it's infused with AI to power an experience where the app "browses for you"--pulling together a variety of sources of info across the internet to make a custom webpage to answer whatever questions you throw at it. That's just one part of what The Browser Company is calling Act 2 of Arc, and the company gave details on three other major new features its bringing to the browser over the coming weeks and months. The connective tissue of all these updates is that Arc is trying to blur the lines between a browser, search engine and website -- the company wants to combine them all to make the internet a bit more useful to end users. In a promo video released today, various people from The Browser Company excitedly discuss a browser that can browse for you (an admittedly handy idea). The Arc Search app showed off one implementation of that idea, and the next is a feature that arrives today called Instant Links.
What to expect from the coming year in AI
I also had plenty of time to reflect on the past year. There are so many more of you reading The Algorithm than when we first started this newsletter, and for that I am eternally grateful. Thank you for joining me on this wild AI ride. So what can we expect in 2024? All signs point to there being immense pressure on AI companies to show that generative AI can make money and that Silicon Valley can produce the "killer app" for AI. Big Tech, generative AI's biggest cheerleaders, is betting big on customized chatbots, which will allow anyone to become a generative-AI app engineer, with no coding skills needed.
- North America > United States > California (0.25)
- Asia > China (0.05)