 vision


MLP-Mixer: An all-MLP Architecture for Vision

Neural Information Processing Systems

Convolutional Neural Networks (CNNs) are the go-to model for computer vision. Recently, attention-based networks, such as the Vision Transformer, have also become popular. In this paper we show that while convolutions and attention are both sufficient for good performance, neither of them is necessary. We present MLP-Mixer, an architecture based exclusively on multi-layer perceptrons (MLPs). MLP-Mixer contains two types of layers: one with MLPs applied independently to image patches (i.e.


The Humane Ai Pin Will Become E-Waste Next Week

WIRED

The story of the infamous Humane Ai Pin is coming to an end. This week, the company announced that HP--known for its computers and printers that always seem to need a refill--will acquire several assets from Humane in a $116 million deal expected to close at the end of the month. HP will get more than 300 patents and patent applications, a few Humane employees--including founders Imran Chaudhri and Bethany Bongiorno--and Humane's Cosmos operating system. Late in 2024, Humane looked to license this operating system so that third parties could inject the AI voice assistant into other products, like cars. Humane became Silicon Valley's "next big thing" in late 2023 when it unveiled its AI wearable, equipped with a ChatGPT-powered assistant and a laser-projected display, that promised to replace your smartphone.


Apple Intelligence is headed to the Vision Pro in April, dev beta available today

Engadget

The rumors are true: Apple confirmed today that the Vision Pro will get Apple Intelligence features in April with the arrival of visionOS 2.4. A developer beta is also rolling out today for the less patient. As we've seen on other devices, Apple is starting out the Vision Pro's AI rollout with basic features. Those include Writing Tools, which can help you summarize, rewrite and proofread text, as well as generate text with ChatGPT; Image Playground for creating AI imagery; and Genmoji for building custom AI-generated emojis and stickers. It really was only a matter of when Apple would bring Apple Intelligence to the Vision Pro.


Driving into the future

MIT Technology Review

But the real secret of the TR10 is what we leave off the list. It is hard to think of another industry, aside from maybe entertainment, that has as much of a hype machine behind it as tech does. Which means that being too conservative is rarely the wrong call. Last year, for example, we were going to include robotaxis on the TR10. Autonomous vehicles have been around for years, but 2023 seemed like a real breakthrough moment; both Cruise and Waymo were ferrying paying customers around various cities, with big expansion plans on the horizon. And then, last fall, after a series of mishaps (including an incident when a pedestrian was caught under a vehicle and dragged), Cruise pulled its entire fleet of robotaxis from service.


Meta Connect 2024: The cheaper Quest 3S, AI, smart glasses and everything else to expect

Engadget

It used to go by at least two different names -- Oculus Connect and then Facebook Connect -- but whatever the moniker, Meta's fall event is still a big showcase for the company's latest and greatest achievements in the virtual reality and mixed reality space. Much like last year, we can likely predict the biggest news coming out of Meta Connect 2024 with just two acronyms: AI and AR. Like every other big tech firm this year, Meta will be desperate to demonstrate how it plans to stay relevant in a future powered by AI. And now that we're seven months beyond the launch of Apple's Vision Pro, which arrived alongside a short-lived spike in interest in augmented reality (AR), Meta CEO Mark Zuckerberg is likely eager to show off his own plans to make AR a reality. While Zuckerberg isn't as hot on the metaverse as he was when he renamed his company, the union of AI and AR is one way he can still make the dream of persistent virtual worlds come true.


The Apple Vision Pro goes on sale in the US on February 2 for $3,499

Engadget

Those who've been yearning for a chance to try the Apple Vision Pro headset and have the cash to spare won't need to wait much longer to snap one up. The company says the hotly anticipated device will arrive in the US on February 2. Pre-orders for the $3,499 mixed reality headset will open on January 19. The device will be available at all US Apple Store locations as well as through the company's web store. Those who require vision correction will need to snap up Zeiss optical inserts and attach them to the headset magnetically (Vision Pro doesn't work with glasses). Readers will cost $99, while prescription lenses will set you back $149.


Xreal Air 2 Ultra is an affordable alternative to the Apple Vision Pro, apparently

Engadget

Xreal, formerly Nreal, hosted one of the busiest booths at CES in recent years, so it's no surprise that the company is back with new AR glasses for this year's show -- especially given the much anticipated launch of Apple's Vision Pro. Following the Nreal Light from 2019, the brand new Xreal Air 2 Ultra finally brings back 6DoF (six degrees of freedom) spatial tracking and hand tracking, along with a wider 52-degree FOV (field of view) and a 42-pixel-per-degree sharpness within an 80-gram titanium package. The firm goes as far as claiming that these specs make the $699 Air 2 Ultra a compelling alternative to the $3,499 Vision Pro. Unlike the standalone mixed reality headsets, the Air 2 series of glasses needs to be powered by an external computing unit, such as a smartphone, a computer or Xreal's Beam module, via USB-C. While the earlier Air 2 Pro and Air 2 were positioned more as personal display wearables, the Air 2 Ultra emphasizes its 6DoF spatial computing capabilities, meaning virtual objects can be mapped to the real world while you walk around.


Springer has released 65 Machine Learning and Data books for free

#artificialintelligence

Springer has released hundreds of free books on a wide range of topics to the general public. The list, which includes 408 books in total, covers a wide range of scientific and technological topics. In order to save you some time, I have created one list of all the books (65 in number) that are relevant to the data and Machine Learning field. Among the books, you will find those dealing with the mathematical side of the domain (Algebra, Statistics, and more), along with more advanced books on Deep Learning and other advanced topics. You can also find some good books on various programming languages such as Python, R, MATLAB, etc.


New Approaches To 3D Vision - FoundersList

#artificialintelligence

This launch event for the Royal Society volume New Approaches to 3D Vision explores how AI, animals, & humans see & navigate the 3D world. In Artificial Intelligence (AI), 3D vision is enabling autonomous cars & robots to freely navigate the world & helping AI to solve fundamental scientific questions like protein folding. In animals, brain recordings from freely moving animals are enabling us to understand how animals process & navigate through space. In humans, virtual reality, augmented reality, & 3D cinema are all having a transformative effect on our 3D visual experience. In turn, these innovations are revolutionizing our understanding of 3D vision & navigation.


Survey of Neural Radiance Field in 3D Vision

#artificialintelligence

Neural Radiance Field (NeRF), a new novel view synthesis method with implicit scene representation, has taken the field of Computer Vision by storm. As a novel view synthesis and 3D reconstruction method, NeRF models find applications in robotics, urban mapping, autonomous navigation, virtual reality/augmented reality, and more. Since the original paper by Mildenhall et al., more than 250 preprints have been published, with more than 100 eventually being accepted in tier one Computer Vision Conferences. Given NeRF's popularity and the current interest in this research area, ... believe it necessary to compile a comprehensive survey of NeRF papers from the past two years ... organized into both architecture-based and application-based taxonomies.