Computer Vision - Richard Szeliski


As humans, we perceive the three-dimensional structure of the world around us with apparent ease. Think of how vivid the three-dimensional percept is when you look at a vase of flowers sitting on the table next to you. You can tell the shape and translucency of each petal through the subtle patterns of light and shading that play across its surface and effortlessly segment each flower from the background of the scene (Figure 1.1). Looking at a framed group portrait, you can easily count (and name) all of the people in the picture and even guess at their emotions from their facial appearance. Perceptual psychologists have spent decades trying to understand how the visual system works and, even though they can devise optical illusions to tease apart some of its principles (Figure 1.3), a complete solution to this puzzle remains elusive (Marr 1982; Palmer 1999; Livingstone 2008).

Choosing Computer Vision board in 2022


Choosing a platform for Computer Vision on the Edge is difficult. There are dozens of boards on the market. Each one sounds appealing when you read about it, but it often disappoints when you actually try it. I tried to compare many of the cheap boards on the market.

PowerToys update adds OCR and two more free tools


If you use Windows, you want PowerToys. This collection of open-source goodies, guided and published by Microsoft itself, is one of the best free software packages out there, and we can't recommend it enough. That only becomes more true today, as the company publishes an updated version with three brand new tools: the previously-spotted Text Extractor (an Optical Character Recognition tool), a ruler for measuring pixels on your screen, and a tool for quickly inserting little-used accents into text. Text Extractor is probably the most universally-applicable addition here. It's an open-source version of Joseph Finney's paid Text Grab app, now integrated into PowerToys and free for Windows users.

EasyOCR: A Free Open-source OCR That Supports 80+ Languages


EasyOCR is a free, developer-friendly OCR (Optical Character Recognition) library that supports 80+ languages and scripts, including Latin, Chinese, Arabic, and Cyrillic. EasyOCR is written in the Python programming language. It can be installed as a Python package, and it integrates well with Python frameworks such as Django and Flask. An online demo lets you upload images in different formats and test several languages. It comes with trainer models that can be used to train for new languages, dozens of example datasets for model training, user-friendly instructions on how to train custom recognition models, and more. It also supports vertical text and PIL images.
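As a rough illustration of the Python usage described above, here is a minimal sketch. It assumes the `easyocr` package is installed and that `readtext` returns a list of `(bounding box, text, confidence)` entries. The helper names `run_ocr` and `confident_text`, the 0.5 threshold, and the file name in the usage note are hypothetical and not part of EasyOCR.

```python
from typing import List, Sequence, Tuple

# One detection as assumed here: (bounding box corners, text, confidence in [0, 1]).
Detection = Tuple[Sequence[Sequence[float]], str, float]


def run_ocr(image_path: str, languages: Sequence[str] = ("en",)) -> List[Detection]:
    """Run EasyOCR on an image file and return its raw detections."""
    # Imported lazily so the helper below can be used without EasyOCR installed.
    import easyocr

    # Creating a Reader loads the models for the requested language codes;
    # model weights are downloaded the first time they are needed.
    reader = easyocr.Reader(list(languages))
    return reader.readtext(image_path)


def confident_text(
    detections: Sequence[Detection], min_confidence: float = 0.5
) -> List[str]:
    """Keep only the text of detections at or above a confidence threshold."""
    return [text for _bbox, text, conf in detections if conf >= min_confidence]
```

A call such as `confident_text(run_ocr("receipt.png", ["en"]))` would then return the recognized strings for a hypothetical image file. The threshold is only a starting point and would need tuning per use case.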

OpenCV -- Introduction


OpenCV was originally written in C; Python and Java bindings were added later. It is compatible with a wide range of operating systems, including Windows, Linux, OS X, FreeBSD, NetBSD, OpenBSD, and others. OpenCV is completely free to use. Because the library is written in C/C++, it is extremely fast, and it can also be used from Python. It has a small memory footprint, possibly as little as 60–70 MB of RAM. OpenCV is portable and can run on any device that supports C. Before beginning to learn OpenCV, you should be familiar with the Python programming language.

Tutorial: Real-Time Object Detection with DeepStream on Nvidia Jetson AGX Orin


Last month, NVIDIA unleashed the next-generation edge computing hardware device branded as Jetson AGX Orin at GTC. Courtesy of NVIDIA, I was fortunate enough to get a Jetson AGX Orin Developer Kit to evaluate and experiment with it. The Jetson AGX Orin Developer Kit has everything you need to run AI inference at the edge with ultra-low latency and high throughput. As a successor to the most powerful Jetson AGX Xavier, AGX Orin packs a punch. The developer kit comes with a carrier board that makes it easy to connect various peripherals. The Jetson AGX Orin Developer Kit comes with a preview of JetPack SDK 5.0, which is based on the Ubuntu 20.04 root filesystem and Linux Kernel 5.10.