Augmenting Polymer Datasets by Iterative Rearrangement


One of the biggest obstacles to successful polymer property prediction is the lack of an effective representation that accurately captures the sequence of repeat units in a polymer. Motivated by the successes of data augmentation in computer vision and natural language processing, we explore augmenting polymer data by rearranging the molecular representation while preserving the correct connectivity, revealing additional substructural information that is not present in a single representation. We evaluate the effects of this technique on the performance of machine learning models trained on three experimental polymer datasets and compare them to common molecular representations. Data augmentation improves deep learning property prediction performance compared to equivalent (non-augmented) representations. In datasets where the target property is primarily influenced by the polymer sequence rather than experimental parameters, this data augmentation technique provides the molecular embedding with more information to improve property prediction accuracy.
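The idea of rearranging a representation while keeping connectivity intact can be illustrated with a toy sketch. The abstract does not specify the rearrangement procedure, so the example below assumes something much simpler than the authors' method: a polymer is modelled as a periodic list of repeat-unit labels, and each cyclic rotation counts as an alternative representation of the same chain. The names `cyclic_rotations` and `cyclic_bonds` are hypothetical helpers introduced only for this illustration.

```python
from collections import Counter
from typing import List, Sequence, Tuple


def cyclic_rotations(units: List[str]) -> List[Tuple[str, ...]]:
    """Return every distinct cyclic rotation of a repeat-unit sequence.

    Assumption: the chain is periodic, so rotating the starting point
    gives a different written form of the same polymer.
    """
    seen = set()
    rotations = []
    for shift in range(len(units)):
        rotated = tuple(units[shift:] + units[:shift])
        if rotated not in seen:  # skip duplicates such as A-A rotated to A-A
            seen.add(rotated)
            rotations.append(rotated)
    return rotations


def cyclic_bonds(units: Sequence[str]) -> Counter:
    """Count (unit, next_unit) neighbour pairs, wrapping around the end."""
    n = len(units)
    return Counter((units[i], units[(i + 1) % n]) for i in range(n))


# Toy repeat units; real inputs would be molecular fragments.
original = ["A", "B", "C"]
augmented = cyclic_rotations(original)

# Every rotation keeps the same neighbour pairs, i.e. the same connectivity.
same_connectivity = all(
    cyclic_bonds(r) == cyclic_bonds(original) for r in augmented
)
```

In this sketch a single training example becomes three, each exposing a different starting substructure to the model while describing the same chain.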

Towards Deep Learning for Relational Databases


We have grown used to reading about deep learning making headlines with breakthroughs in research domains ranging from vision and image generation to game playing. However, most deep learning methods are still quite distant from everyday business practice. Many companies have successfully transitioned into the data-driven era, collecting large amounts of valuable data as part of their processes. And while most of these companies do use various data analytics tools to extract insights from this data, only rarely do we see them engage actual deep learning methods, unless those methods directly match the business domain (e.g., image data processing companies). In this article, we argue that one of the core reasons for this lack of neural networks in business practice is the gap between the learning representation assumed by virtually all deep learning models and the by far most common data storage representation used in everyday practice: relational databases.

Best NLP Papers -- October 2022


This roundup highlights some interesting NLP papers from October 2022 around language model capabilities. This article's title and TL;DR have been generated with Cohere. NLP is evolving at a rapid pace, and every month we discover new capabilities. Large language models, like those built by Cohere, are being applied to use cases that we couldn't have imagined even just a few months ago.

AI Is Terrible at Detecting Misinformation. It Doesn't Have to Be. - Nautilus


Elon Musk has said he wants to make Twitter "the most accurate source of information in the world." I am not convinced that he means it, but whether he does or not, he's going to have to work on the problem; a lot of advertisers have already made that pretty clear. If he does nothing, they are out. And Musk has continued to tweet in ways that seem to indicate that he is generally on board with some kind of content moderation. The tech journalist Kara Swisher has speculated that Musk wants AI to help; on Twitter she wrote, rather plausibly, that Musk "is hoping to build an AI system that replaces [fired moderators] that will not work well now but will presumably get better."

Digital transformation: The definitive guide to doing digitalization right


Digital transformation is the megatrend driving billions in investment across the corporate world as companies reinvent the way they do business. In this enterprise digital transformation guide, we will address the following topics to help you master the art and science of digitalization. "When digital transformation is done right, it's like a caterpillar turning into a butterfly, but when done wrong, all you have is a really fast caterpillar." Digital transformation is a customer-centric reimagination of the future of an enterprise and a subsequent rethinking of the business model: reshaping the product/service portfolio, restructuring the processes, re-platforming technologies, reskilling the workforce, and instilling a new culture to get to the end goal. That definition of digital transformation packs a lot of punch.

Researchers Win Gordon Bell Special Prize for Models that Track COVID Variants


Members of the GenSLMs team received the Gordon Bell Special Prize for HPC-Based COVID-19 Research at the SC22 conference. Scientists from Argonne National Laboratory and a team of collaborators have won the 2022 ACM Gordon Bell Special Prize for High Performance Computing-Based COVID-19 Research for their method of quickly identifying how a virus evolves. Their work in training large language models (LLMs) to discover variants of SARS-CoV-2 has implications for biology beyond COVID-19. The researchers leveraged Argonne's supercomputing and AI resources to develop and apply LLMs toward tracking how a virus can mutate into more dangerous or more transmissible variants, known as variants of concern (VOCs). Existing methods to track VOCs can be slow.

Traffic Forecasting using Deep Neural Networks


Accurate, real-time traffic forecasting plays an important role in intelligent traffic systems and is of great significance for urban traffic planning, traffic management, and traffic control. However, traffic forecasting has always been considered an "open" scientific issue, owing to the constraints imposed by the topological structure of urban road networks and the way traffic changes dynamically over time.

GM's Cruise Seeks Regulatory OK to Test Shuttle With No Steering Wheel

WSJD - Technology

General Motors Co.'s driverless-car unit has requested approval from California regulators to begin public testing of a shuttle that has no steering wheel or manual controls, showing the auto maker's determination to make progress on autonomous vehicles as rivals step back. GM's Cruise LLC division in August submitted an application to the California Department of Motor Vehicles, requesting permission to test its Origin driverless vehicle on San Francisco streets, according to a copy of the document obtained through a public records request. The California DMV began reviewing the application in late October, according to emails reviewed by The Wall Street Journal as part of the request. In its application, Cruise said it would begin test runs of the electric Origin in a confined area of San Francisco during limited hours and gradually expand over time. While GM and Cruise push ahead on plans to commercialize robotaxis, other players have pulled back, expressing doubts about whether the technology can support a viable business any time soon.

Kamikaze Killers: Iran's Drones Fly with Western Technology

Der Spiegel International

The same applies to typical aviation instruments, such as the gyro stabilizer, also known as a gyroscope, which enables aircraft to orient themselves in the air. The mechanical gyroscope that CAR found in a Mohajer 6 drone resembles one previously documented by the group in a Qasef 1 drone. That model is similar to the Shahed 136 and 131 and also belongs to the loitering munitions category, known casually as kamikaze drones. "The data from the report seems valid and very detailed," Ulrike Franke, a drone expert at the European Council on Foreign Relations in London, told DER SPIEGEL. At the same time, the experts also discovered differences from older models. According to the report, some of the drones found in Ukraine had been fitted with more modern technology, such as a software-defined radio that was in a Shahed 136.