Goto

Collaborating Authors

 hardware requirement


Comparison of Large Language Models for Deployment Requirements

arXiv.org Artificial Intelligence

--Large Language Models (LLMs), such as Generative Pre-trained Transformers (GPTs) are revolutionizing the generation of human-like text, producing contextually relevant and syntactically correct content. Despite challenges like biases and hallucinations, these Artificial Intelligence (AI) models excel in tasks, such as content creation, translation, and code generation. Fine-tuning and novel architectures, such as Mixture of Experts (MoE), address these issues. Over the past two years, numerous open-source foundational and fine-tuned models have been introduced, complicating the selection of the optimal LLM for researchers and companies regarding licensing and hardware requirements. T o navigate the rapidly evolving LLM landscape and facilitate LLM selection, we present a comparative list of foundational and domain-specific models, focusing on features, such as release year, licensing, and hardware requirements. This list is published on GitLab and will be continuously updated.


Petition demands Windows 10 live on past 2025

PCWorld

Pressure is growing on Microsoft to extend support for Windows 10, which is still the most widely used version of the operating system. The Public Interest Research Group (PIRG) is demanding that the Redmond software company continue to provide security updates for Windows 10 beyond October 2025. PIRG has now launched the online petition Tell Microsoft: Don't leave millions of computers behind if you want to add your voice to the effort. The Public Interest Research Group (PIRG) is an association of US and Canadian non-profit organizations that, among other things, work for consumer protection. The PIRG now points out that with the end of support for Windows 10 in 2025, millions of Windows computers will effectively become electronic waste.


How Deep Learning is helping to save human lives at a container terminal

#artificialintelligence

The Port of Montevideo is located in the capital city of Montevideo, on the banks of the "Río de la Plata" river. Due to its strategic location between the Atlantic Ocean and the "Uruguay" river, it is considered one of the main routes of cargo mobilization for Uruguay and MERCOSUR . Over the past decades, it has established itself as a multipurpose port handling: containers, bulk, fishing boats, cruises, passenger transport, cars, and general cargo. MERCOSUR or officially the Southern Common Market is a commercial and political bloc established in 1991 by several South American countries. Moreover, only two companies concentrate all-cargo operations in this port: the company of Belgian origin Katoen Natie and the Chilean and Canadian capital company Montecon.


How do Self-Driving Cars Work?

#artificialintelligence

Google's self-driving cars program, Waymo, has recorded the most successful run in the autonomous vehicles category, until now. More is expected in the AV domain in the coming years, something to wait and watch out for.


Analyzing Algorithmic Complexity

#artificialintelligence

Machine learning (ML) and complexity are not synonymous. In fact, machine learning is a way to wade through complexities posed by the data. But, what if we try to examine the level of algorithmic complexity before applying the ML algorithms? Most of the ML algorithms are definite answers to solve complexities and arrive at an optimal solution. Yet, some algorithms have complexities that can make us contemplate the level of expertise to examine and use them; and to apply them for getting the best results.


[D] GPT-J for text generation: hardware requirements

#artificialintelligence

Video games have, for several decades, only loaded the assets they needed to use, as they used them, due to tiny RAM amounts relative to the asset sizes yea. The original Playstation is a good example with its 2MB of RAM and 800MB discs full of assets. Enabling the ability to do something at all for many, is often more important than being able to do it extremely quickly for a few. Modern games get around this in part, by having the graphics driver handle all the asset management in and out of VRAM, allowing the driver to swap the least recently used assets out to RAM until they are needed, and then swap them back to VRAM, without the executing program having to know it happened. Basically treating all of VRAM and virtual VRAM hosted in RAM as one big asymmetrically speedy storage medium.


Hardware Requirements for Artificial Intelligence

#artificialintelligence

So you're planning to launch an AI project or startup, or maybe adding an AI-based team to an existing organization. Now, if you want to run machine learning, deep learning, computer vision or other AI-driven research project you can't just buy any off-the-rack computer from an office superstore; you need hardware that can handle your workload. This leaves you with an important decision: build, buy, or rent. In this context, "renting" would generally refer to using cloud compute resources, which tend to be more expensive in the long run, but may be a good choice in some cases (great for startups or when you're not planning on scaling in a big way). This article, however, is concerned with balancing hardware and computational requirements and is based on the assumption that you will be spec'ing custom AI hardware or building an AI computer yourself.


A Sample-Based Training Method for Distantly Supervised Relation Extraction with Pre-Trained Transformers

arXiv.org Artificial Intelligence

Multiple instance learning (MIL) has become the standard learning paradigm for distantly supervised relation extraction (DSRE). However, due to relation extraction being performed at bag level, MIL has significant hardware requirements for training when coupled with large sentence encoders such as deep transformer neural networks. In this paper, we propose a novel sampling method for DSRE that relaxes these hardware requirements. In the proposed method, we limit the number of sentences in a batch by randomly sampling sentences from the bags in the batch. However, this comes at the cost of losing valid sentences from bags. To alleviate the issues caused by random sampling, we use an ensemble of trained models for prediction. We demonstrate the effectiveness of our approach by using our proposed learning setting to fine-tuning BERT on the widely NYT dataset. Our approach significantly outperforms previous state-of-the-art methods in terms of AUC and P@N metrics.


On sparse connectivity, adversarial robustness, and a novel model of the artificial neuron

arXiv.org Machine Learning

Deep neural networks have achieved human-level accuracy on almost all perceptual benchmarks. It is interesting that these advances were made using two ideas that are decades old: (a) an artificial neuron based on a linear summator and (b) SGD training. However, there are important metrics beyond accuracy: computational efficiency and stability against adversarial perturbations. In this paper, we propose two closely connected methods to improve these metrics on contour recognition tasks: (a) a novel model of an artificial neuron, a "strong neuron," with low hardware requirements and inherent robustness against adversarial perturbations and (b) a novel constructive training algorithm that generates sparse networks with $O(1)$ connections per neuron. We demonstrate the feasibility of our approach through experiments on SVHN and GTSRB benchmarks. We achieved an impressive 10x-100x reduction in operations count (10x when compared with other sparsification approaches, 100x when compared with dense networks) and a substantial reduction in hardware requirements (8-bit fixed-point math was used) with no reduction in model accuracy. Superior stability against adversarial perturbations (exceeding that of adversarial training) was achieved without any counteradversarial measures, relying on the robustness of strong neurons alone. We also proved that constituent blocks of our strong neuron are the only activation functions with perfect stability against adversarial attacks.


The Design Automation Conference to Showcase an AI Hardware Pavilion, Broadening the 2020 Exhibition Lineup

#artificialintelligence

The new Pavilion invites AI hardware innovators to exhibit at DAC in a turnkey solution package SAN FRANCISCO, CA. – February 13, 2020 –The Design Automation Conference (DAC), the premier conference devoted to the design and automation of electronic circuits and systems, will this year showcase a dedicated Pavilion centered on the artificial intelligence (AI) hardware ecosystem. AI hardware is driving the largest wave of chip-design activity in decades. Understanding and harnessing the enormous computational and application potential of AI is fertile ground for new ideas and startup providers. Converting these ideas into working hardware circuits and systems is the core value of design automation, and the major technical focus of 57th DAC. The 57th DAC will be held at Moscone West Center in San Francisco, CA, from July 19-23, 2020.