AITopics | teal

teal

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

Chen, Sihan, Zhao, Dan, Ko, Jongwoo, Banbury, Colby, Zhuang, Huiping, Liang, Luming, Chen, Tianyi

arXiv.org Artificial Intelligence | May-27-2025

The growing computational demands of large language models (LLMs) make efficient inference and activation strategies increasingly critical. Recent approaches such as Mixture-of-Experts (MoE) leverage selective activation but require specialized training, whereas training-free sparse activation methods offer broader applicability and superior resource efficiency through their plug-and-play design. However, many existing methods rely solely on hidden state magnitudes to determine activation, resulting in high approximation errors and suboptimal inference accuracy. To address these limitations, we propose WINA (Weight Informed Neuron Activation), a novel, simple, and training-free sparse activation framework that jointly considers hidden state magnitudes and the column-wise $\ell_2$-norms of weight matrices. We show that this leads to a sparsification strategy that obtains optimal approximation error bounds with theoretical guarantees tighter than existing techniques. Empirically, WINA also outperforms state-of-the-art methods (e.g., TEAL) by up to $2.94\%$ in average performance at the same sparsity levels, across a diverse set of LLM architectures and datasets. These results position WINA as a new performance frontier for training-free sparse activation in LLM inference and as a robust baseline for efficient inference. The source code is available at https://github.com/microsoft/wina.
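To make the contrast in this abstract concrete, the following is a minimal sketch of the two selection criteria it describes: keeping inputs by hidden-state magnitude alone, versus scoring each input by its magnitude times the column-wise $\ell_2$-norm of the weight matrix. It is an illustration built only from the abstract's wording, not the authors' implementation. All function names, the top-k selection rule, and the toy values of `x` and `W` are assumptions made here.

```python
import math


def column_l2_norms(W):
    """Return the l2 norm of each column of W (W is a list of rows)."""
    n_cols = len(W[0])
    return [math.sqrt(sum(row[j] ** 2 for row in W)) for j in range(n_cols)]


def top_k_mask(scores, keep):
    """Binary mask that keeps the `keep` highest-scoring positions."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    kept = set(ranked[:keep])
    return [1.0 if i in kept else 0.0 for i in range(len(scores))]


def magnitude_mask(x, keep):
    """Baseline criterion: score each input by |x_i| only."""
    return top_k_mask([abs(v) for v in x], keep)


def weight_informed_mask(x, W, keep):
    """Assumed weight-informed criterion: score by |x_i| * ||W[:, i]||_2."""
    norms = column_l2_norms(W)
    return top_k_mask([abs(v) * c for v, c in zip(x, norms)], keep)


def matvec(W, x):
    """Plain matrix-vector product."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def approx_error(W, x, mask):
    """l2 distance between the dense output W x and the masked output."""
    dense = matvec(W, x)
    sparse = matvec(W, [v * m for v, m in zip(x, mask)])
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(dense, sparse)))


# Toy data (hypothetical): the largest input meets a near-zero weight column.
x = [1.0, 0.9, 0.1]
W = [[0.01, 2.0, 0.0],
     [0.0, 1.0, 0.5]]

m_mag = magnitude_mask(x, keep=1)
m_win = weight_informed_mask(x, W, keep=1)
err_mag = approx_error(W, x, m_mag)
err_win = approx_error(W, x, m_win)
```

In this toy case the magnitude-only mask keeps the first input, whose weight column is almost zero, while the weight-informed mask keeps the second input and yields a smaller output error. A single hand-picked example says nothing about the error bounds or benchmark results claimed in the abstract.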

arxiv preprint arxiv, large language model, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2505.19427

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learning

Shaul-Ariel, Shahar, Weinshall, Daphna

arXiv.org Artificial Intelligence | Jun-30-2024

Continual Learning is an unresolved challenge, whose relevance increases when considering modern applications. Unlike the human brain, trained deep neural networks suffer from a phenomenon called Catastrophic Forgetting, where they progressively lose previously acquired knowledge upon learning new tasks. To mitigate this problem, numerous methods have been developed, many relying on replaying past exemplars during new task training. However, as the memory allocated for replay decreases, the effectiveness of these approaches diminishes. On the other hand, maintaining a large memory for the purpose of replay is inefficient and often impractical. Here we introduce TEAL, a novel approach to populating the memory with exemplars, which can be integrated with various experience-replay methods and significantly enhances their performance on small memory buffers. We show that TEAL improves the average accuracy of the SOTA method XDER as well as ER and ER-ACE on several image recognition benchmarks, with a small memory buffer of 1-3 exemplars per class in the final task. This confirms the hypothesis that when memory is scarce, it is best to prioritize the most typical data.

buffer, exemplar, learning, (14 more...)

arXiv.org Artificial Intelligence

2407.00673

Country:

Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)

Genre: Research Report (1.00)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models

Yang, Zhen, Zhang, Yingxue, Meng, Fandong, Zhou, Jie

arXiv.org Artificial Intelligence | Jan-4-2024

Although Multi-modal Large Language Models (MM-LLMs) have made exciting strides recently, they still struggle to efficiently model the interactions among multi-modal inputs and to generate output in non-textual modalities. In this work, we propose TEAL (Tokenize and Embed ALL), an approach that treats the input from any modality as a token sequence and learns a joint embedding space for all modalities. Specifically, for the input from any modality, TEAL first discretizes it into a token sequence with an off-the-shelf tokenizer and then embeds the token sequence into a joint embedding space with a learnable embedding matrix. MM-LLMs just need to predict the multi-modal tokens autoregressively, as textual LLMs do. Finally, the corresponding de-tokenizer is applied to generate the output in each modality based on the predicted token sequence. With the joint embedding space, TEAL enables frozen LLMs to perform both understanding and generation tasks involving non-textual modalities, such as image and audio. Thus, the textual LLM can just work as an interface and maintain its high performance in textual understanding and generation. Experiments show that TEAL achieves substantial improvements in multi-modal understanding and implements a simple scheme for multi-modal generation.

arxiv preprint arxiv, modality, tokenizer, (13 more...)

arXiv.org Artificial Intelligence

2311.04589

Country:

South America > Colombia > Meta Department > Villavicencio (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)

Genre: Research Report (0.82)

Industry: Education > Curriculum > Subject-Specific Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Teal: Learning-Accelerated Optimization of WAN Traffic Engineering

Xu, Zhiying, Yan, Francis Y., Singh, Rachee, Chiu, Justin T., Rush, Alexander M., Yu, Minlan

arXiv.org Artificial Intelligence | Jul-25-2023

The rapid expansion of global cloud wide-area networks (WANs) has posed a challenge for commercial optimization engines to efficiently solve network traffic engineering (TE) problems at scale. Existing acceleration strategies decompose TE optimization into concurrent subproblems but realize limited parallelism due to an inherent tradeoff between run time and allocation performance. We present Teal, a learning-based TE algorithm that leverages the parallel processing power of GPUs to accelerate TE control. First, Teal designs a flow-centric graph neural network (GNN) to capture WAN connectivity and network flows, learning flow features as inputs to downstream allocation. Second, to reduce the problem scale and make learning tractable, Teal employs a multi-agent reinforcement learning (RL) algorithm to independently allocate each traffic demand while optimizing a central TE objective. Finally, Teal fine-tunes allocations with ADMM (Alternating Direction Method of Multipliers), a highly parallelizable optimization algorithm for reducing constraint violations such as overutilized links. We evaluate Teal using traffic matrices from Microsoft's WAN. On a large WAN topology with >1,700 nodes, Teal generates near-optimal flow allocations while running several orders of magnitude faster than the production optimization engine. Compared with other TE acceleration schemes, Teal satisfies 6–32% more traffic demand and yields 197–625x speedups.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

2210.13763

Country:

North America > United States > New York > New York County > New York City (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Washington > King County > Renton (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry:

Information Technology (1.00)
Telecommunications > Networks (0.87)
Transportation > Ground > Road (0.62)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Democratizing Ethical Assessment of Natural Language Generation Models

Rasekh, Amin, Eisenberg, Ian

arXiv.org Artificial Intelligence | Jul-22-2022

Natural language generation models are computer systems that generate coherent language when prompted with a sequence of words as context. Despite their ubiquity and many beneficial applications, language generation models also have the potential to inflict social harms by generating discriminatory language, hateful speech, profane content, and other harmful material. Ethical assessment of these models is therefore critical. But it is also a challenging task, requiring expertise in several specialized domains, such as computational linguistics and social justice. While the research community has made significant strides in this domain, high entry barriers limit the accessibility of such ethical assessments to the wider population. This article introduces a new tool to democratize and standardize ethical assessment of natural language generation models: Tool for Ethical Assessment of Language generation models (TEAL), a component of Credo AI Lens, an open-source assessment framework.

assessment, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2207.10576

Country:

North America > United States > District of Columbia > Washington (0.06)
North America > United States > New York > New York County > New York City (0.04)
Europe > Ireland (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry:

Media (0.46)
Law (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

UVify's Draco drone is fast, furious fun for wannabe racers

Engadget | Sep-27-2017, 17:00:30 GMT

I look down and start gliding toward a dilapidated skate park below. Once I'm near the ground I pull my nose up and look level with the horizon. Spotting two trees, I race toward them, pass between them, then turn on a dime, skirting some shipping containers on my left. It's like every dream I've ever had about flying, but faster. I take off a pair of video goggles, and I see the shipping containers come into focus, this time directly in front of me, as my eyes adjust to the sunlight. This is my third "First Person View" flight with the Draco drone, and it's more exciting every time.

artificial intelligence, draco, drone, (15 more...)

Engadget

Country:

North America > United States > California > San Francisco County > San Francisco (0.05)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.05)
North America > Canada (0.05)
(2 more...)

Industry: Transportation (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.49)

How drones are learning to find their own way in the world

AITopics Original Links | Jan-18-2017, 12:14:21 GMT

When you're zipping through the air at 60 kilometres per hour, it can be hard to work out where you're going. But now drones can create detailed 3D maps as they fly – an advance that could let them navigate the world free from human input. Called Hydra Fusion, the system could one day allow drones to use a form of navigation known as simultaneous localisation and mapping to find their way in unfamiliar spaces – just as some robots do on the ground. It will also make them better at aerial surveillance. Hydra Fusion works by stitching together multiple images – in this case, consecutive frames of footage from a drone's video camera – to form a detailed 3D map while it is in the air.

artificial intelligence, drone, swarm, (12 more...)

AITopics Original Links

Country:

North America > United States > Utah > Salt Lake County > Salt Lake City (0.05)
North America > United States > Oregon (0.05)
North America > United States > New Mexico (0.05)
(2 more...)

Industry:

Information Technology > Robotics & Automation (0.50)
Transportation > Ground > Rail (0.31)
Transportation > Air (0.31)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)

Flying at 85MPH Isn't Even the Teal Drone's Best Trick

WIRED | Jul-20-2016, 13:40:41 GMT

Sure, with a top speed of 85 mph, it is twice as fast as a DJI Phantom 4 and it will leave almost every consumer drone eating its dust. But its appeal goes well beyond air speed. Buying a drone typically means having a specific activity in mind. There are aerial photography drones, racing drones, follow-me-around drones--it can all be a little overwhelming, particularly for someone who's new to UAVs. Teal wants to solve this problem with a modular machine you can tailor to suit your exact needs.

artificial intelligence, drone, teal, (7 more...)

WIRED

Industry:

Transportation > Air (1.00)
Information Technology > Robotics & Automation (1.00)
Media > Photography (0.96)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
