AITopics | activator

Collaborating Authors

activator

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Reviews: What Makes Objects Similar: A Unified Multi-Metric Learning Approach

Neural Information Processing SystemsJan-20-2025, 15:45:26 GMT

The main contribution of this paper is to introduce the interesting idea of "activator" (or integrator) operating on multiple metrics. Depending on how this activator (kappa) is picked, one may recover some previously proposed multiple metric learning approaches or create new relevant ones such as those discussed in Section 2.1. The proposed algorithm/analysis is an application of stochastic proximal (sub)gradient descent and is thus not a significant contribution from the optimization point of view (and the presentation of the algorithm is actually quite confusing, see below). My main problem with the paper is the experimental section. I was expecting some experiments showing how the choice of kappa influences the type of metrics that are learned, with an analysis of why some choices are better suited to some kind of problems / datasets / networks.

artificial intelligence, experiment, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router

Mei, Lingrui, Liu, Shenghua, Wang, Yiwei, Bi, Baolong, Yuan, Ruibin, Cheng, Xueqi

arXiv.org Artificial IntelligenceOct-3-2024

As Large Language Models (LLMs) grow increasingly powerful, ensuring their safety and alignment with human values remains a critical challenge. Ideally, LLMs should provide informative responses while avoiding the disclosure of harmful or sensitive information. However, current alignment approaches, which rely heavily on refusal strategies, such as training models to completely reject harmful prompts or applying coarse filters are limited by their binary nature. These methods either fully deny access to information or grant it without sufficient nuance, leading to overly cautious responses or failures to detect subtle harmful content. For example, LLMs may refuse to provide basic, public information about medication due to misuse concerns. Moreover, these refusal-based methods struggle to handle mixed-content scenarios and lack the ability to adapt to context-dependent sensitivities, which can result in over-censorship of benign content. To overcome these challenges, we introduce HiddenGuard, a novel framework for fine-grained, safe generation in LLMs. HiddenGuard incorporates Prism (rePresentation Router for In-Stream Moderation), which operates alongside the LLM to enable real-time, token-level detection and redaction of harmful content by leveraging intermediate hidden states. This fine-grained approach allows for more nuanced, context-aware moderation, enabling the model to generate informative responses while selectively redacting or replacing sensitive information, rather than outright refusal. We also contribute a comprehensive dataset with token-level fine-grained annotations of potentially harmful information across diverse contexts. Our experiments demonstrate that HiddenGuard achieves over 90% in F1 score for detecting and redacting harmful content while preserving the overall utility and informativeness of the model's responses.

arxiv preprint arxiv, language model, representation, (15 more...)

arXiv.org Artificial Intelligence

2410.02684

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)
Asia > China (0.04)
North America > United States > California > Merced County > Merced (0.04)
Europe > Latvia > Lubāna Municipality > Lubāna (0.04)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Implementation of a framework for deploying AI inference engines in FPGAs

Herbst, Ryan, Coffee, Ryan, Fronk, Nathan, Kim, Kukhee, Kim, Kuktae, Ruckman, Larry, Russell, J. J.

arXiv.org Artificial IntelligenceMay-30-2023

The LCLS2 Free Electron Laser FEL will generate xray pulses to beamline experiments at up to 1Mhz These experimentals will require new ultrahigh rate UHR detectors that can operate at rates above 100 kHz and generate data throughputs upwards of 1 TBs a data velocity which requires prohibitively large investments in storage infrastructure Machine Learning has demonstrated the potential to digest large datasets to extract relevant insights however current implementations show latencies that are too high for realtime data reduction objectives SLAC has endeavored on the creation of a software framework which translates MLs structures for deployment on Field Programmable Gate Arrays FPGAs deployed at the Edge of the data chain close to the instrumentation This framework leverages Xilinxs HLS framework presenting an API modeled after the open source Keras interface to the TensorFlow library This SLAC Neural Network Library SNL framework is designed with a streaming data approach optimizing the data flow between layers while minimizing the buffer data buffering requirements The goal is to ensure the highest possible framerate while keeping the maximum latency constrained to the needs of the experiment Our framework is designed to ensure the RTL implementation of the network layers supporting full redeployment of weights and biases without requiring resynthesis after training The ability to reduce the precision of the implemented networks through quantization is necessary to optimize the use of both DSP and memory resources in the FPGA We currently have a preliminary version of the toolset and are experimenting with both general purpose example networks and networks being designed for specific LCLS2 experiments.

artificial intelligence, expert system, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2305.19455

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)

Add feedback

Creating our first optimized DCGAN – Towards AI

#artificialintelligenceMar-1-2023, 00:30:25 GMT

Originally published on Towards AI. In this first article, we will use one of the simplest Datasets, made up of 28 x 28 images of handwritten numbers in grayscale. I suppose you all know that it is the very famous MNIST Dataset. While we are looking at the theory, we will see the code needed to make and train the GAN. Even though this is our first GAN, we will use several of the recommendations launched by Soumith Chintala, known as GAN Hacks. Thus, we will create a GAN correctly optimized for the Dataset used.

discriminator, generator, recommendation, (13 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

Add feedback

Are Advanced FPGAs the Activators of Smarter AI Features?

#artificialintelligenceFeb-3-2020, 04:03:27 GMT

FPGAs' ability to distribute massive workloads into parallel computation enables AI features to create highly efficient electronic devices. FREMONT, CA: Implementing FPGA increases the number of parallel computational elements and processing efficiency of the electronic devices. FPGAs that hold parallel and hardware-programmable feature enables electronics excellence at specialized workloads with high computational operations and optimal configurations. Over the past few years, FPGAs have proved to be the low-power solution, making it flexible and ideal for Neural Network (NN) architectures. Today, professionals are focusing on creating designs for supporting AI-based applications and functions.

advanced fpga, fpga, smarter ai feature, (6 more...)

#artificialintelligence

Country: North America > United States > California > Alameda County > Fremont (0.27)

Industry: Semiconductors & Electronics (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Build a simple Neural Network for Breast Cancer Detection using Tensorflow.js

#artificialintelligenceDec-1-2019, 20:52:18 GMT

There's more and more research done on detecting all types of cancers in early stages and thus increasing probability of survival. Since I've been passionate about machine learning for a while, I decided to bring my own contribution to this research and learn to train my own neural network detection model. The twist was to build it using Tensorflow with JavaScript, not with Python. We're also using React to manage the state and display the data we get back from the model. For this tutorial, I chose to work with a breast cancer dataset.

accuracy, dataset, tensorflow, (15 more...)

#artificialintelligence

Industry: Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (0.63)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Changing Lanes Challenges For AI Autonomous Cars - AI Trends

#artificialintelligenceOct-10-2019, 12:32:16 GMT

Hey buddy, pick a lane and stick with it! I was taken aback by a rude driver that had abruptly and without signaling opted to cut into my lane. I needed to quickly move into the lane next to me. Wait a second, unfortunately, none of the cars in that lane were willing to allow me into their jam packed lane. I'm sure that you likely have experienced the same kinds of frustrations in your daily commute as well. On the freeways here in Los Angeles, it seems like here aren't enough lanes for the number of cars. The lanes are often poorly marked and cars tend to radically veer toward each other. Plus, drivers illegally dart into and out of the HOV lane, and sometimes illegally use the emergency lane as a form of underhanded transit. We live in a world of lanes. On the open highways, there are usually a couple of lanes going in each direction.

lane change, self-driving car, turn signal, (16 more...)

#artificialintelligence

Country: North America > United States > California > Los Angeles County > Los Angeles (0.24)

Industry: Transportation > Ground > Road (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.93)

Add feedback

Indirect Causes in Dynamic Bayesian Networks Revisited

Motzek, Alexander (Universität zu Lübeck) | Möller, Ralf (Universität zu Lübeck)

AAAI ConferencesJul-15-2015

Modeling causal dependencies often demands cycles at a coarse-grained temporal scale. If Bayesian networks are to be used for modeling uncertainties, cycles are eliminated with dynamic Bayesian networks, spreading indirect dependencies over time and enforcing an infinitesimal resolution of time. Without a "causal design," i.e., without anticipating indirect influences appropriately in time, we argue that such networks return spurious results. By introducing activator random variables, we propose template fragments for modeling dynamic Bayesian networks under a causal use of time, anticipating indirect influences on a solid mathematical basis, obeying the laws of Bayesian networks.

activator, bayesian network, dependency, (13 more...)

AAAI Conferences

Twenty-Fourth International Joint Conference on Artificial Intelligence

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Germany > Schleswig-Holstein > Lübeck (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback