Evans, Alex
The Mysterious Case of Neuron 1512: Injectable Realignment Architectures Reveal Internal Characteristics of Meta's Llama 2 Model
Smith, Brenden, Baker, Dallin, Chase, Clayton, Barney, Myles, Parker, Kaden, Allred, Makenna, Hu, Peter, Evans, Alex, Fulda, Nancy
Large Language Models (LLMs) have an unrivaled and invaluable ability to "align" their output to a diverse range of human preferences, by mirroring them in the text they generate. The internal characteristics of such models, however, remain largely opaque. This work presents the Injectable Realignment Model (IRM) as a novel approach to language model interpretability and explainability. Inspired by earlier work on Neural Programming Interfaces, we construct and train a small network -- the IRM -- to induce emotion-based alignments within a 7B parameter LLM architecture. The IRM outputs are injected via layerwise addition at various points during the LLM's forward pass, thus modulating its behavior without changing the weights of the original model. This isolates the alignment behavior from the complex mechanisms of the transformer model. Analysis of the trained IRM's outputs reveals a curious pattern. Across more than 24 training runs and multiple alignment datasets, patterns of IRM activations align themselves in striations associated with a neuron's index within each transformer layer, rather than being associated with the layers themselves. Further, a single neuron index (1512) is strongly correlated with all tested alignments. This result, although initially counterintuitive, is directly attributable to design choices present within almost all commercially available transformer architectures, and highlights a potential weak point in Meta's pretrained Llama 2 models. It also demonstrates the value of the IRM architecture for language model analysis and interpretability. Our code and datasets are available at https://github.com/DRAGNLabs/injectable-alignment-model
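As a rough illustration of the injection mechanism this abstract describes, the sketch below adds the output of a small trainable network to a frozen decoder layer's hidden states via a PyTorch forward hook. The module name, bottleneck size, and hook placement are assumptions chosen for illustration; the paper's actual IRM architecture and injection points are defined in the linked repository.

    # Illustrative sketch only: a small "injectable" network whose output is added
    # to a frozen transformer layer's hidden states during the forward pass.
    # Names, sizes, and hook placement are assumptions, not the paper's exact design.
    import torch
    import torch.nn as nn

    class InjectableRealignmentSketch(nn.Module):
        """Tiny MLP producing an additive offset for one transformer layer."""
        def __init__(self, hidden_size: int, bottleneck: int = 64):
            super().__init__()
            self.net = nn.Sequential(
                nn.Linear(hidden_size, bottleneck),
                nn.ReLU(),
                nn.Linear(bottleneck, hidden_size),
            )

        def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
            return self.net(hidden_states)

    def attach_injection(layer: nn.Module, irm: InjectableRealignmentSketch):
        """Register a forward hook that adds the IRM output to the layer's output.

        Assumes the layer returns a tuple whose first element is the hidden
        states, as Hugging Face Llama decoder layers do.
        """
        def hook(_module, _inputs, output):
            hidden = output[0] if isinstance(output, tuple) else output
            injected = hidden + irm(hidden)  # layerwise additive injection
            return (injected,) + output[1:] if isinstance(output, tuple) else injected
        return layer.register_forward_hook(hook)

    # Usage (base model stays frozen; only the IRM is trained):
    # for p in base_model.parameters():
    #     p.requires_grad_(False)
    # irm = InjectableRealignmentSketch(hidden_size=4096)
    # handle = attach_injection(base_model.model.layers[10], irm)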
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
Wen, Bowen, Tremblay, Jonathan, Blukis, Valts, Tyree, Stephen, Müller, Thomas, Evans, Alex, Fox, Dieter, Kautz, Jan, Birchfield, Stan
We present a near real-time method for 6-DoF tracking of an unknown object from a monocular RGBD video sequence, while simultaneously performing neural 3D reconstruction of the object. Our method works for arbitrary rigid objects, even when visual texture is largely absent. The object is assumed to be segmented in the first frame only. No additional information is required, and no assumption is made about the interaction agent. Key to our method is a Neural Object Field that is learned concurrently with a pose graph optimization process in order to robustly accumulate information into a consistent 3D representation capturing both geometry and appearance. A dynamic pool of posed memory frames is automatically maintained to facilitate communication between the concurrent tracking and reconstruction threads. Our approach handles challenging sequences with large pose changes, partial and full occlusion, untextured surfaces, and specular highlights. We show results on the HO3D, YCBInEOAT, and BEHAVE datasets, demonstrating that our method significantly outperforms existing approaches. Project page: https://bundlesdf.github.io
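A minimal structural sketch of the concurrency pattern mentioned above: an online tracking loop and a reconstruction loop communicate only through a shared pool of posed memory frames. The data structures, keyframe criterion, and timing below are placeholders for illustration, not BundleSDF's implementation.

    # Illustrative sketch of the two-thread structure: an online tracking loop
    # and a reconstruction loop sharing a pool of posed memory frames.
    import threading
    import time
    from dataclasses import dataclass, field

    @dataclass
    class MemoryPool:
        """Thread-safe pool of (frame_id, pose) keyframes shared by both loops."""
        frames: list = field(default_factory=list)
        lock: threading.Lock = field(default_factory=threading.Lock)

        def add(self, frame_id: int, pose):
            with self.lock:
                self.frames.append((frame_id, pose))

        def snapshot(self):
            with self.lock:
                return list(self.frames)

    def tracking_loop(pool: MemoryPool, num_frames: int = 20):
        """Estimate a pose per frame and occasionally promote it to a memory frame."""
        for frame_id in range(num_frames):
            pose = f"pose_{frame_id}"      # placeholder for RGBD pose estimation
            if frame_id % 5 == 0:          # placeholder keyframe criterion
                pool.add(frame_id, pose)
            time.sleep(0.01)

    def reconstruction_loop(pool: MemoryPool, stop: threading.Event):
        """Refine the object representation from whatever keyframes currently exist."""
        while not stop.is_set():
            keyframes = pool.snapshot()    # placeholder for Neural Object Field updates
            time.sleep(0.05)

    pool, stop = MemoryPool(), threading.Event()
    recon = threading.Thread(target=reconstruction_loop, args=(pool, stop))
    recon.start()
    tracking_loop(pool)
    stop.set()
    recon.join()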
Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation
Lin, Yunzhi, Müller, Thomas, Tremblay, Jonathan, Wen, Bowen, Tyree, Stephen, Evans, Alex, Vela, Patricio A., Birchfield, Stan
We present a parallelized optimization method based on fast Neural Radiance Fields (NeRF) for estimating 6-DoF pose of a camera with respect to an object or scene. Given a single observed RGB image of the target, we can predict the translation and rotation of the camera by minimizing the residual between pixels rendered from a fast NeRF model and pixels in the observed image. We integrate a momentum-based camera extrinsic optimization procedure into Instant Neural Graphics Primitives, a recent exceptionally fast NeRF implementation. By introducing parallel Monte Carlo sampling into the pose estimation task, our method overcomes local minima and improves efficiency in a more extensive search space. We also show the importance of adopting a more robust pixel-based loss function to reduce error. Experiments demonstrate that our method can achieve improved generalization and robustness on both synthetic and real-world benchmarks.
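The following sketch illustrates the general idea of optimizing many camera pose hypotheses in parallel with momentum on a pixel residual. The differentiable renderer here is a stand-in for Instant Neural Graphics Primitives, and the pose parameterization, loss, and resampling rule are illustrative assumptions only.

    # Illustrative sketch: many candidate poses are optimized together by
    # gradient descent with momentum on a photometric residual, and weak
    # hypotheses are periodically resampled around the current best one.
    import torch

    def render_pixels(poses: torch.Tensor) -> torch.Tensor:
        """Placeholder differentiable renderer: maps (N, 6) poses to (N, P, 3) pixels."""
        return torch.tanh(poses[:, :3]).unsqueeze(1).expand(-1, 64, -1)

    observed = torch.rand(64, 3)                    # observed RGB pixels (placeholder)
    poses = torch.randn(32, 6, requires_grad=True)  # 32 Monte Carlo pose hypotheses
    optimizer = torch.optim.SGD([poses], lr=1e-2, momentum=0.9)

    for step in range(100):
        optimizer.zero_grad()
        residual = render_pixels(poses) - observed       # per-hypothesis photometric error
        loss_per_pose = residual.abs().mean(dim=(1, 2))  # robust L1-style pixel loss
        loss_per_pose.sum().backward()
        optimizer.step()
        if step % 25 == 24:  # resample the worst hypotheses near the best one
            with torch.no_grad():
                best = poses[loss_per_pose.argmin()]
                worst = loss_per_pose.topk(8).indices
                poses[worst] = best + 0.05 * torch.randn_like(poses[worst])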
Concave Pro-rata Games
Johnson, Nicholas A. G., Diamandis, Theo, Evans, Alex, de Valence, Henry, Angeris, Guillermo
Existing blockchain systems come to consensus on transactions in batches, called blocks. Yet the economic mechanisms those transactions interact with are generally designed to process each individual transaction sequentially, making their behavior reliant on the ordering of transactions within the batch. This abstraction mismatch is the primary source of miner extractable value (MEV), defined as economic value that can be captured by the block proposer (originally the miner) who selects and sequences the transactions to be included in the batch [6]. However, rather than trying to blind the block proposer, or to choose a "fair" ordering within a batch (which is difficult, if not impossible, to construct in any direct sense on current systems), we could alternatively attempt to design economic mechanisms that do not depend on the order of transactions within a block and instead process each batch of transactions 'all at once'.
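A toy sketch of the order-independent batch processing suggested above: every transaction is settled against the same aggregate quantity and paid out pro rata from a concave payout function, so permuting the batch cannot change any individual payout. The square-root payout and the function name are illustrative assumptions, not the paper's model.

    # Toy example only: an order-invariant pro-rata batch rule with an
    # arbitrary concave payout function (sqrt), not the paper's actual model.
    import math

    def pro_rata_settle(amounts, payout=math.sqrt):
        """Pay out a concave function of the batch total, pro rata to each contribution."""
        total = sum(amounts)
        if total == 0:
            return [0.0 for _ in amounts]
        pool_payout = payout(total)
        return [a / total * pool_payout for a in amounts]

    # Reordering the batch only reorders the results; no participant's payout changes.
    batch = [4.0, 1.0, 5.0]
    shuffled = list(reversed(batch))
    assert sorted(pro_rata_settle(batch)) == sorted(pro_rata_settle(shuffled))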