Goto

Collaborating Authors

 wolfram


What can we learn from signals and systems in a transformer? Insights for probabilistic modeling and inference architecture

Chang, Heng-Sheng, Mehta, Prashant G.

arXiv.org Artificial Intelligence

In the 1940s, Wiener introduced a linear predictor, where the future prediction is computed by linearly combining the past data. A transformer generalizes this idea: it is a nonlinear predictor where the next-token prediction is computed by nonlinearly combining the past tokens. In this essay, we present a probabilistic model that interprets transformer signals as surrogates of conditional measures, and layer operations as fixed-point updates. An explicit form of the fixed-point update is described for the special case when the probabilistic model is a hidden Markov model (HMM). In part, this paper is in an attempt to bridge the classical nonlinear filtering theory with modern inference architectures.


The Biggest Dating App Faux Pas for Gen Z? Being Cringe

WIRED

When it comes to online dating, Giovanni Wolfram, a 25-year-old living in Santa Fe, New Mexico, isn't all too worried about whether his fellow dating app users will find him attractive. Rather, his biggest fear is that he might come off as "cringey." "You can get away with being ugly," Wolfram says. "But being cringey is just like--that's a character that's imprinted on you." Since he first joined Hinge at 18, he has worked hard to scrub his profile of sincerity.


Intelligence at the Edge of Chaos

Zhang, Shiyang, Patel, Aakash, Rizvi, Syed A, Liu, Nianchen, He, Sizhuang, Karbasi, Amin, Zappala, Emanuele, van Dijk, David

arXiv.org Artificial Intelligence

We explore the emergence of intelligent behavior in artificial systems by investigating how the complexity of rule-based systems influences the capabilities of models trained to predict these rules. Our study focuses on elementary cellular automata (ECA), simple yet powerful one-dimensional systems that generate behaviors ranging from trivial to highly complex. By training distinct Large Language Models (LLMs) on different ECAs, we evaluated the relationship between the complexity of the rules' behavior and the intelligence exhibited by the LLMs, as reflected in their performance on downstream tasks. Our findings reveal that rules with higher complexity lead to models exhibiting greater intelligence, as demonstrated by their performance on reasoning and chess move prediction tasks. Both uniform and periodic systems, and often also highly chaotic systems, resulted in poorer downstream performance, highlighting a sweet spot of complexity conducive to intelligence. We conjecture that intelligence arises from the ability to predict complexity and that creating intelligence may require only exposure to complexity.


Is English the New Programming Language? How About Pseudo-code Engineering?

Michaelsen, Gian Alexandre, Santos, Renato P. dos

arXiv.org Artificial Intelligence

Background: The integration of artificial intelligence (AI) into daily life, particularly through chatbots utilizing natural language processing (NLP), presents both revolutionary potential and unique challenges. This intended to investigate how different input forms impact ChatGPT, a leading language model by OpenAI, performance in understanding and executing complex, multi-intention tasks. Design: Employing a case study methodology supplemented by discourse analysis, the research analyzes ChatGPT's responses to inputs varying from natural language to pseudo-code engineering. The study specifically examines the model's proficiency across four categories: understanding of intentions, interpretability, completeness, and creativity. Setting and Participants: As a theoretical exploration of AI interaction, this study focuses on the analysis of structured and unstructured inputs processed by ChatGPT, without direct human participants. Data collection and analysis: The research utilizes synthetic case scenarios, including the organization of a "weekly meal plan" and a "shopping list," to assess ChatGPT's response to prompts in both natural language and pseudo-code engineering. The analysis is grounded in the identification of patterns, contradictions, and unique response elements across different input formats. Results: Findings reveal that pseudo-code engineering inputs significantly enhance the clarity and determinism of ChatGPT's responses, reducing ambiguity inherent in natural language. Enhanced natural language, structured through prompt engineering techniques, similarly improves the model's interpretability and creativity. Conclusions: The study underscores the potential of pseudo-code engineering in refining human-AI interaction and achieving more deterministic, concise, and direct outcomes, advocating for its broader application across disciplines requiring precise AI responses.


Why does AI being good at math matter?

MIT Technology Review

This is the second time in recent months that the AI world has got all excited about math. The rumor mill went into overdrive last November, when there were reports that the boardroom drama at OpenAI, which saw CEO Sam Altman temporarily ousted, was caused by a new powerful AI breakthrough. It was reported that the AI system in question was called Q* and could solve complex math calculations. You don't need to be really into math to see why this stuff is potentially very exciting. Math is really, really hard for AI models.


Where is the Edge of Chaos?

Fulbright, Ron

arXiv.org Artificial Intelligence

Previous study of cellular automata and random Boolean networks has shown emergent behavior occurring at the edge of chaos where the randomness (disorder) of internal connections is set to an intermediate critical value. The value at which maximal emergent behavior occurs has been observed to be inversely related to the total number of interconnected elements, the neighborhood size. However, different equations predict different values. This paper presents a study of one-dimensional cellular automata (1DCA) verifying the general relationship but finding a more precise correlation with the radius of the neighborhood rather than neighborhood size. Furthermore, the critical value of the emergent regime is observed to be very close to 1/e hinting at the discovery of a fundamental characteristic of emergent systems.


AI and the Future of Jobs Part 2 - realrate

#artificialintelligence

Carrying on the hot topic of the moment, many people are saying that AI will be taking humans' jobs soon, or some indeed say that it has already started. So, what is the situation? British-American computer scientist, physicist, and businessperson, Stephan Wolfram, says it is not so straightforward. Although AI has reached groundbreaking heights lately, which leads humans to ask these pertinent questions, and indeed Wolfram says in the future this will only increase, he says we must look to history for a rounded look at the argument. The role of humans has ever been evolving….'and


ChatGPT Massive Upgrade. Ultimate Powers, Internet Access, and More.

#artificialintelligence

ChatGPT is trained till 2021, and that was a big bummer. With the latest update from OpenAI, it can access the internet, upload images, videos, audio, and CSV files, and connect to your website for enhanced functionality. Well, that world is here! OpenAI has just announced a game-changing upgrade to ChatGPT that will revolutionize how you interact with AI. OpenAI recently announced a massive upgrade to ChatGPT, which now supports plugins that extend its functionality.


Classification of Discrete Dynamical Systems Based on Transients

Hudcová, Barbora, Mikolov, Tomáš

arXiv.org Artificial Intelligence

In order to develop systems capable of artificial evolution, we need to identify which systems can produce complex behavior. We present a novel classification method applicable to any class of deterministic discrete space and time dynamical systems. The method is based on classifying the asymptotic behavior of the average computation time in a given system before entering a loop. We were able to identify a critical region of behavior that corresponds to a phase transition from ordered behavior to chaos across various classes of dynamical systems. To show that our approach can be applied to many different computational systems, we demonstrate the results of classifying cellular automata, Turing machines, and random Boolean networks. Further, we use this method to classify 2D cellular automata to automatically find those with interesting, complex dynamics. We believe that our work can be used to design systems in which complex structures emerge. Also, it can be used to compare various versions of existing attempts to model open-ended evolution (Ray (1991), Ofria et al. (2004), Channon (2006)).


Stephen Wolfram on the future of programming and why we live in a computational universe

#artificialintelligence

This article originally appeared on TechRepublic. When it came to figuring out which computer scientist should help linguists decipher inscrutable alien texts, it was Stephen Wolfram who got the call. Sure, these extraterrestrials may only have existed in the sci-fi movie Arrival, but if ET ever does drop out of orbit, Wolfram might well still be on the short list of people to contact. Download this article as a PDF (free registration required). The British-born computer scientist's life is littered with exceptional achievements -- completing a PhD in theoretical physics at Caltech at age 20, winning a MacArthur Genius Grant at 21, and creating the technical computing platform Mathematica (which is used by millions of mathematicians, scientists, and engineers worldwide), plus the Wolfram Language, and the Wolfram Alpha knowledge engine.