AITopics

Genre: Instructional Material > Course Syllabus & Notes (0.55)

Industry: Banking & Finance > Financial Services (0.48)

Technology:

Information Technology > Artificial Intelligence > Robots (0.48)
Information Technology > Architecture > Real Time Systems (0.31)

#artificialintelligenceFeb-27-2023, 14:02:22 GMT

Learn JAX in 2023: Part 2 - grad, jit, vmap, and pmap

In this tutorial, you will learn the power tools of JAX, grad, jit, vmap, and pmap. To learn how to use JAX's power tools, just keep reading. Welcome to our comprehensive guide on advanced JAX techniques! In the previous tutorial, we were introduced to JAX, and its predecessors autograd and xla. We also briefly looked into numerical computing with JAX. In this post, we'll be diving into some of the most powerful and useful features of the JAX library, including grad, jit, vmap, and pmap. These functions allow you to easily and efficiently compute gradients of functions, optimize your code for faster execution, and apply functions to arrays of data in parallel. By the end of this post, you'll have a solid understanding of how to use these tools to improve the performance and functionality of your numerical computation and machine learning tasks. We'll also cover the topic of randomness in JAX, including how to generate and control random numbers for use in your computations.

jax, random number, transformation, (15 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.68)

Technology:

Information Technology > Software > Programming Languages (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

#artificialintelligenceFeb-27-2023, 01:25:28 GMT

3 Ways to Build Machine Learning Models in Keras You Must Know

In this Keras tutorial we learn about 3 Ways to Build Machine Learning Models in Keras You Must Know.

build machine learning model, functional interface, keras, (11 more...)

Genre: Instructional Material (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Maillard, Antoine, Bandeira, Afonso S., Belius, David, Dokmanić, Ivan, Nakajima, Shuta

Injectivity of ReLU networks: perspectives from statistical physics

arXiv.org Artificial IntelligenceFeb-27-2023

When can the input of a ReLU neural network be inferred from its output? In other words, when is the network injective? We consider a single layer, $x \mapsto \mathrm{ReLU}(Wx)$, with a random Gaussian $m \times n$ matrix $W$, in a high-dimensional setting where $n, m \to \infty$. Recent work connects this problem to spherical integral geometry giving rise to a conjectured sharp injectivity threshold for $\alpha = \frac{m}{n}$ by studying the expected Euler characteristic of a certain random set. We adopt a different perspective and show that injectivity is equivalent to a property of the ground state of the spherical perceptron, an important spin glass model in statistical physics. By leveraging the (non-rigorous) replica symmetry-breaking theory, we derive analytical equations for the threshold whose solution is at odds with that from the Euler characteristic. Furthermore, we use Gordon's min--max theorem to prove that a replica-symmetric upper bound refutes the Euler characteristic prediction. Along the way we aim to give a tutorial-style introduction to key ideas from statistical physics in an effort to make the exposition accessible to a broad audience. Our analysis establishes a connection between spin glasses and integral geometry but leaves open the problem of explaining the discrepancies.

artificial intelligence, machine learning, prediction, (15 more...)

2302.14112

Country:

Europe > Switzerland (0.67)
North America > United States (0.45)

Genre:

Research Report (0.64)
Instructional Material > Course Syllabus & Notes (0.48)

Industry: Energy > Oil & Gas > Upstream (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Data Science (0.92)

arXiv.org Artificial IntelligenceFeb-27-2023

CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis

Nijkamp, Erik, Pang, Bo, Hayashi, Hiroaki, Tu, Lifu, Wang, Huan, Zhou, Yingbo, Savarese, Silvio, Xiong, Caiming

Program synthesis strives to generate a computer program as a solution to a given problem specification, expressed with input-output examples or natural language descriptions. The prevalence of large language models advances the state-of-the-art for program synthesis, though limited training resources and data impede open access to such models. To democratize this, we train and release a family of large language models up to 16.1B parameters, called CODEGEN, on natural language and programming language data, and open source the training library JAXFORMER. We show the utility of the trained model by demonstrating that it is competitive with the previous state-of-the-art on zero-shot Python code generation on HumanEval. We further investigate the multi-step paradigm for program synthesis, where a single program is factorized into multiple prompts specifying subproblems. To this end, we construct an open benchmark, Multi-Turn Programming Benchmark (MTPB), consisting of 115 diverse problem sets that are factorized into multi-turn prompts. Our analysis on MTPB shows that the same intent provided to CODEGEN in multi-turn fashion significantly improves program synthesis over that provided as a single turn. We make the training library JAXFORMER and model checkpoints available as open source contribution: https://github.com/salesforce/CodeGen.

large language model, machine learning, specification, (22 more...)

2203.13474

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Italy > Tuscany > Florence (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
(3 more...)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (0.34)

Industry: Information Technology > Software (0.35)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

arXiv.org Artificial IntelligenceFeb-27-2023

On the Role of Emergent Communication for Social Learning in Multi-Agent Reinforcement Learning

Karten, Seth, Kailas, Siva, Li, Huao, Sycara, Katia

Explicit communication among humans is key to coordinating and learning. Social learning, which uses cues from experts, can greatly benefit from the usage of explicit communication to align heterogeneous policies, reduce sample complexity, and solve partially observable tasks. Emergent communication, a type of explicit communication, studies the creation of an artificial language to encode a high task-utility message directly from data. However, in most cases, emergent communication sends insufficiently compressed messages with little or null information, which also may not be understandable to a third-party listener. This paper proposes an unsupervised method based on the information bottleneck to capture both referential complexity and task-specific utility to adequately explore sparse social communication scenarios in multi-agent reinforcement learning (MARL). We show that our model is able to i) develop a natural-language-inspired lexicon of messages that is independently composed of a set of emergent concepts, which span the observations and intents with minimal bits, ii) develop communication to align the action policies of heterogeneous agents with dissimilar feature models, and iii) learn a communication policy from watching an expert's action policy, which we term `social shadowing'.

communication, machine learning, reinforcement learning, (14 more...)

2302.14276

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)

Genre:

Instructional Material (0.93)
Research Report (0.82)

Industry: Education > Curriculum (0.72)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

#artificialintelligenceFeb-26-2023, 06:30:34 GMT

Multiple Linear Regression in R - Lituptech Digital

We are going to learn how to implement a Multiple Linear Regression model in R. This is a bit more complex than Simple Linear Regression but it's going to be so practical and fun. Multiple Linear Regression is a data science technique that uses several explanatory variables to predict the outcome of a response variable. A Multiple linear regression model attempts to model the relationship between two or more explanatory variables (independent variables) and a response variable (dependent variable), by fitting a linear equation to observed data. Every value of the independent variable x is associated with a value of the dependent variable y.

independent variable, linear regression, regressor, (13 more...)

Genre:

Research Report (0.64)
Instructional Material > Course Syllabus & Notes (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

#artificialintelligenceFeb-26-2023, 03:10:09 GMT

How AI could influence learning across subjects, while becoming a crucial one itself

The education industry is having to grapple with where artificial intelligence can fit into schools, from lesson plans to teacher training, since the technology has been propelled to the forefront of debates in recent months. The chatbot ChatGPT has caused shockwaves through the education industry over concerns about cheating and how students will learn, but the importance of AI in technological education has also been highlighted in the discussion. The introduction of AI could one day be integrated into all school subjects, not just computer science, experts say. And familiarity with the technology itself could soon become essential for students. "The way that we integrate AI education to the classroom is really an approach to connect artificial intelligence with core subjects like English, science, math, social studies, in addition to computer science and career technology education," Alex Kotran, co-founder and CEO of the AI Education Project, told The Hill.

artificial intelligence, machine learning, natural language, (18 more...)

Country: North America > United States > Massachusetts > Hampshire County > Amherst (0.05)

Genre: Instructional Material > Course Syllabus & Notes (0.50)

Industry: Education > Curriculum > Subject-Specific Education (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.80)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.45)

Myers, Vivek, Bıyık, Erdem, Sadigh, Dorsa

Active Reward Learning from Online Preferences

arXiv.org Artificial IntelligenceFeb-26-2023

Robot policies need to adapt to human preferences and/or new environments. Human experts may have the domain knowledge required to help robots achieve this adaptation. However, existing works often require costly offline re-training on human feedback, and those feedback usually need to be frequent and too complex for the humans to reliably provide. To avoid placing undue burden on human experts and allow quick adaptation in critical real-world situations, we propose designing and sparingly presenting easy-to-answer pairwise action preference queries in an online fashion. Our approach designs queries and determines when to present them to maximize the expected value derived from the queries' information. We demonstrate our approach with experiments in simulation, human user studies, and real robot experiments. In these settings, our approach outperforms baseline techniques while presenting fewer queries to human experts. Experiment videos, code and appendices are found at https://sites.google.com/view/onlineactivepreferences.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

2302.13507

Country: North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Weiming, Qu, Tianlin, Liu, Dingsheng, Luo

High-Precise Robot Arm Manipulation based on Online Iterative Learning and Forward Simulation with Positioning Error Below End-Effector Physical Minimum Displacement

arXiv.org Artificial IntelligenceFeb-26-2023

Precision is a crucial performance indicator for robot arms, as high precision manipulation allows for a wider range of applications. Traditional methods for improving robot arm precision rely on error compensation. However, these methods are often not robust and lack adaptability. Learning-based methods offer greater flexibility and adaptability, while current researches show that they often fall short in achieving high precision and struggle to handle many scenarios requiring high precision. In this paper, we propose a novel high-precision robot arm manipulation framework based on online iterative learning and forward simulation, which can achieve positioning error (precision) less than end-effector physical minimum displacement. Additionally, we parallelize multiple high-precision manipulation strategies to better combine online iterative learning and forward simulation. Furthermore, we consider the joint angular resolution of the real robot arm, which is usually neglected in related works. A series of experiments on both simulation and real UR3 robot arm platforms demonstrate that our proposed method is effective and promising. The related code will be available soon.

artificial intelligence, precision, robot arm, (13 more...)

2302.13338

Country:

Asia > China > Beijing > Beijing (0.04)
Europe > United Kingdom > England (0.04)

Genre:

Research Report > New Finding (1.00)
Instructional Material > Online (0.84)

Technology: Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (1.00)