SC-OGM [63]: $x_{k+1} = x_k^+ + \frac{\kappa - 1}{\sqrt{8\kappa + 1} + 2 + \kappa}\,(x_k^+ - x_{k-1}^+)$
Proof of Observation 4. Figure 2 (middle) depicts the plane of iteration of TMM. Now, we complete the proof by showing that $\{U_k\}_{k=0}^{K}$ is nonincreasing. The optimality condition for a strongly convex function implies that there exists $u \in \partial g(x^+)$ such that $\nabla f(x) + u + L(x^+ - x) = 0$. Linear coupling [4] interprets acceleration as a unification of gradient descent and mirror descent. The auxiliary iterates of our setup are referred to as the mirror descent iterates in the linear coupling viewpoint.
Practical programming research of Linear DML model based on the simplest Python code: From the standpoint of novice researchers
This paper presents linear DML models for causal inference using the simplest Python code on a Jupyter notebook based on an Anaconda platform and compares the performance of different DML models. The results show that current Library API technology is not yet sufficient to enable novice Python users to build qualified and high-quality DML models with the simplest coding approach. Novice users attempting to perform DML causal inference using Python still have to improve their mathematical and computer knowledge to adapt to more flexible DML programming. Additionally, the issue of mismatched outcome variable dimensions is also widespread when building linear DML models in Jupyter notebook.
- Research Report > New Finding (0.48)
- Research Report > Experimental Study (0.30)
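The partialling-out estimator behind a linear DML model can be sketched without a dedicated causal-inference library, using scikit-learn for the cross-fitted nuisance models. This is a minimal sketch on simulated data; the variable names, coefficients, and the true effect of 2.0 are illustrative assumptions, not from the paper:

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_predict

rng = np.random.default_rng(0)
n = 2000
X = rng.standard_normal((n, 3))                                       # covariates
T = X @ np.array([1.0, 0.5, 0.0]) + rng.standard_normal(n)            # treatment
Y = 2.0 * T + X @ np.array([0.3, 0.0, 1.0]) + rng.standard_normal(n)  # true effect = 2.0

# Stage 1: cross-fitted nuisance models residualize Y and T on X
Y_res = Y - cross_val_predict(LinearRegression(), X, Y, cv=5)
T_res = T - cross_val_predict(LinearRegression(), X, T, cv=5)

# Stage 2: residual-on-residual regression recovers the treatment effect
theta = float(T_res @ Y_res / (T_res @ T_res))  # close to the true 2.0
```

Keeping `Y` as a one-dimensional array here sidesteps the outcome-dimension mismatch the abstract mentions: passing an `(n, 1)` column vector where a library expects shape `(n,)` is a frequent source of that error.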
Data-Driven Estimation of Conditional Expectations, Application to Optimal Stopping and Reinforcement Learning
When the underlying conditional density is known, conditional expectations can be computed analytically or numerically. When, however, such knowledge is not available and instead we are given a collection of training data, the goal of this work is to propose simple and purely data-driven means for estimating directly the desired conditional expectation. Because conditional expectations appear in the description of a number of stochastic optimization problems with the corresponding optimal solution satisfying a system of nonlinear equations, we extend our data-driven method to cover such cases as well. We test our methodology by applying it to Optimal Stopping and Optimal Action Policy in Reinforcement Learning.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > Greece (0.04)
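The basic idea — estimating a conditional expectation directly from training pairs, with no knowledge of the conditional density — can be sketched with a simple local-averaging regressor. This is a hedged illustration: k-nearest-neighbors is one standard data-driven estimator, not necessarily the authors' method, and the data-generating process is an assumption:

```python
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

rng = np.random.default_rng(0)
x = rng.uniform(-2.0, 2.0, size=(5000, 1))          # training samples of X
y = x[:, 0] ** 2 + 0.1 * rng.standard_normal(5000)  # Y with E[Y | X=x] = x**2

# Average the 50 nearest neighbors' Y values: a purely data-driven
# estimate of the conditional expectation, no density required.
est = KNeighborsRegressor(n_neighbors=50).fit(x, y)
pred = est.predict([[1.0]])[0]  # estimate of E[Y | X=1] = 1
```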
Exploring Cross-Domain Few-Shot Classification via Frequency-Aware Prompting
Zhang, Tiange, Cai, Qing, Gao, Feng, Qi, Lin, Dong, Junyu
Cross-Domain Few-Shot Learning has witnessed great strides with the development of meta-learning. However, most existing methods pay more attention to learning domain-adaptive inductive bias (meta-knowledge) through feature-wise manipulation or task-diversity improvement, while neglecting that deep networks tend to rely on high-frequency cues to make classification decisions. This degrades the robustness of the learned inductive bias, since high-frequency information is vulnerable and easily disturbed by noise. Hence, in this paper we make one of the first attempts to propose a Frequency-Aware Prompting method with mutual attention for Cross-Domain Few-Shot classification, which lets networks simulate the human visual perception of selecting different frequency cues when facing new recognition tasks. Specifically, a frequency-aware prompting mechanism is first proposed, in which the high-frequency components of the decomposed source image are replaced either with normal-distribution samples or with zeros to obtain frequency-aware augmented samples. Then, a mutual attention module is designed to learn a generalizable inductive bias under CD-FSL settings. More importantly, the proposed method is a plug-and-play module that can be directly applied to most off-the-shelf CD-FSL methods. Experimental results on CD-FSL benchmarks demonstrate the effectiveness of the proposed method as well as its ability to robustly improve the performance of existing CD-FSL methods. Resources at https://github.com/tinkez/FAP_CDFSC.
- North America > United States (0.14)
- Asia > China > Shandong Province > Qingdao (0.04)
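The frequency-aware augmentation the abstract describes — decompose an image, then replace its high-frequency components with normal-distribution samples or zeros — can be sketched with a 2-D FFT. This is an illustrative reconstruction, not the authors' code; the function name and cutoff radius are assumptions:

```python
import numpy as np

def frequency_augment(img, radius=8, mode="zero", rng=None):
    """Replace the high-frequency band of a 2-D image with zeros or Gaussian noise."""
    f = np.fft.fftshift(np.fft.fft2(img))  # spectrum with DC at the center
    h, w = img.shape
    yy, xx = np.ogrid[:h, :w]
    # Mask of frequencies outside the central low-frequency disk
    high = (yy - h // 2) ** 2 + (xx - w // 2) ** 2 > radius ** 2
    if mode == "zero":
        f[high] = 0.0                                   # zeroing variant
    else:
        rng = rng or np.random.default_rng()
        f[high] = (rng.standard_normal(int(high.sum()))  # normal-sampling variant
                   + 1j * rng.standard_normal(int(high.sum())))
    return np.real(np.fft.ifft2(np.fft.ifftshift(f)))

# A constant image has no high-frequency content, so the zeroing variant
# returns it unchanged; a noisy image keeps its shape either way.
aug_const = frequency_augment(np.ones((32, 32)), mode="zero")
aug_noise = frequency_augment(np.random.default_rng(0).standard_normal((32, 32)),
                              mode="noise")
```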
TensorKrowch: Smooth integration of tensor networks in machine learning
Monturiol, José Ramón Pareja, Pérez-García, David, Pozas-Kerstjens, Alejandro
Tensor networks are factorizations of high-dimensional tensors into network-like structures composed of smaller tensors. Originating from condensed matter physics and acclaimed for their efficient representation of quantum many-body systems [1-10], these structures have allowed researchers to comprehend the intricate properties of such systems and, additionally, simulate them using classical computers [11-13]. Notably, tensor networks are the most successful method for simulating the results of quantum advantage experiments [14-16]. Furthermore, tensor networks were rediscovered within the numerical linear algebra community [17-19], where the techniques have been adapted to other high-dimensional problems such as numerical integration [20], signal processing [21], or epidemic modelling [22]. With the advent of machine learning and the quest for expressive yet easy-to-train models, tensor networks have been suggested as promising candidates, due to their ability to parameterize regions of the complex space of size exponential in the number of input features. Since the pioneering works [23, 24] that used simple, 1-dimensional networks known as Matrix Product States (MPS) in the physics literature [4, 25] and as Tensor Trains in the numerical linear algebra literature [18], these have been applied in both supervised and unsupervised learning settings [26-28].
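The MPS/Tensor-Train structure mentioned above can be sketched directly in NumPy — a chain of small cores whose contraction represents an exponentially large tensor. The shapes and names below are illustrative and unrelated to TensorKrowch's actual API:

```python
import numpy as np

rng = np.random.default_rng(0)
d, chi, n = 2, 4, 5  # physical dimension, bond dimension, number of sites

# Tensor-Train / MPS cores with shape (left bond, physical index, right bond);
# the boundary bonds have dimension 1. Storage is n*d*chi**2 numbers,
# versus d**n entries for the full tensor.
cores = [rng.standard_normal((1 if i == 0 else chi, d, 1 if i == n - 1 else chi))
         for i in range(n)]

# Evaluate the represented tensor at one index configuration by
# contracting the shared bonds left to right.
config = [0, 1, 1, 0, 1]
vec = cores[0][:, config[0], :]          # shape (1, chi)
for core, s in zip(cores[1:], config[1:]):
    vec = vec @ core[:, s, :]            # contract the shared bond
amplitude = float(vec[0, 0])             # scalar entry for this configuration
```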
How to generate random numbers from normal distribution? - The Security Buddy
In Python, we can use NumPy's randn() function to generate random numbers from the normal distribution. Here, randn(10000) returns 10000 floating-point random numbers drawn from the standard normal distribution, so the mean of the generated numbers will be close to 0 and the standard deviation close to 1. After generating 10000 such random numbers, we can plot their distribution with a histogram.
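The steps above fit in a few lines; the histogram part assumes matplotlib is installed, so it is left as a comment:

```python
import numpy as np

samples = np.random.randn(10000)  # 10000 draws from the standard normal N(0, 1)

mean, std = samples.mean(), samples.std()  # close to 0 and 1, respectively

# Plotting the distribution as a histogram (assumes matplotlib is installed):
# import matplotlib.pyplot as plt
# plt.hist(samples, bins=50)
# plt.show()
```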
PyTorch Distributed
PyTorch Distributed is built on message-passing semantics, which let any process communicate with the others through messages. Several communication backends are supported, and the communicating processes need not run on the same machine; the processes must, however, be started in parallel, and coordination tooling in the cluster must be enabled. There are three main components in torch.distributed: Distributed Data-Parallel (DDP) training, RPC-based distributed training, and collective communication.
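A minimal runnable sketch of the collective-communication component, using a single process (world_size=1) with the gloo backend purely so it runs without a cluster; the address and port are placeholder assumptions, and in real use each rank runs the same script with its own rank and world size:

```python
import os
import torch
import torch.distributed as dist

# Rendezvous info that the coordination tooling would normally provide
os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")

dist.init_process_group(backend="gloo", rank=0, world_size=1)

t = torch.ones(3)
dist.all_reduce(t, op=dist.ReduceOp.SUM)  # collective: sums this tensor across all ranks
dist.destroy_process_group()
```

With world_size=1 the all_reduce leaves the tensor unchanged; with N ranks each entry would be the sum over the N processes.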
#010 C Random initialization of parameters in a Neural Network - Master Data Science
Why do we need random initialization? If we initialize the parameters to zeros, unit1 and unit2 are symmetric, and it can be shown by induction that these two units compute the same function after every iteration of training. Even if we have many hidden units in the hidden layer, they all stay symmetric if we initialize the corresponding parameters to zeros. To solve this problem we need to initialize the weights randomly rather than with zeros. We can then initialize \(b_1\) with zeros, because the random initialization of \(W_1\) breaks the symmetry, and unit1 and unit2 will not output the same value even if \(b_1\) is zero.
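The initialization scheme above can be sketched in NumPy; the layer sizes and the 0.01 scale are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
n_x, n_h = 3, 4  # input size and hidden-layer width (illustrative)

W1 = rng.standard_normal((n_h, n_x)) * 0.01  # small random weights break the symmetry
b1 = np.zeros((n_h, 1))                      # zeros are fine for the biases

# With zero weights every hidden unit computes the identical pre-activation,
# so all units output the same value; random weights make them differ.
x = rng.standard_normal((n_x, 1))
zero_out = np.tanh(np.zeros((n_h, n_x)) @ x + b1)  # all rows identical
rand_out = np.tanh(W1 @ x + b1)                    # rows differ
```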
Noise-robust Clustering
Adesunkanmi, Rahmat, Kumar, Ratnesh
This paper presents noise-robust clustering techniques in unsupervised machine learning. The uncertainty about the noise, consistency, and other ambiguities can become severe obstacles in data analytics. As a result, data quality, cleansing, management, and governance remain critical disciplines when working with Big Data. With this complexity, it is no longer sufficient to treat data deterministically as in a classical setting, and it becomes meaningful to account for noise distribution and its impact on data sample values. Classical clustering methods group data into "similarity classes" depending on their relative distances or similarities in the underlying space. This paper addresses this problem by extending classical $K$-means and $K$-medoids clustering over data distributions (rather than the raw data). This involves measuring distances among distributions using two types of measures: the optimal mass transport (also called Wasserstein distance, denoted $W_2$) and a novel distance measure proposed in this paper, the expected value of random variable distance (denoted ED). The presented distribution-based $K$-means and $K$-medoids algorithms cluster the data distributions first and then assign each raw data point to the cluster of its distribution.
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Colorado (0.04)
- North America > United States > Iowa > Story County > Ames (0.04)
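For one-dimensional samples of equal size, the $W_2$ distance between two empirical distributions reduces to matching sorted samples (quantile coupling). A minimal sketch of that building block — the helper name is illustrative, and this is not the paper's implementation:

```python
import numpy as np

def w2_empirical(x, y):
    """W_2 distance between two equal-size 1-D empirical distributions:
    the optimal transport plan matches the sorted samples (quantiles)."""
    x, y = np.sort(np.asarray(x, float)), np.sort(np.asarray(y, float))
    return float(np.sqrt(np.mean((x - y) ** 2)))

a = np.array([0.0, 1.0, 2.0])
d_same = w2_empirical(a, a)      # identical distributions: 0.0
d_shift = w2_empirical(a, a + 2) # a pure shift by 2 gives W_2 = 2.0
```

A distribution-based $K$-means or $K$-medoids would then use such pairwise distances between per-point distributions in place of Euclidean distances between raw points.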
MimicGAN: Corruption-Mimicking for Blind Image Recovery & Adversarial Defense
Anirudh, Rushil, Thiagarajan, Jayaraman J., Kailkhura, Bhavya, Bremer, Timo
Solving inverse problems continues to be a central challenge in computer vision. Existing techniques either explicitly construct an inverse mapping using prior knowledge about the corruption, or learn the inverse directly using a large collection of examples. However, in practice, the nature of corruption may be unknown, and thus it is challenging to regularize the problem of inferring a plausible solution. On the other hand, collecting task-specific training data is tedious for known corruptions and impossible for unknown ones. We present MimicGAN, an unsupervised technique to solve general inverse problems based on image priors in the form of generative adversarial networks (GANs). Using a GAN prior, we show that one can reliably recover solutions to underdetermined inverse problems through a surrogate network that learns to mimic the corruption at test time. Our system successively estimates the corruption and the clean image without the need for supervisory training, while outperforming existing baselines in blind image recovery. We also demonstrate that MimicGAN improves upon recent GAN-based defenses against adversarial attacks and represents one of the strongest test-time defenses available today.
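The alternating idea — estimate the corruption for the current clean guess, then re-estimate the clean image under that corruption — can be illustrated on a toy problem. Everything here is an illustrative stand-in: a linear map plays the "generator" and an unknown scalar gain plays the corruption, whereas MimicGAN uses a GAN prior and a surrogate corruption network:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((20, 4))   # toy linear "generator": G(z) = A @ z
z_true = rng.standard_normal(4)
c_true = 0.5                        # unknown corruption: a scalar gain
y = c_true * (A @ z_true)           # observed corrupted image

z = rng.standard_normal(4)          # initial clean-image estimate (latent)
c = 1.0                             # initial corruption estimate
for _ in range(10):
    x = A @ z
    c = float(x @ y / (x @ x))                     # mimic the corruption for the current clean guess
    z, *_ = np.linalg.lstsq(c * A, y, rcond=None)  # re-estimate the clean image under that corruption

recon = c * (A @ z)  # the estimated (corruption, clean image) pair explains the observation
```

In this toy the gain and the image scale are only jointly identified; in MimicGAN, the GAN prior constrains the clean estimate to the set of plausible images, which is what makes the alternation useful for blind recovery.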