Large Scale Online Learning
Bottou, Léon, Cun, Yann L.
Geometric Clustering Using the Information Bottleneck Method
Still, Susanne, Bialek, William, Bottou, Léon
We argue that K-means and deterministic annealing algorithms for geometric clustering can be derived from the more general Information Bottleneck approach. If we cluster the identities of data points to preserve information about their location, the set of optimal solutions is massively degenerate. But if we treat the equations that define the optimal solution as an iterative algorithm, then a set of "smooth" initial conditions selects solutions with the desired geometrical properties. In addition to conceptual unification, we argue that this approach can be more efficient and robust than classic algorithms.
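To make the iteration concrete, here is a minimal sketch of a deterministic-annealing-style soft clustering loop of the kind the abstract alludes to. The function name, the fixed inverse temperature `beta`, and the random initialization are illustrative assumptions, not the authors' algorithm.

```python
# Minimal sketch (illustrative, not the paper's code): soft assignments at
# inverse temperature beta; hard K-Means is recovered as beta -> infinity.
import numpy as np

def soft_kmeans(X, k, beta=10.0, iters=100, seed=0):
    """Softly cluster the rows of X into k centroids."""
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        # Squared distance between every point and every centroid.
        d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        # Boltzmann-style soft assignments p(c|x) proportional to exp(-beta * d2).
        logits = -beta * d2
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        p = np.exp(logits)
        p /= p.sum(axis=1, keepdims=True)
        # Centroid update: probability-weighted means of the data.
        centroids = (p.T @ X) / p.sum(axis=0)[:, None]
    return centroids, p
```

As `beta` grows the assignments harden and the update approaches classic K-Means; raising `beta` gradually is the deterministic-annealing schedule.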
Vicinal Risk Minimization
Chapelle, Olivier, Weston, Jason, Bottou, Léon, Vapnik, Vladimir
The Vicinal Risk Minimization principle establishes a bridge between generative models and methods derived from the Structural Risk Minimization Principle such as Support Vector Machines or Statistical Regularization. We explain how VRM provides a framework which integrates a number of existing algorithms, such as Parzen windows, Support Vector Machines, Ridge Regression, Constrained Logistic Classifiers and Tangent-Prop. We then show how the approach implies new algorithms for solving problems usually associated with generative models. New algorithms are described for dealing with pattern recognition problems with very different pattern distributions and dealing with unlabeled data. Preliminary empirical results are presented.
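A minimal sketch of the VRM idea as the abstract describes it: replace each training point with a Gaussian vicinity and minimize the empirical risk over samples drawn from those vicinities. The logistic model, the noise scale `sigma`, and the SGD schedule are illustrative assumptions, not the paper's experimental setup.

```python
# Minimal VRM sketch (illustrative assumptions): logistic regression trained
# on samples drawn from Gaussian vicinities of the training points.
import numpy as np

def vrm_logistic_sgd(X, y, sigma=0.1, lr=0.1, epochs=50, seed=0):
    """Linear logistic classifier; y has labels in {-1, +1}."""
    rng = np.random.default_rng(seed)
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        for i in rng.permutation(len(X)):
            # Draw one sample from the Gaussian vicinity of x_i.
            x = X[i] + sigma * rng.standard_normal(X.shape[1])
            # Stochastic gradient step on the logistic loss log(1 + exp(-margin)).
            margin = y[i] * (w @ x)
            w += lr * y[i] * x / (1.0 + np.exp(margin))
    return w
```

With `sigma -> 0` the vicinities collapse to delta functions and the procedure reduces to ordinary empirical risk minimization, which is the bridge the abstract describes.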
Convergence Properties of the K-Means Algorithms
Bottou, Léon, Bengio, Yoshua
K-Means is a popular clustering algorithm used in many applications, including the initialization of more computationally expensive algorithms (Gaussian mixtures, Radial Basis Functions, Learning Vector Quantization and some Hidden Markov Models). The practice of this initialization procedure often gives the frustrating feeling that K-Means performs most of the task in a small fraction of the overall time. This motivated us to better understand this convergence speed. A second reason lies in the traditional debate between hard threshold (e.g.
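As a companion to the convergence discussion, here is a minimal sketch of the two K-Means variants at issue: the batch (Lloyd) update and an online update with a MacQueen-style per-cluster 1/n step size. The function names and initialization are illustrative assumptions, not the paper's code.

```python
# Minimal sketch (illustrative) of batch vs. online K-Means updates.
import numpy as np

def batch_kmeans_step(X, centroids):
    """One Lloyd iteration: assign points to nearest centroid, recompute means."""
    d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    labels = d2.argmin(axis=1)
    for c in range(len(centroids)):
        members = X[labels == c]
        if len(members):
            centroids[c] = members.mean(axis=0)
    return centroids, labels

def online_kmeans(X, k, seed=0):
    """Online K-Means: move the winning centroid toward each sample in turn."""
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)].copy()
    counts = np.zeros(k)
    for x in X[rng.permutation(len(X))]:
        c = ((centroids - x) ** 2).sum(axis=1).argmin()
        counts[c] += 1
        centroids[c] += (x - centroids[c]) / counts[c]  # step size 1/n_c
    return centroids
```

The 1/n_c step makes each centroid the running mean of the samples assigned to it so far, which is what allows a single online pass to accomplish much of the clustering work.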