AITopics | feedforward net

This work focuses on deriving quantitative approximation error bounds for neural ordinary differential equations having at most quadratic nonlinearities in the dynamics. The simple dynamics of this model form demonstrates how expressivity can be derived primarily from iteratively composing many basic elementary operations, versus from the complexity of those elementary operations themselves. Like the analog differential analyzer and universal polynomial DAEs, the expressivity is derived instead primarily from the "depth" of the model. These results contribute to our understanding of what depth specifically imparts to the capabilities of deep learning architectures.

approximation, artificial intelligence, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2504.09385

Country: North America > United States > Illinois (0.28)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Generalization and Parameter Estimation in Feedforward Nets: Some Experiments

Neural Information Processing SystemsApr-6-2023, 19:47:49 GMT

We have done an empirical study of the relation of the number of parameters (weights) in a feedforward net to generalization perfor(cid:173) mance. In one, we use simulated data sets with well-controlled parameters, such as the signal-to-noise ratio of continuous-valued data. In the second, we train the network on vector-quantized mel cepstra from real speech samples. In each case, we use back-propagation to train the feedforward net to discriminate in a multiple class pattern classification problem. We report the results of these studies, and show the application of cross-validation techniques to prevent overfitting.

experiment, feedforward net, generalization and parameter estimation

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)

Add feedback

Recurrent Networks: Second Order Properties and Pruning

Pedersen, Morten With, Hansen, Lars Kai

Neural Information Processing SystemsDec-31-1995

Second order properties of cost functions for recurrent networks are investigated. We analyze a layered fully recurrent architecture, the virtue of this architecture is that it features the conventional feedforward architecture as a special case. A detailed description of recursive computation of the full Hessian of the network cost function is provided. We discuss the possibility of invoking simplifying approximations of the Hessian and show how weight decays iron the cost function and thereby greatly assist training. We present tentative pruning results, using Hassibi et al.'s Optimal Brain Surgeon, demonstrating that recurrent networks can construct an efficient internal memory. 1 LEARNING IN RECURRENT NETWORKS Time series processing is an important application area for neural networks and numerous architectures have been suggested, see e.g. (Weigend and Gershenfeld, 94). The most general structure is a fully recurrent network and it may be adapted using Real Time Recurrent Learning (RTRL) suggested by (Williams and Zipser, 89). By invoking a recurrent network, the length of the network memory can be adapted to the given time series, while it is fixed for the conventional lag-space net (Weigend et al., 90). In forecasting, however, feedforward architectures remain the most popular structures; only few applications are reported based on the Williams&Zipser approach.

architecture, cost function, weight decay, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.05)
North America > United States > California > San Mateo County > Redwood City (0.04)
Europe > Denmark > Capital Region > Kongens Lyngby (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Recurrent Networks: Second Order Properties and Pruning

Pedersen, Morten With, Hansen, Lars Kai

Neural Information Processing SystemsDec-31-1995

Second order properties of cost functions for recurrent networks are investigated. We analyze a layered fully recurrent architecture, the virtue of this architecture is that it features the conventional feedforward architecture as a special case. A detailed description of recursive computation of the full Hessian of the network cost function is provided. We discuss the possibility of invoking simplifying approximations of the Hessian and show how weight decays iron the cost function and thereby greatly assist training. We present tentative pruning results, using Hassibi et al.'s Optimal Brain Surgeon, demonstrating that recurrent networks can construct an efficient internal memory. 1 LEARNING IN RECURRENT NETWORKS Time series processing is an important application area for neural networks and numerous architectures have been suggested, see e.g. (Weigend and Gershenfeld, 94). The most general structure is a fully recurrent network and it may be adapted using Real Time Recurrent Learning (RTRL) suggested by (Williams and Zipser, 89). By invoking a recurrent network, the length of the network memory can be adapted to the given time series, while it is fixed for the conventional lag-space net (Weigend et al., 90). In forecasting, however, feedforward architectures remain the most popular structures; only few applications are reported based on the Williams&Zipser approach.

architecture, cost function, weight decay, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.05)
North America > United States > California > San Mateo County > Redwood City (0.04)
Europe > Denmark > Capital Region > Kongens Lyngby (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Recurrent Networks: Second Order Properties and Pruning

Pedersen, Morten With, Hansen, Lars Kai

Neural Information Processing SystemsDec-31-1995

Second order properties of cost functions for recurrent networks are investigated. We analyze a layered fully recurrent architecture, the virtue of this architecture is that it features the conventional feedforward architecture as a special case. A detailed description of recursive computation of the full Hessian of the network cost function isprovided. We discuss the possibility of invoking simplifying approximations of the Hessian and show how weight decays iron the cost function and thereby greatly assist training. We present tentative pruningresults, using Hassibi et al.'s Optimal Brain Surgeon, demonstrating that recurrent networks can construct an efficient internal memory. 1 LEARNING IN RECURRENT NETWORKS Time series processing is an important application area for neural networks and numerous architectures have been suggested, see e.g.

artificial intelligence, machine learning, weight decay, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.14)
Europe > Denmark (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Remarks on Interpolation and Recognition Using Neural Nets

Sontag, Eduardo D.

Neural Information Processing SystemsDec-31-1991

We consider different types of single-hidden-Iayer feedforward nets: with or without direct input to output connections, and using either threshold or sigmoidal activation functions. The main results show that direct connections in threshold nets double the recognition but not the interpolation power, while using sigmoids rather than thresholds allows (at least) doubling both. Various results are also given on VC dimension and other measures of recognition capabilities.

direct connection, interpolation and recognition, sontag, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey > Middlesex County > New Brunswick (0.04)
North America > United States > District of Columbia > Washington (0.04)
North America > Canada > Ontario > National Capital Region > Ottawa (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Remarks on Interpolation and Recognition Using Neural Nets

Sontag, Eduardo D.

Neural Information Processing SystemsDec-31-1991

We consider different types of single-hidden-Iayer feedforward nets: with or without direct input to output connections, and using either threshold or sigmoidal activation functions. The main results show that direct connections in threshold nets double the recognition but not the interpolation power, while using sigmoids rather than thresholds allows (at least) doubling both. Various results are also given on VC dimension and other measures of recognition capabilities.

direct connection, interpolation and recognition, sontag, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey > Middlesex County > New Brunswick (0.04)
North America > United States > District of Columbia > Washington (0.04)
North America > Canada > Ontario > National Capital Region > Ottawa (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Remarks on Interpolation and Recognition Using Neural Nets

Sontag, Eduardo D.

Neural Information Processing SystemsDec-31-1991

We consider different types of single-hidden-Iayer feedforward nets: with or without direct input to output connections, and using either threshold orsigmoidal activation functions. The main results show that direct connections in threshold nets double the recognition but not the interpolation power,while using sigmoids rather than thresholds allows (at least) doubling both. Various results are also given on VC dimension and other measures of recognition capabilities.

artificial intelligence, direct connection, machine learning, (16 more...)

Neural Information Processing Systems

Country: