Computational Analysis and Understanding of Natural Languages: Principles, Methods and Applications by Venkat N. Gudivada & C.R. Rao
Author:Venkat N. Gudivada & C.R. Rao
Language: eng
Format: epub
ISBN: 9780444640437
Publisher: Elsevier Ltd.
Published: 2018-08-27T16:00:00+00:00
5.1 Parameter Norm Penalties
Linear models such as linear regression and logistic regression are straightforward and effective regularization strategies which have been used prior to the advent of DL.
As used in many regularization approaches, by adding a parameter norm penalty to the objective function, the capacity of the model becomes limited. In these approaches, training algorithm minimizes both original objective function on the training data and some measures of the size of a single parameter or a subset of the parameters. Here, we briefly discuss the effect of different norms on the model parameters as they are used as penalties. It should be noted that for the neural networks, typically we use a parameter norm penalty that penalizes only the weights of the affine transformation at each layer. Since the biases basically require less data to fit accurately in comparison with the weights, we leave the biases unregularized. The reason is that weights indicate the interaction between two variables whereas the biases control only a single variable. Moreover, regularizing the biases can result in a significant amount of underfitting.
Considering different penalties for the layers of a neural network may be useful, however, it makes the computations more expensive and using same α coefficients for all layers is still reasonable.
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Biomathematics | Differential Equations |
Game Theory | Graph Theory |
Linear Programming | Probability & Statistics |
Statistics | Stochastic Modeling |
Vector Analysis |
Modelling of Convective Heat and Mass Transfer in Rotating Flows by Igor V. Shevchuk(6201)
Weapons of Math Destruction by Cathy O'Neil(5779)
Factfulness: Ten Reasons We're Wrong About the World – and Why Things Are Better Than You Think by Hans Rosling(4454)
Descartes' Error by Antonio Damasio(3139)
A Mind For Numbers: How to Excel at Math and Science (Even If You Flunked Algebra) by Barbara Oakley(3076)
Factfulness_Ten Reasons We're Wrong About the World_and Why Things Are Better Than You Think by Hans Rosling(3025)
TCP IP by Todd Lammle(2982)
Applied Predictive Modeling by Max Kuhn & Kjell Johnson(2858)
Fooled by Randomness: The Hidden Role of Chance in Life and in the Markets by Nassim Nicholas Taleb(2834)
The Tyranny of Metrics by Jerry Z. Muller(2819)
The Book of Numbers by Peter Bentley(2744)
The Great Unknown by Marcus du Sautoy(2516)
Once Upon an Algorithm by Martin Erwig(2457)
Easy Algebra Step-by-Step by Sandra Luna McCune(2435)
Lady Luck by Kristen Ashley(2386)
Practical Guide To Principal Component Methods in R (Multivariate Analysis Book 2) by Alboukadel Kassambara(2358)
Police Exams Prep 2018-2019 by Kaplan Test Prep(2334)
All Things Reconsidered by Bill Thompson III(2242)
Linear Time-Invariant Systems, Behaviors and Modules by Ulrich Oberst & Martin Scheicher & Ingrid Scheicher(2210)
