Transfer Learning for Natural Language Processing by Paul Azunre

Author: Paul Azunre
Language: eng
Format: epub, pdf
Tags: computers, Artificial Intelligence, Natural Language Processing, Data Science, Neural Networks, Machine Learning
ISBN: 9781617297267
Google: bGI7EAAAQBAJ
Publisher: Simon and Schuster
Published: 2021-08-31


Figure 6.8 Suggested ULMFiT rate schedule for the case of 10,000 total iterations. The rate increases linearly for 10% of the total number of iterations (i.e., 1,000), up to a maximum of 0.01, and then decreases linearly afterward to 0.
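The schedule in figure 6.8 can be sketched as a short function. This is a minimal illustration of the shape described in the caption, with linear warmup over the first 10% of steps followed by linear decay to 0; the function and parameter names are mine, not from the book or the ULMFiT paper.

```python
def slanted_triangular_lr(step, total_steps=10_000, warmup_frac=0.1, lr_max=0.01):
    """Learning rate at a given step: linear increase over the first
    warmup_frac of total_steps up to lr_max, then linear decay to 0.
    A sketch of the schedule in figure 6.8; names are illustrative."""
    cut = int(total_steps * warmup_frac)  # step at which the peak is reached
    if step < cut:
        return lr_max * step / cut                          # warmup phase
    return lr_max * (total_steps - step) / (total_steps - cut)  # decay phase
```

For the figure's settings (10,000 steps), the rate peaks at 0.01 at step 1,000 and falls back to 0 by step 10,000.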

6.3.2 Target task classifier fine-tuning

In addition to techniques for fine-tuning the language model on a small dataset representing the data distribution for the new scenario, ULMFiT provides two techniques for refining the task-specific layers: concat pooling and gradual unfreezing.

At the time ULMFiT was developed, it was standard practice to pass only the hidden state of the final time step of an LSTM-based language model to the task-specific layer. The authors instead recommend concatenating this final hidden state with the max-pooled and mean-pooled hidden states across all time steps (as many as fit in memory). In the bidirectional setting, they do this separately for the forward and backward language models and average the resulting predictions. This process, which they call concat pooling, serves a similar purpose to the bidirectional language modeling approach described for ELMo.

To reduce the risk of catastrophic forgetting during fine-tuning, the authors suggest unfreezing and tuning the layers gradually rather than all at once. The process starts with the last layer, which contains the least general knowledge and is the only layer unfrozen and fine-tuned during the first epoch. In the second epoch, one additional layer is unfrozen, and the process repeats until all task-specific layers are unfrozen and fine-tuned together in the final stage of this gradual unfreezing schedule.
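The unfreezing order described above can be sketched as a schedule generator. This is an illustrative helper, not code from the book: it only computes which layer indices are trainable in each epoch, leaving the actual freezing (e.g., toggling `requires_grad` in a framework) to the training loop.

```python
def gradual_unfreeze_schedule(num_layers):
    """Yield (epoch, trainable_layer_indices) pairs: epoch 0 trains only
    the last layer, and each subsequent epoch unfreezes one more layer
    until every layer is being fine-tuned. Illustrative sketch only."""
    for epoch in range(num_layers):
        first_unfrozen = num_layers - 1 - epoch   # unfreeze from the top down
        yield epoch, list(range(first_unfrozen, num_layers))
```

For a three-layer stack, the schedule trains layer 2 alone, then layers 1-2, then all three, matching the top-down order described in the text.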

As a reminder, these techniques will be explored in the code in chapter 9, which will cover various adaptation strategies.





