Mastering NLP from Foundations to LLMs by Lior Gazit Meysam Ghaffari
Author:Lior Gazit, Meysam Ghaffari
Language: eng
Format: epub
Publisher: Packt Publishing Pvt Ltd
Published: 2024-04-04T00:00:00+00:00
Fine-tuning
You use the pretrained model as a starting point and update all or some of the modelâs parameters for your new task. In other words, you continue the training where it left off, allowing the model to adjust from generic feature extraction to features more specific to your task. Often, a lower learning rate is used during fine-tuning to avoid overwriting the prelearned features entirely during training.
Transfer learning is a powerful technique that can be used to improve the performance of ML models. It is particularly useful for tasks where there are limited labeled data available. It is commonly used in DL applications. For instance, itâs almost a standard in image classification problems where pretrained models on ImageNet, a large-scale annotated image dataset (ResNet, VGG, Inception, and so on), are used as the starting point. The features learned by these models are generic for image classification and can be fine-tuned on a specific image classification task with a smaller amount of data.
Here are some examples of how transfer learning can be used:
A model trained to classify images of cats and dogs can be used to fine-tune a model to classify images of other animals, such as birds or fish
A model trained to translate text from English to Spanish can be used to fine-tune a model to translate text from Spanish to French
A model trained to predict the price of a house can be used to fine-tune a model to predict the price of a car
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Computer Vision & Pattern Recognition | Expert Systems |
Intelligence & Semantics | Machine Theory |
Natural Language Processing | Neural Networks |
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(8255)
Test-Driven Development with Java by Alan Mellor(6379)
Data Augmentation with Python by Duc Haba(6277)
Principles of Data Fabric by Sonia Mezzetta(6054)
Hadoop in Practice by Alex Holmes(5934)
Learn Blender Simulations the Right Way by Stephen Pearson(5908)
Microservices with Spring Boot 3 and Spring Cloud by Magnus Larsson(5802)
Jquery UI in Action : Master the concepts Of Jquery UI: A Step By Step Approach by ANMOL GOYAL(5780)
RPA Solution Architect's Handbook by Sachin Sahgal(5201)
Big Data Analysis with Python by Ivan Marin(5169)
Life 3.0: Being Human in the Age of Artificial Intelligence by Tegmark Max(5099)
The Infinite Retina by Robert Scoble Irena Cronin(4887)
Pretrain Vision and Large Language Models in Python by Emily Webber(4147)
Functional Programming in JavaScript by Mantyla Dan(4015)
The Age of Surveillance Capitalism by Shoshana Zuboff(3910)
Infrastructure as Code for Beginners by Russ McKendrick(3905)
WordPress Plugin Development Cookbook by Yannick Lefebvre(3610)
Embracing Microservices Design by Ovais Mehboob Ahmed Khan Nabil Siddiqui and Timothy Oleson(3419)
Applied Machine Learning for Healthcare and Life Sciences Using AWS by Ujjwal Ratan(3394)
