Data Augmentation with Python by Duc Haba

Author: Duc Haba
Language: eng
Format: epub
Publisher: Packt
Published: 2023-11-15T00:00:00+00:00


Figure 5.12 – Netflix keyboard augmenting

Pluto does the same for the Twitter NLP data, as follows:

# use keyboard augmentation technique
pluto.print_aug_keyboard(pluto.df_twitter_data,
  col_dest='clean_tweet',
  aug_name='Keyboard Augment')

The output is as follows:

Figure 5.13 – Twitter keyboard augmenting

The last of the three text augmentation methods is the random technique.

Random augmenting

The random character augmenter randomly inserts, deletes, substitutes, or swaps characters in the text. These four operations are the modes (actions) of the random process. The augmentation variable is defined as follows:

# define augmentation function variable
aug_func = nlpaug.augmenter.char.RandomCharAug(action=action)
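To make the four actions concrete, here is a minimal standard-library sketch of what each one does to a word. It is an illustration only and not the nlpaug implementation: the function name random_char_augment and its rate and seed parameters are hypothetical, and the rule of skipping words shorter than four characters is an assumption made to keep the output readable.

```python
import random
import string


def random_char_augment(text, action="substitute", rate=0.3, seed=None):
    """Apply one random character-level action to part of each word.

    Hypothetical helper for illustration; not nlpaug's implementation.
    """
    if action not in {"insert", "delete", "substitute", "swap"}:
        raise ValueError(f"unknown action: {action}")
    rng = random.Random(seed)  # a seed makes the result repeatable
    words = []
    for word in text.split():
        chars = list(word)
        # Assumption: leave short words alone so they stay recognizable.
        if len(chars) < 4:
            words.append(word)
            continue
        # Edit roughly `rate` of the characters, but at least one.
        n_edits = max(1, int(len(chars) * rate))
        for _ in range(n_edits):
            if len(chars) < 2:
                break
            if action == "insert":
                pos = rng.randrange(len(chars))
                chars.insert(pos, rng.choice(string.ascii_lowercase))
            elif action == "delete":
                del chars[rng.randrange(len(chars))]
            elif action == "substitute":
                pos = rng.randrange(len(chars))
                chars[pos] = rng.choice(string.ascii_lowercase)
            else:  # swap a character with its right-hand neighbor
                pos = rng.randrange(len(chars) - 1)
                chars[pos], chars[pos + 1] = chars[pos + 1], chars[pos]
        words.append("".join(chars))
    return " ".join(words)


if __name__ == "__main__":
    sample = "random augmentation changes characters"
    for act in ("insert", "delete", "substitute", "swap"):
        print(act, "->", random_char_augment(sample, action=act, seed=42))
```

Insert lengthens a word, delete shortens it, and substitute and swap keep its length, with swap also preserving the set of characters. A real augmenter such as the one above typically exposes more controls than this sketch does.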

Pluto uses the print_aug_char_random() wrapper function, with action set to insert, on the Netflix NLP data, as follows:

# use random insert augmentation technique
pluto.print_aug_char_random(pluto.df_netflix_data,
  action='insert',
  col_dest='description',
  aug_name='Random Insert Augment')

The output is as follows:


