Pentaho Data Integration Cookbook Second Edition by 2013
Author:2013
Language: eng
Format: epub, mobi
Publisher: Packt Publishing
There's more...
The Fuzzy match step allows you to choose among several matching algorithms, which are classified in the following two groups:
Algorithms based on a metric distance: The comparison is based on how the compared terms are spelled
Phonetic algorithms: The comparison is based on how the compared terms sound, as read in English
The following is a brief comparative table for the implemented algorithms:
Algorithm
Classification
Explanation
Example
Levenshtein
Metric distance
The distance is calculated as the minimum edit distance that transforms one string into the other. These edits can be character insertion or deletion, or substitution of a single character.
The transformation of "pciking" into "picking" requires two changes (the c and i need to be replaced), which would be a distance of 2.
Download
Pentaho Data Integration Cookbook Second Edition by 2013.mobi
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Implementing Enterprise Observability for Success by Manisha Agrawal and Karun Krishnannair(7345)
Supercharging Productivity with Trello by Brittany Joiner(6609)
Secrets of the JavaScript Ninja by John Resig Bear Bibeault(6419)
Mastering Tableau 2023 - Fourth Edition by Marleen Meier(6368)
Inkscape by Example by István Szép(6224)
Visualize Complex Processes with Microsoft Visio by David J Parker & Šenaj Lelić(5920)
Build Stunning Real-time VFX with Unreal Engine 5 by Hrishikesh Andurlekar(4913)
Design Made Easy with Inkscape by Christopher Rogers(4601)
Customizing Microsoft Teams by Gopi Kondameda(4141)
Linux Device Driver Development Cookbook by Rodolfo Giometti(3935)
Extending Microsoft Power Apps with Power Apps Component Framework by Danish Naglekar(3732)
Business Intelligence Career Master Plan by Eduardo Chavez & Danny Moncada(3697)
Salesforce Platform Enterprise Architecture - Fourth Edition by Andrew Fawcett(3607)
Pandas Cookbook by Theodore Petrou(3586)
The Tableau Workshop by Sumit Gupta Sylvester Pinto Shweta Sankhe-Savale JC Gillet and Kenneth Michael Cherven(3388)
TCP IP by Todd Lammle(2985)
Drawing Shortcuts: Developing Quick Drawing Skills Using Today's Technology by Leggitt Jim(2911)
Applied Predictive Modeling by Max Kuhn & Kjell Johnson(2865)
Work Smarter with Microsoft OneNote by Connie Clark(2843)
