Architecting HBase Applications by Jean-Marc Spaggiari and Kevin O'Dell
Author:Jean-Marc Spaggiari, Kevin O'Dell
Language: eng
Format: epub
Publisher: O'Reilly Media, Inc.
Published: 2015-08-25T04:00:00+00:00
Warning
The reducer that creates the HFiles must load all the columns of a single row into memory and sort them before it can write them out. If a row in your dataset has many columns, it might not fit into memory. This should be fixed in a future release, once HBASE-13897 is implemented.
The operations on the mapper side are simple. The goal is just to split each line into its fields, assign them to an Avro object, and hand this Avro object to the HBase framework to be stored in HFiles that are ready to be loaded.
The first thing we do is define a set of variables that we will reuse for every iteration of the mapper. This reduces the number of objects created.
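The reuse pattern described above can be sketched in plain Java. This is only an illustration under stated assumptions, not the book's mapper: the class `ReusingLineMapper`, the `Event` record, its four fields, and the comma separator are all hypothetical, and `Event` stands in for the Avro object so that the sketch runs without Hadoop, HBase, or Avro on the classpath. In a real job, the `map` logic would live inside a Hadoop `Mapper` subclass.

```java
import java.util.regex.Pattern;

public class ReusingLineMapper {

    /** Hypothetical stand-in for the Avro record; the field names are assumptions. */
    public static final class Event {
        public String id;
        public long timestamp;
        public String type;
        public String payload;
    }

    // Created once and reused for every input line, instead of being
    // allocated again on each call to map().
    private static final Pattern SEPARATOR = Pattern.compile(",");
    private final Event event = new Event();

    /**
     * Splits one input line into its fields and copies them into the
     * reused Event instance. The returned object is overwritten by the
     * next call, so callers must consume it before mapping another line.
     */
    public Event map(String line) {
        // The limit of 4 keeps any extra commas inside the payload field.
        String[] fields = SEPARATOR.split(line, 4);
        if (fields.length != 4) {
            throw new IllegalArgumentException("Expected 4 fields: " + line);
        }
        event.id = fields[0];
        event.timestamp = Long.parseLong(fields[1]);
        event.type = fields[2];
        event.payload = fields[3];
        return event;
    }

    public static void main(String[] args) {
        ReusingLineMapper mapper = new ReusingLineMapper();
        String[] lines = {
            "sensor-1,1000,ALERT,overheat",
            "sensor-2,2000,INFO,ok"
        };
        for (String line : lines) {
            Event e = mapper.map(line);
            System.out.println(e.id + " " + e.timestamp + " " + e.type + " " + e.payload);
        }
    }
}
```

The trade-off is that the returned object is shared across calls: it saves one allocation per line, but a caller that needs to keep a record beyond the current iteration has to copy it first. The split itself still allocates a small array per line, so this sketch reduces object creation without eliminating it.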
