Big Data Glossary by Pete Warden

Big Data Glossary by Pete Warden

Author:Pete Warden [Pete Warden]
Language: eng
Format: epub, pdf
Tags: COMPUTERS / Data Modeling & Design
ISBN: 9781449314583
Publisher: O'Reilly Media
Published: 2011-09-13T16:00:00+00:00


Kafka

Kafka is a comparatively new project for sending large numbers of events from producers to consumers. Originally built to connect LinkedIn’s website with its backend systems, it’s somewhere between S4 and Flume in its functionality. Unlike S4, it’s persistent and offers more safeguards for delivery than Yahoo!’s UDP-based system, but it tries to retain its distributed nature and low latency. It can be used in a very similar way to Flume, keeping its high throughput, but with a more flexible system for creating multiple clients and an underlying architecture that’s more focused on parallelization. Kafka relies on ZooKeeper to keep track of its distributed processing.



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.