Hadoop in the Enterprise: Architecture, 1st Edition by Unknown

Hadoop in the Enterprise: Architecture, 1st Edition by Unknown

Author:Unknown
Language: eng
Format: epub, mobi, pdf
Published: 0101-01-01T00:00:00+00:00


Quorums

During our excursion through Hadoop high availability, we will encounter the

concept of a quorum. When dealing with distributed systems, a common

problem is how to ensure that processes running on different machines agree

on the value of something in the presence of faults and failures, the so-called

problem of consensus. There has been a lot written in the literature about how

to achieve distributed consensus: you may have heard of algorithms such as

Paxos, Raft and ZooKeeper’s very own Zab. We will not go into detail about how these algorithms work here but the enterprise architect should be familiar

with the basics.

Note

If the reader is interested in this topic in detail, the “Distributed Systems”

section of Martin Kleppmann’s Designing Data-Intensive Applications is highly recommended.

In general, in order to achieve consensus in distributed systems and for that

system to support consistency (i.e. everyone is guaranteed to agree on the same

value either immediately or at some point in the future), a majority of machines

in a quorum need to agree on a value using one of the algorithms mentioned

above a the majority of machines needs to be maintained for the system to be

considered consistent.

What this all means in practice is that a quorum should consist of an odd

number of machines and that the maximum number of node failures in a quorum

of n nodes for it to remain operational is

. While

quorums can have even numbers, it would not increase the resiliency: quorums

of both 5 and 6 nodes can tolerate only 2 node failures.

In Hadoop, achieving consensus between master services is key to enabling

consistent writes and reads of metadata and are critical to services such as

ZooKeeper and HDFS HA. As we will see, other services rely on ZooKeeper

and its reliable consensus for distributed locking and configuration storage.

Load Balancing

A key requirement in any high availability setup is the ability to seamlessly and

automatically switch between instances of a service running on different nodes.

This is especially important for services which have no inbuilt mechanism for

providing HA. As clients we want some or all of the following:

To access a service via a single logical endpoint

To ultimately connect to an instance that is running and healthy

To transparently open and maintain sessions with the service

To connect to a node that is not overwhelmed with users

To automatically switch between backend service providers in the event

of failure

Step in load balancing. Load balancers act as a proxy in front of a service and,

depending on the implementation, address one or more of the above

requirements. So what are the different forms of load balancing we can use?

DNS Round Robin

The easiest form of load balancing to achieve is DNS round robin2. When clients connect to a network service they first lookup the IP address from the

DNS server (or its own local definitions) using the service hostname. DNS

servers can be configured to cycle through or return a random entry from a list

of configured IP addresses. The process is illustrated in Figure Figure 3-1:

clients query the DNS server for the IP for a hostname (1), the server response

with one of the list of configured IPs (2) and the client contacts the supplied IP

(3).



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Popular ebooks
Whisky: Malt Whiskies of Scotland (Collins Little Books) by dominic roskrow(73919)
What's Done in Darkness by Kayla Perrin(26960)
The Ultimate Python Exercise Book: 700 Practical Exercises for Beginners with Quiz Questions by Copy(20859)
De Souza H. Master the Age of Artificial Intelligences. The Basic Guide...2024 by Unknown(20612)
D:\Jan\FTP\HOL\Work\Alien Breed - Tower Assault CD32 Alien Breed II - The Horror Continues Manual 1.jpg by PDFCreator(20538)
The Fifty Shades Trilogy & Grey by E L James(19460)
Shot Through the Heart: DI Grace Fisher 2 by Isabelle Grey(19381)
Shot Through the Heart by Mercy Celeste(19242)
Wolf & Parchment: New Theory Spice & Wolf, Vol. 10 by Isuna Hasekura and Jyuu Ayakura(17388)
Python GUI Applications using PyQt5 : The hands-on guide to build apps with Python by Verdugo Leire(17356)
Peren F. Statistics for Business and Economics...Essential Formulas 3ed 2025 by Unknown(17188)
Wolf & Parchment: New Theory Spice & Wolf, Vol. 03 by Isuna Hasekura and Jyuu Ayakura & Jyuu Ayakura(17099)
Wolf & Parchment: New Theory Spice & Wolf, Vol. 01 by Isuna Hasekura and Jyuu Ayakura & Jyuu Ayakura(16713)
The Subtle Art of Not Giving a F*ck by Mark Manson(14831)
The 3rd Cycle of the Betrayed Series Collection: Extremely Controversial Historical Thrillers (Betrayed Series Boxed set) by McCray Carolyn(14443)
Stepbrother Stories 2 - 21 Taboo Story Collection (Brother Sister Stepbrother Stepsister Taboo Pseudo Incest Family Virgin Creampie Pregnant Forced Pregnancy Breeding) by Roxi Harding(14219)
Cozy crochet hats: 7 Stylish and Beginner-Friendly Patterns from Baby Beanies to Trendy Bucket Hats by Vanilla Lazy(13504)
Scorched Earth by Nick Kyme(13096)
Reichel W. Numerical methods for Electrical Engineering, Meteorology,...2022 by Unknown(12980)
Drei Generationen auf dem Jakobsweg by Stein Pia(11259)