Hadoop in the Enterprise: Architecture, 1st Edition by Unknown
Author:Unknown
Language: eng
Quorums
During our excursion through Hadoop high availability, we will encounter the
concept of a quorum. When dealing with distributed systems, a common
problem is how to ensure that processes running on different machines agree
on the value of something in the presence of faults and failures, the so-called
problem of consensus. Much has been written about how to achieve distributed
consensus: you may have heard of algorithms such as Paxos, Raft, and
ZooKeeper’s very own Zab. We will not go into detail about how these
algorithms work here, but the enterprise architect should be familiar with
the basics.
Note
If the reader is interested in this topic in detail, the “Distributed Systems”
section of Martin Kleppmann’s Designing Data-Intensive Applications is highly recommended.
In general, for a distributed system to achieve consensus and to support
consistency (i.e. everyone is guaranteed to agree on the same value either
immediately or at some point in the future), a majority of the machines in a
quorum need to agree on a value using one of the algorithms mentioned above,
and that majority needs to be maintained for the system to be considered
consistent.
What this all means in practice is that a quorum should consist of an odd
number of machines and that the maximum number of node failures a quorum
of n nodes can suffer and remain operational is $\lfloor (n-1)/2 \rfloor$.
While a quorum can have an even number of machines, the extra machine does
not increase resiliency: quorums of both 5 and 6 nodes can tolerate only 2
node failures.
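The arithmetic behind this rule can be checked with a few lines of Python. This is only a sketch: the function names `majority` and `tolerated_failures` are illustrative and do not come from any Hadoop or ZooKeeper API. It assumes the usual definition of a majority of n nodes as more than half of them.

```python
def majority(n: int) -> int:
    """Smallest number of nodes that is more than half of n."""
    return n // 2 + 1


def tolerated_failures(n: int) -> int:
    """Most nodes that can fail while a majority of n is still running."""
    return n - majority(n)  # same value as (n - 1) // 2


# Print quorum size, required majority, and tolerated failures.
for n in range(3, 8):
    print(f"nodes={n} majority={majority(n)} tolerated={tolerated_failures(n)}")
```

Going from 5 to 6 nodes raises the required majority from 3 to 4 but leaves the number of tolerated failures at 2, which is why odd quorum sizes are preferred.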
In Hadoop, achieving consensus between master services is key to enabling
consistent writes and reads of metadata and is critical to services such as
ZooKeeper and HDFS HA. As we will see, other services rely on ZooKeeper
and its reliable consensus for distributed locking and configuration storage.
Load Balancing
A key requirement in any high availability setup is the ability to seamlessly and
automatically switch between instances of a service running on different nodes.
This is especially important for services which have no inbuilt mechanism for
providing HA. As clients we want some or all of the following:
To access a service via a single logical endpoint
To ultimately connect to an instance that is running and healthy
To transparently open and maintain sessions with the service
To connect to a node that is not overwhelmed with users
To automatically switch between backend service providers in the event
of failure
This is where load balancing steps in. Load balancers act as a proxy in front of a service and,
depending on the implementation, address one or more of the above
requirements. So what are the different forms of load balancing we can use?
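Before looking at specific forms, it may help to see how a few of the requirements listed earlier translate into selection logic. The following is a minimal sketch, not the behavior of any particular load balancer: the `Backend` class, the `pick_backend` function, and the addresses are all hypothetical, and health status is assumed to be set by some external check.

```python
from dataclasses import dataclass


@dataclass
class Backend:
    address: str              # hypothetical backend address
    healthy: bool = True      # assumed to be updated by a health check
    active_sessions: int = 0  # current number of client sessions


def pick_backend(backends):
    """Return the healthy backend with the fewest active sessions."""
    candidates = [b for b in backends if b.healthy]
    if not candidates:
        raise RuntimeError("no healthy backend available")
    return min(candidates, key=lambda b: b.active_sessions)


pool = [
    Backend("10.0.0.1", active_sessions=12),
    Backend("10.0.0.2", active_sessions=3),
    Backend("10.0.0.3", healthy=False),
]
chosen = pick_backend(pool)
print(chosen.address)
```

Clients only ever call `pick_backend`, which stands in for the single logical endpoint. Unhealthy instances are skipped, and the least loaded healthy instance is preferred.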
DNS Round Robin
The easiest form of load balancing to achieve is DNS round robin. When
clients connect to a network service, they first look up the IP address for
the service hostname from the DNS server (or from their own local
definitions). DNS servers can be configured to cycle through, or return a
random entry from, a list of configured IP addresses. The process is
illustrated in Figure 3-1: clients query the DNS server for the IP address of
a hostname (1), the server responds with one of the configured IPs (2), and
the client contacts the supplied IP (3).
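The three steps can be imitated with a toy resolver. This sketch simulates only the cycling behavior described above and is not a real DNS implementation. The `RoundRobinResolver` class, the hostname, and the addresses (taken from the 192.0.2.0/24 documentation range) are assumptions made for illustration.

```python
from itertools import cycle


class RoundRobinResolver:
    """Toy stand-in for a DNS server that cycles through configured IPs."""

    def __init__(self, records):
        # One endless iterator per hostname over its configured IP list.
        self._cycles = {host: cycle(ips) for host, ips in records.items()}

    def resolve(self, hostname):
        # Steps 1 and 2: the client asks, the server answers with the next IP.
        return next(self._cycles[hostname])


resolver = RoundRobinResolver(
    {"service.example.com": ["192.0.2.10", "192.0.2.11", "192.0.2.12"]}
)

# Step 3 would be the client connecting to the returned address.
answers = [resolver.resolve("service.example.com") for _ in range(4)]
print(answers)
```

Note that the resolver in this sketch knows nothing about whether the instance behind each address is running. It hands out the next address in the list regardless.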