Big Data by unknow
Author:unknow
Language: eng
Format: epub
Publisher: Elsevier Science & Technology
Published: 2016-06-07T00:00:00+00:00
9.5 Performance Optimization of HDFS
The distributed file system is one of the core technologies in a cloud computing platform, and it is also the current research focus. There has emerged many distributed file systems in the industry, such as the GFS [6], HDFS [4], Haystack [46], and TFS [47], wherein HDFS is an open source version of GFS. It has been researched extensively, and it has been widely used in commercial enterprises such as Yahoo!, Cloudera, and Mapr. HDFS has good expansion capability, and it can store and process massive amounts of data reliably. It can also be used for low-cost business machines and for reducing development costs. Data can be processed in parallel to improve the efficiency of the system. It can automatically maintain the data copy, and after a failure, it can automatically rearrange computing tasks. Therefore, many large enterprises use HDFS to handle massive amounts of data. However, there are still many problems seriously restricting the further development of HDFS. HDFS is optimized through many ways in academia, including modifying the underlying traditional file system of HDFS. Its modification and some improvement in high-level optimization top on HDFS. We analyze the small file performance optimization and security performance optimization in the following.
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Kathy Andrews Collection by Kathy Andrews(11772)
The remains of the day by Kazuo Ishiguro(8906)
Spare by Prince Harry The Duke of Sussex(5150)
Paper Towns by Green John(5149)
The Body: A Guide for Occupants by Bill Bryson(5044)
Industrial Automation from Scratch: A hands-on guide to using sensors, actuators, PLCs, HMIs, and SCADA to automate industrial processes by Olushola Akande(5028)
Machine Learning at Scale with H2O by Gregory Keys | David Whiting(4268)
Be in a Treehouse by Pete Nelson(4005)
Never by Ken Follett(3893)
Harry Potter and the Goblet Of Fire by J.K. Rowling(3818)
Goodbye Paradise(3778)
The Remains of the Day by Kazuo Ishiguro(3359)
Into Thin Air by Jon Krakauer(3355)
Fairy Tale by Stephen King(3317)
The Cellar by Natasha Preston(3301)
The Genius of Japanese Carpentry by Azby Brown(3265)
120 Days of Sodom by Marquis de Sade(3235)
The Man Who Died Twice by Richard Osman(3047)
Drawing Shortcuts: Developing Quick Drawing Skills Using Today's Technology by Leggitt Jim(3045)