Some Key concepts used in HDFS - MyPythonGuru

Jobs Search Portal and Learning point for Python,Data Science,AI,ML, Cloud and latest technologies.

Follow us on Facebook

Post Top Ad

Your Ad Spot

Thursday, August 22, 2019

Some Key concepts used in HDFS


Below are the important link for detailed description of topics covered in Audio:

1. The following documents describe how to install and set up a Hadoop cluster:



4.Secondary NameNode: The secondary NameNode merges the fsimage and the edits log files periodically and keeps edits log size within a limit. It is usually run on a different machine than the primary NameNode since its memory requirements are on the same order as the primary NameNode.

5.CheckPoint Node and Backup Node: The Checkpoint node periodically creates checkpoints of the namespace

Use of a Backup node provides the option of running the NameNode with no persistent storage, delegating all responsibility for persisting the state of the namespace to the Backup node

6. Balancer: HDFS provides a tool for administrators that analyzes block placement and rebalanaces data across the DataNode.

7.Rack awarewness , Safemode, FSCK, fetchdt

Other Documents for reference:

  • Hadoop Site: The home page for the Apache Hadoop site.
  • Hadoop Wiki: The home page (FrontPage) for the Hadoop Wiki. Unlike the released documentation, which is part of Hadoop source tree, Hadoop Wiki is regularly edited by Hadoop Community.
  • FAQ: The FAQ Wiki page.
  • Hadoop JavaDoc API.
  • Hadoop User Mailing List: user[at]hadoop.apache.org.
  • Explore hdfs-default.xml. It includes brief description of most of the configuration variables available.
  • HDFS Commands Guide: HDFS commands usage.

No comments:

Post Top Ad

Your Ad Spot