Hadoop Tutorial - High Availability, Fault Tolerance & Secondary Name Node

แชร์
ฝัง
  • เผยแพร่เมื่อ 1 ธ.ค. 2024

ความคิดเห็น • 90

  • @ScholarNest
    @ScholarNest  3 ปีที่แล้ว +3

    Want to learn more Big Data Technology courses. You can get lifetime access to our courses on the Udemy platform. Visit the below link for Discounts and Coupon Code.
    www.learningjournal.guru/courses/

  • @mdshahalam3010
    @mdshahalam3010 4 ปีที่แล้ว +27

    Nothing is tough when you have a good teacher. Kudos for your work sir.

  • @mathisinav4267
    @mathisinav4267 4 ปีที่แล้ว +10

    No one, i repeat no one has explained hadoop with this perfection. A million thanks

  • @iwonazwierzynska4056
    @iwonazwierzynska4056 ปีที่แล้ว +1

    The best explanation of standby node in the Internet!!

  • @Van-pf2or
    @Van-pf2or 3 ปีที่แล้ว +1

    Crisp, Simple and Picture is what called as best teaching. You are a best tutor.

  • @labyrinth1991
    @labyrinth1991 3 ปีที่แล้ว +1

    I have gone through so many tutorials but the way you explained sir makes it so easy to understand hadoop. Thanks a lot sir!!

  • @prannoyroy5312
    @prannoyroy5312 4 ปีที่แล้ว +2

    I have become a fan of your style of teaching. Thank you, sir. 😊

  • @moehijawe555
    @moehijawe555 5 ปีที่แล้ว +3

    Really thank you for such topics,I spent a lot of time reading books but I couldn't understand anything till I watched your tutorials. big thanks

  • @arjunpandey1617
    @arjunpandey1617 4 ปีที่แล้ว

    You make things very simple to understand..... Hats off to your effort !!

  • @vinitsunita
    @vinitsunita ปีที่แล้ว

    As you described, the role of Secondary Name Node is to regularly take the checkpoint at configured interval and update the on disc FS Image by applying the editlogs that were captured in the time window when it took last checkpoint. And to further reduce the restart time of Primary Name Node, it does the same checkpoint process where it reads the on disc FS Image stored by SNN and apply the editlogs entry to create latest FS Image and store it in memory. Few questions wrt these : -
    1. Where does SNN stores the FS Image. Is it inside disc on local file system ?
    2. How does primary name node get access to that Secondary NN ?

  • @maliknauman3566
    @maliknauman3566 3 ปีที่แล้ว

    Excellent explanation Sir, Hat's off.

  • @pubgkiller2903
    @pubgkiller2903 2 ปีที่แล้ว

    You are the best teacher.. Thanks a lot

  • @swatikorade5251
    @swatikorade5251 2 ปีที่แล้ว

    i learn HDFS from last 7 days but still my concepts are not clear..but today i watched your video i am clear with everything...thank you

  • @ramswaroop1520
    @ramswaroop1520 2 ปีที่แล้ว

    What an Explanation 🙏🙏🙏🙏🙏🙏❤️❤️❤️❤️❤️❤️❤️

  • @bgsuresh0
    @bgsuresh0 5 ปีที่แล้ว

    Your explanation is very clear thank you. Kindly keep update the new videos.

  • @rohitbhagwat3031
    @rohitbhagwat3031 3 ปีที่แล้ว

    Great Work sir. Thanx for video.

  • @kamalprajapati9955
    @kamalprajapati9955 หลายเดือนก่อน

    Good tutorial. Thank you for your efforts

  • @worthwatchingeslam
    @worthwatchingeslam 7 ปีที่แล้ว +2

    Great explanation, thanks for your efforts :)

  • @kanmanik5674
    @kanmanik5674 4 ปีที่แล้ว

    Very good tutorial. Easy to understand.

  • @eldos11
    @eldos11 5 ปีที่แล้ว +1

    This was beautiful! Thank you.

  • @mahalaxmanraochappedi4690
    @mahalaxmanraochappedi4690 6 ปีที่แล้ว

    Presentation and explanation was excellent..

  • @NilanshuSharma1
    @NilanshuSharma1 5 ปีที่แล้ว +1

    Thanks. It's very clear. Piece of advice for viewers: These tutorials can easily be watched in 2x speed.

  • @anilkumar-dp1jk
    @anilkumar-dp1jk 6 ปีที่แล้ว +2

    Awesome Sir ..Thank You

  • @aasthajain6148
    @aasthajain6148 5 ปีที่แล้ว

    Awesome sir...great explanation👌👌

  • @tusharmayekar4649
    @tusharmayekar4649 2 ปีที่แล้ว

    It was very good information.

  • @kumarpolisetty3048
    @kumarpolisetty3048 4 ปีที่แล้ว

    Really nice explanation. If you can start practical implementation of one POC with end to end project , it will be very useful for all of us. Thanks for your efforts and time.

  • @wajay2006
    @wajay2006 5 ปีที่แล้ว

    Very good Tutorial. Only thing I want to say is fsimage is not only in memory but also stored on disk. Please excuse me if I am not correct on this point.

  • @anandansubash
    @anandansubash 7 ปีที่แล้ว

    Thanks for your clear explanation. Awesome!

  • @BhimSella
    @BhimSella 4 ปีที่แล้ว

    Thanks for the detailed explanation.

  • @dineshshinkar2163
    @dineshshinkar2163 6 ปีที่แล้ว

    Simple and superb explained

  • @aks8989
    @aks8989 5 ปีที่แล้ว

    Very nice explanation!

  • @tarunrey619
    @tarunrey619 7 ปีที่แล้ว

    Explanation was clear.
    I have few questions ?
    1)while setting cluster using Hadoop 2,Initially how will zookeeper elects the leader among the namenodes?
    2)Can you explain the funcitonality of failcontrollers of namenode?

    • @joker-cy6qo
      @joker-cy6qo 4 ปีที่แล้ว

      Bro i would love to answer
      When u setup a new cluster the NN will be the active NN which u have selected to be a NN
      AND
      Later if it fails the zkfc(zookeeper failover controller ) is responsible for making standby node as a active node
      Hope this will help u

    • @joker-cy6qo
      @joker-cy6qo 4 ปีที่แล้ว

      When u set up a new cluster the active namenode will be the one which you selected and if NN goes down the zookeeper will work here the demand of zookeeper ZKFC which stands for zookeeper failover and it is responsible for making standby namenode active namenode

  • @surabhibtech
    @surabhibtech 3 ปีที่แล้ว

    very useful explaination

  • @rajivraghu9857
    @rajivraghu9857 6 ปีที่แล้ว

    Very nicely explained.

  • @renukaasodaria494
    @renukaasodaria494 7 ปีที่แล้ว

    very nice.I could not understand too much about secondary name node but will try to understand it.

    • @ScholarNest
      @ScholarNest  7 ปีที่แล้ว

      Why? is it because the explanation is not clear? You can ask your doubts if there are any?

    • @renukaasodaria494
      @renukaasodaria494 7 ปีที่แล้ว

      no explantn is so nice but my fsimages nd editlog is not clear so

    • @renukaasodaria494
      @renukaasodaria494 7 ปีที่แล้ว

      nd thank u very much

  • @prometeo34
    @prometeo34 5 ปีที่แล้ว

    Great Tutorial..thanks for sharing

  • @avikthedrummer
    @avikthedrummer ปีที่แล้ว

    Hello! Is there any ppt format of this video? Need to explain students.. the representation is superb

  • @mrionutube
    @mrionutube 5 ปีที่แล้ว

    Thank you for this excellent tutorial. I am new to this topic and all the tutorials or blogs I went through, did not put up a clear picture of what is happening with Checkpoint process of SNN and that of NN too. So, can you please confirm my understanding about this topic (Related to NON HA mode) ?...
    1) After every Checkpoint run, SNN clears the Edit Log on Name Node as well? So at any time, Edit log on NN has data only since the last Checkpoint run on SNN.
    2) fsimage of the NN gets updated automatically in real time (i.e as and when changes are made to the file system). Which means , Name Node always has latest fsimage in its memory at all times.
    3) At any given time fsimage on the Secondary Name Node holds file system image updated as of last Checkpoint run.
    4) After a reboot, Name Node picks up the fsimage from the "Secondary Name Node" and the Edit Log from NN local disc and merges them to create new fsimage file which is up to date with all changes as of then.

  • @sridharthogaru4403
    @sridharthogaru4403 7 ปีที่แล้ว

    great and clear explanation thanks.

  • @shramandas2721
    @shramandas2721 2 ปีที่แล้ว

    Why cant we dump the fsimage directly to disk during restarting of the NameNode . After restarting it can read the fsimage and then push it to memory it will be faster.

  • @sureshm6906
    @sureshm6906 5 ปีที่แล้ว

    Very informative. Thanks

  • @amansehgal9917
    @amansehgal9917 7 ปีที่แล้ว +1

    Highly recommended for anyone who wishes to learn about how fault tolerance is managed in HDFS.
    In addition to this, I've a question: Are block recovery, lease recovery and pipeline recovery done in addition to the methods describe in video for fault tolerance or these are done at deeper level of the described methods?

  • @maheshkumar3657
    @maheshkumar3657 5 ปีที่แล้ว

    nice tutorial sir

  • @aneksingh4496
    @aneksingh4496 5 ปีที่แล้ว

    can u make a video why RDD is immutable and what would have happened had it not been immutable

  • @krishnap6035
    @krishnap6035 5 ปีที่แล้ว

    Good lecture.

  • @worthwatchingeslam
    @worthwatchingeslam 7 ปีที่แล้ว

    Great work, I have 2 questions.
    -Regarding the checkpoint activity does the secondary NN keeps the "on Disk FS" Image on it's local HD or is it on the Active NN HD ?
    -and the hour between each checkpoint is it configurable?

  • @sujitunim
    @sujitunim 6 ปีที่แล้ว

    Awesome tutorial

  • @Sridevi-ht9nj
    @Sridevi-ht9nj 7 ปีที่แล้ว

    very well explained

  • @ravikoganti227
    @ravikoganti227 7 ปีที่แล้ว

    Great explanation

  • @aakashpatel1003
    @aakashpatel1003 6 ปีที่แล้ว

    very good video

  • @coolguy171182
    @coolguy171182 6 ปีที่แล้ว +1

    Sir, What will happen, if the DN-1 is slow, and it does not send heartbeat as fast as compared to other nodes. If NN then thought that DN-1 is down and started replicating the data on different node say DN-2 and during replicating the data the DN-1's heartbeat reached to NN. Will it stop replicating the data on DN-2?

    • @ScholarNest
      @ScholarNest  6 ปีที่แล้ว +2

      +Pranav Wagde, I think it is hypothetical question. Either I get the heartbeat within expected interval or I don't. There is no concept of slow heartbeat. If NN realized that the block is under replicated, it will make more replicas to fix it. There is no concept of stopping in between. Later when NN realizes that block is over replicated, it will fix that also by throwing away some replicas.

    • @coolguy171182
      @coolguy171182 6 ปีที่แล้ว

      Thanks for the explanation. Understood the concept.

  • @dhirenmistry167
    @dhirenmistry167 6 ปีที่แล้ว

    Superb...Thank you so much

  • @sharathkalaallapuram5941
    @sharathkalaallapuram5941 6 ปีที่แล้ว

    Sir great explation sir. I have a dout sir 1)how to install cloudera without internet sir & and what is parcel method and packeges method.

  • @prasadkv9936
    @prasadkv9936 5 ปีที่แล้ว

    Thanks. could you please explain how to create Cloudera cluster as now a days many clients are prefer cloudera instead of Hortonworks..

  • @nguyen4so9
    @nguyen4so9 7 ปีที่แล้ว

    Excellent !

  • @sk-vs9nt
    @sk-vs9nt 6 ปีที่แล้ว

    it was clear about topic thank you so much , can you show with example

  • @nitskrishna
    @nitskrishna 7 ปีที่แล้ว

    nicely explained

  • @nagamanickam6604
    @nagamanickam6604 3 ปีที่แล้ว

    Thank you sir

  • @nileshkharat1188
    @nileshkharat1188 3 ปีที่แล้ว

    Why there's an odd no. Of JN 3 or 5??
    What's the reason behind that

  • @sabyasachiprasad8929
    @sabyasachiprasad8929 6 ปีที่แล้ว

    Nice One

  • @enriquewilliams8676
    @enriquewilliams8676 2 ปีที่แล้ว

    Good.

  • @rjlifeandtech8675
    @rjlifeandtech8675 5 ปีที่แล้ว +1

    1. zookeeper election
    2. split-brain concepts
    3. Hadoop 3, erasure coding and storage policies
    Could you please explain all above

  • @pc0riginal870
    @pc0riginal870 5 ปีที่แล้ว

    Thank you so much ...

  • @aniketamrutkar
    @aniketamrutkar 6 ปีที่แล้ว

    Awesome

  • @iotmails9519
    @iotmails9519 4 ปีที่แล้ว

    Can we have multiple replication factor for multiple tenants?

    • @ScholarNest
      @ScholarNest  4 ปีที่แล้ว

      You can have it at the topic level and I guess all Tanents of the cluster are not going to share the topics. So, answer is a Yes.

  • @testingmakeseasy
    @testingmakeseasy 7 ปีที่แล้ว +1

    please upload some hive and pig related videos ..

    • @ScholarNest
      @ScholarNest  7 ปีที่แล้ว +1

      sure, maybe in a month.

  • @VishalYadav-lw2ky
    @VishalYadav-lw2ky 7 ปีที่แล้ว

    Can we make a single node for both NameNode and as a Secondary NameNode..?

    • @ScholarNest
      @ScholarNest  7 ปีที่แล้ว +1

      Yes, we can. However, we don't do it in production.

    • @VishalYadav-lw2ky
      @VishalYadav-lw2ky 7 ปีที่แล้ว

      okk....thanks

  • @fasiuddin1874
    @fasiuddin1874 6 ปีที่แล้ว

    helpful

  • @navinnayak007
    @navinnayak007 7 ปีที่แล้ว

    how fsimage file and editlog file communicate each other?

    • @karthiknedunchezhiyan7935
      @karthiknedunchezhiyan7935 6 ปีที่แล้ว

      fsimage will not communicate with editlog but during checkpointing process new fsimage will be created by merging old fsimage with new editlog

  • @Deshammanideep
    @Deshammanideep 5 ปีที่แล้ว +2

    Hadoop is very fault tolerant. The only point of failure can be Maharashtra State Electricity Board.

    • @justACatOnYoutube
      @justACatOnYoutube 4 ปีที่แล้ว

      Lol! You can keep backup in Inverters.. Its not costly.

  • @joker-cy6qo
    @joker-cy6qo 4 ปีที่แล้ว

    1.75 x

  • @uddisasuresh9264
    @uddisasuresh9264 6 ปีที่แล้ว

    Great explanation