Data Engineering Mock Interview | Spark Optimization Interview Questions | Best Coding Practices

  • Published on 20 Mar 2024
  • To enhance your career as a Cloud Data Engineer, check trendytech.in/?src=youtube&su... for curated courses developed by me.
    I have trained 20,000+ professionals in the field of Data Engineering in the last 5 years.
    Want to master SQL? Learn SQL the right way through the most sought-after course, the SQL Champions Program!
    "An 8-week program designed to help you crack the interviews of top product-based companies by developing a thought process and an approach to solving an unseen problem."
    Here is how you can register for the program:
    Registration link (course access from India): rzp.io/l/SQLINR
    Registration link (course access from outside India): rzp.io/l/SQLUSD
    30 INTERVIEWS IN 30 DAYS - BIG DATA INTERVIEW SERIES
    This mock interview series is launched as a community initiative under Data Engineers Club, aimed at aiding the community's growth and development.
    Expert guest interviewer Sachin R (/ sachin-r27) imparts invaluable insights and practical advice derived from extensive experience.
    Suman Basu (/ basusuman23), a skilled guest interviewee, showcases an exceptional approach to answering interview questions.
    Links to the free SQL and Python series developed by me are given below:
    SQL Playlist - • SQL tutorial for every...
    Python Playlist - • Complete Python By Sum...
    Don't miss out - Subscribe to the channel for more such informative interviews and unlock the secrets to success in this thriving field!
    Social Media Links :
    LinkedIn - / bigdatabysumit
    Twitter - / bigdatasumit
    Instagram - / bigdatabysumit
    Student Testimonials - trendytech.in/#testimonials
    Discussed Questions : Timestamp
    1:37 Introduction
    2:50 Brief about your project responsibilities
    5:26 Discuss SQL code documentation best practices for ensuring query efficiency.
    9:56 What are transformations and actions in PySpark DataFrames?
    10:35 What are the best practices you have followed specific to PySpark?
    12:39 What is the difference between cache and persist?
    13:33 Explain the concept of partitioning.
    14:58 When allocating multiple worker nodes/executors, how do you increase or decrease the number of partitions?
    16:38 Which is more effective in avoiding data skewness: repartition or coalesce? What is data skewness?
    18:07 Coding questions
    36:20 Dealing with data quality issues
    38:30 After fetching data from CSV files, how would you define the schema?
    41:00 Preferred file format for data loading.
    Tags
    #mockinterview #bigdata #career #dataengineering #data #datascience #dataanalysis #productbasedcompanies #interviewquestions #apachespark #google #interview #faang #companies #amazon #walmart #flipkart #microsoft #azure #databricks #jobs

Comments • 19

  • @isenhiem
    @isenhiem several months ago +1

    This is such an amazing initiative... While watching the video I felt as if I was being interviewed... I can't stress enough how helpful this will be for so many people. It gave me a very good idea of the level of my preparation. Thanks a lot, and I hope you will create more videos like this.

  • @Vlogs..573
    @Vlogs..573 4 months ago +3

    Sachin is really knowledgeable, and he is also helping Suman answer the questions.

    • @sumitmittal07
      @sumitmittal07 4 months ago +1

      yes both have been great. Kudos to Sachin & Suman.

  • @sharankarthick3364
    @sharankarthick3364 2 months ago

    Informative!

  • @prannay19
    @prannay19 4 months ago +2

    Great initiative. Thank you Sumit Sir 🙏. Looking forward to more such videos. Keep up the good work 👍

  • @user-ji9ke8yb2d
    @user-ji9ke8yb2d 4 months ago +2

    Thank you so much, Sumit sir. Really a great initiative.

    • @sumitmittal07
      @sumitmittal07 4 months ago +1

      thank you very much

  • @user-oy9cc8dv8i
    @user-oy9cc8dv8i several months ago

    If possible, also mention which experience level these interviews are targeting (for example, fresher, 1 year, or 3 years of experience).

  • @AliKhanLuckky
    @AliKhanLuckky 4 months ago +3

    36:03
    1. He is asking only for the highest salary.
    2. Department-wise highest salary.
    Use SQL code as follows:
    1. select max(salary) from emp;
    2. select dept, max(salary) from emp group by dept;
    As simple as that. He did not ask you to write a window function; if he asks, then do it 😊

    • @sriharidhanakshirur9245
      @sriharidhanakshirur9245 4 months ago +1

      In case 1, we should use a window function, because we need to print id and name as well.

    • @AliKhanLuckky
      @AliKhanLuckky 4 months ago

      @sriharidhanakshirur9245 In this case you can use a subquery as well. If anyone explicitly asks whether there is another way, or asks you to do it using window functions, then the interviewer will be impressed 😊
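
The approaches discussed in this thread (plain aggregate, group by, subquery, window function) can be compared side by side. The following is only a minimal sketch, not the solution given in the video: it assumes a hypothetical emp table with id, name, dept, and salary columns (the comments mention only emp, dept, and salary), uses invented sample rows, and runs the SQL through Python's built-in sqlite3 module. Window functions need SQLite 3.25 or newer.

```python
import sqlite3

# In-memory database with a hypothetical emp table and invented sample rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (id INTEGER, name TEXT, dept TEXT, salary INTEGER)")
conn.executemany(
    "INSERT INTO emp VALUES (?, ?, ?, ?)",
    [
        (1, "Asha", "eng", 120),
        (2, "Bala", "eng", 150),
        (3, "Chitra", "ops", 90),
        (4, "Dev", "ops", 110),
    ],
)

# 1. Highest salary overall: a plain aggregate, returns the value only.
overall_max = conn.execute("SELECT MAX(salary) FROM emp").fetchone()[0]

# 2. Highest salary per department: aggregate with GROUP BY.
dept_max = conn.execute(
    "SELECT dept, MAX(salary) FROM emp GROUP BY dept ORDER BY dept"
).fetchall()

# 3. id and name of the overall top earner, using a subquery.
top_by_subquery = conn.execute(
    "SELECT id, name, salary FROM emp "
    "WHERE salary = (SELECT MAX(salary) FROM emp)"
).fetchall()

# 4. id and name of the top earner in each department, using a window function.
#    DENSE_RANK keeps ties: every employee sharing the top salary gets rank 1.
top_by_window = conn.execute(
    """
    SELECT id, name, dept, salary
    FROM (
        SELECT id, name, dept, salary,
               DENSE_RANK() OVER (PARTITION BY dept ORDER BY salary DESC) AS rnk
        FROM emp
    )
    WHERE rnk = 1
    ORDER BY dept
    """
).fetchall()

print(overall_max)
print(dept_max)
print(top_by_subquery)
print(top_by_window)
```

The aggregate queries are enough when only the salary value is wanted; the subquery and window-function forms become useful once other columns such as id and name must be returned alongside it.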

  • @DataJourneyHuub
    @DataJourneyHuub 4 months ago

    Thank you Sumit Sir

    • @sumitmittal07
      @sumitmittal07 4 months ago +2

      you are welcome

  • @crunchyworks6374
    @crunchyworks6374 4 months ago +3

    Sir, as I have seen over the last 3 days, the cloud tech you use every time is Azure only. Please cover AWS too; it would be very helpful.

    • @sumitmittal07
      @sumitmittal07 4 months ago +1

      definitely, you will see a lot of variety

  • @RohitSharma-ny1oq
    @RohitSharma-ny1oq 4 months ago +1

    Please increase the complexity of the interviews a little, because actual interviews are more complex 😊

    • @sumitmittal07
      @sumitmittal07 4 months ago

      candidates mostly get stuck in basic fundamentals. These are actual people who conduct interviews in companies.

  • @IsmailKhan-jy9ew
    @IsmailKhan-jy9ew 4 months ago

    Thank you, Sumit sir, for this initiative.