Azure Cloud Data Engineer Mock Interview | Important Questions asked in Big Data Interviews| Pyspark

Sumit Mittal

Views: 3,868

Published on 16 May 2024
To enhance your career as a Cloud Data Engineer, check trendytech.in/?src=youtube&su... for curated courses developed by me.
I have trained more than 20,000 professionals in the field of Data Engineering in the last 5 years.
BIG DATA INTERVIEW SERIES
This mock interview series is launched as a community initiative under Data Engineers Club, aimed at aiding the community's growth and development.
Our highly experienced guest interviewer, Umesh Kumar Roy (/ umesh-kumar-roy), shares invaluable insights and practical guidance drawn from his extensive expertise in the Big Data domain.
Our expert guest interviewee, Satyam Meena (/ satyam-meena-0a1b46138), has an interesting approach to answering interview questions on Apache Spark, SQL, and Azure Cloud Services.
Links to the free SQL and Python series developed by me are given below:
SQL Playlist - • SQL tutorial for every...
Python Playlist - • Complete Python By Sum...
Don't miss out - Subscribe to the channel for more such informative interviews and unlock the secrets to success in this thriving field!
Social Media Links :
LinkedIn - / bigdatabysumit
Twitter - / bigdatasumit
Instagram - / bigdatabysumit
Student Testimonials - trendytech.in/#testimonials
TIMESTAMPS : Questions Discussed
00:50 Introduction
02:10 What sources do you use for data ingestion?
02:25 What connectors do you use for data ingestion?
02:45 How do you store and transform data after ingestion?
03:58 How are you preprocessing the data?
04:41 How do you eliminate duplicate records?
05:12 How do you ensure the correct records when handling duplicates?
05:50 How is your storage layer designed? Do you use mounting techniques?
06:04 Do you use delta files? Why?
07:00 What optimization techniques have you implemented?
08:05 Do you use partitions?
08:24 What factors do you consider when partitioning?
09:11 Do you use bucketing?
09:36 What are the use cases for partitioning and bucketing?
10:33 Besides broadcast joins, what other joins do you use?
10:52 Which join is the most efficient?
11:50 What is the difference between narrow and wide transformations?
12:26 What is your understanding about Spark and Databricks?
13:22 How do you consume data from the gold layer?
14:42 How do you connect Power BI to Azure Synapse?
15:46 Can you outline Spark architecture?
17:07 What is a DAG?
18:15 What is the difference between client mode and cluster mode?
19:29 Have you faced any challenges with cluster mode?
20:50 Why do DataFrames and Datasets exist?
22:17 What do you understand by normalization?
22:51 What other optimization techniques do you use?
23:33 SQL query
Music track: Retro by Chill Pulse
Source: freetouse.com/music
Background Music for Video (Free)
Tags
#mockinterview #bigdata #career #dataengineering #data #datascience #dataanalysis #productbasedcompanies #interviewquestions #apachespark #google #interview #faang #companies #amazon #walmart #flipkart #microsoft #azure #databricks #jobs

Comments • 4

@gudiatoka several months ago ⁺⁵
When someone says they are optimizing the code in Databricks, they are all faking 😂😂.
Spark itself optimizes your code using the Catalyst optimizer / Spark SQL engine, and since Spark 3.0 introduced Adaptive Query Execution (AQE), it also optimizes joins at run time. We can alter the broadcast threshold, but that is handled by the admin team during Databricks cluster creation.
The only things not affected by those two are the ones stored in user-defined memory, such as UDFs and low-level RDD operations, which nobody does in Databricks nowadays. The last one is manual caching.
@SrihariSrinivasDhanakshirur several months ago ⁺³
Not necessarily; there are a lot of other optimizations we can do at the resource level, plus partitioning, bucketing, etc.
@LearnifyTvKannada-ue6op 13 days ago
@SrihariSrinivasDhanakshirur exactly, there are a lot of other optimisations
@hdr-tech4350 7 days ago
Source type, project discussion
Handling duplicates
Delta lake feature
Spark vs dbx
Power bi connect to synapse
Spark architecture
Dag
Client mode vs cluster mode
Df vs dataset
Normalisation
2nd highest salary in dep
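
The last topic in the list above, the second-highest salary per department, corresponds to the "SQL query" item in the timestamps. The exact query and table used in the video are not reproduced on this page, so the following is only a minimal sketch under stated assumptions: the `employees` table, its columns (`name`, `dept`, `salary`), the sample rows, and the helper `second_highest_salary_by_dept` are all hypothetical, and SQLite (via Python's standard `sqlite3` module) stands in for whatever engine the interview used. It takes "second highest" to mean the second-highest distinct salary, so a department with only one distinct salary returns no row.

```python
import sqlite3


def second_highest_salary_by_dept(rows):
    """Return (dept, second_highest_distinct_salary) pairs, ordered by dept.

    `rows` is an iterable of (name, dept, salary) tuples. The table layout
    is an assumption made for illustration, not taken from the video.
    """
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER)"
        )
        conn.executemany("INSERT INTO employees VALUES (?, ?, ?)", rows)
        # For each department, keep only salaries strictly below that
        # department's maximum, then take the maximum of what is left.
        query = """
            SELECT e.dept, MAX(e.salary) AS second_highest
            FROM employees AS e
            WHERE e.salary < (
                SELECT MAX(m.salary)
                FROM employees AS m
                WHERE m.dept = e.dept
            )
            GROUP BY e.dept
            ORDER BY e.dept
        """
        return conn.execute(query).fetchall()
    finally:
        conn.close()


# Hypothetical sample data: "eng" has a tie at the top, "hr" has one employee.
sample_rows = [
    ("Asha", "eng", 120),
    ("Bo", "eng", 150),
    ("Chen", "eng", 150),
    ("Dev", "ops", 90),
    ("Eli", "ops", 80),
    ("Fay", "hr", 70),
]

result = second_highest_salary_by_dept(sample_rows)
print(result)
```

A correlated subquery is used here because it runs on any SQLite version. On engines with window functions, the same result is often written with `DENSE_RANK() OVER (PARTITION BY dept ORDER BY salary DESC)` and a filter on rank 2.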
