1. a.partition is a grouping a simalar type of data best of key.it is use for increase the performance of hive query; create table tablename (col1 ,col2 ) prartitioned by (col3) row format... in dynamic parition set parameters=set hive.exec.dynamic.partition.mode=nonstrict and set hive.exec.dyanamic.partition=true b. bucketing is deviding a data best on hasfunction .it is use for increase the performace of join query . create table tablename (col1 ,col2 )clustered by (col1) row format. set parameter before loading data in bucketed table = set hive.enforce.bucketing=true
Hi Sir are we not asked dsa question in DE roles. Please make a video what question a candidate needs to prepare to crack interview some people say first dsa question is asked to clear interview.Throw some light on it.
I have posted the answer in part 2 video , I hope you didn't check the description for the part 2 video for answers, here is the video - th-cam.com/video/CG51YQHH9C0/w-d-xo.html
Hi Bro, Please reply me for this my interview question. when both data frames have large size and i need to perform join we can't use broad join right. In this case which join should we prefer shuffle hash or sort merge and why is it better note: here is join keys are completely unique keys.
1. a.partition is a grouping a simalar type of data best of key.it is use for increase the performance of hive query;
create table tablename (col1 ,col2 ) prartitioned by (col3) row format...
in dynamic parition set parameters=set hive.exec.dynamic.partition.mode=nonstrict and set hive.exec.dyanamic.partition=true
b. bucketing is deviding a data best on hasfunction .it is use for increase the performace of join query .
create table tablename (col1 ,col2 )clustered by (col1) row format.
set parameter before loading data in bucketed table = set hive.enforce.bucketing=true
Can you provide me answer for 7
Hi Gowtham, can you please make a video on how to explain our daily activities in our big data project?
Sure Naveen
Thanks Gowtham🙂
Hi Gowtham please share this video
Big data is it total coading?
@@dataengineeringvideos plz make a video on how to explain projects?
Can we see a real time end to end project not just explanation but practically doing it from scratch if possible can u please make a video.
Thanks a lot! You are the guiding light for interview preparation.. Looking for more such videos..
Certainly helpful in progressing towards Big Data Engineer
The way you speak and your voice is very impressive bro ..
I will text you soon as a data engineer : )
Anna, please make video on more coding interview question expected for Data Engineer please.
For someone who is applying to an internship in big data, do you ask the same questions or this is just for people with experience
Can you please provide aws questions and answers, it will be very appreciated
As a fresher shall I get job in bigdata
Hi Sir are we not asked dsa question in DE roles.
Please make a video what question a candidate needs to prepare to crack interview some people say first dsa question is asked to clear interview.Throw some light on it.
where you get daily data in your hadoop project (client side) please ans bro
Nice wish to see more real time project videos.
Hi brother need a video on map reduce with demo codes
Nice work keep posting
what is your cluster size in your project (please tell me the ans)
Hi sir, tnq so mush, ur videos are very helpful and understandable.
Very useful Info. Thankyou for your effort.
Hi gowtham can you make a video on gcp data engineer interview questions
Awesome questions ..do you provide training...??
Very good information 😊
Can you please share interview qns for 2 years experience
Who will post the answer
We need both questions & answers
I have posted the answer in part 2 video , I hope you didn't check the description for the part 2 video for answers, here is the video - th-cam.com/video/CG51YQHH9C0/w-d-xo.html
Hello sir,
This is Anu, could you prepare me for bigdata interviews?
Very helpful.
Informative
Hi Bro,
Please reply me for this my interview question.
when both data frames have large size and i need to perform join we can't use broad join right.
In this case which join should we prefer shuffle hash or sort merge and why is it better
note: here is join keys are completely unique keys.
Thank you
👍👍