rdd in spark | Lec-9

MANISH KUMAR

มุมมอง 27 869

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 3 ต.ค. 2024
In this video I have talked about rdd in spark in great detail. please follow video entirely and ask doubt in comment section below.
Directly connect with me on:- topmate.io/man...
For more queries reach out to me on my below social media handle.
Follow me on LinkedIn:- / manish-kumar-373b86176
Follow Me On Instagram:- / competitive_gyan1
Follow me on Facebook:- / manish12340
My Second Channel -- / @competitivegyan1
Interview series Playlist:- • Interview Questions an...
My Gear:-
Rode Mic:-- amzn.to/3RekC7a
Boya M1 Mic-- amzn.to/3uW0nnn
Wireless Mic:-- amzn.to/3TqLRhE
Tripod1 -- amzn.to/4avjyF4
Tripod2:-- amzn.to/46Y3QPu
camera1:-- amzn.to/3GIQlsE
camera2:-- amzn.to/46X190P
Pentab (Medium size):-- amzn.to/3RgMszQ (Recommended)
Pentab (Small size):-- amzn.to/3RpmIS0
Mobile:-- amzn.to/47Y8oa4 ( Aapko ye bilkul nahi lena hai)
Laptop -- amzn.to/3Ns5Okj
Mouse+keyboard combo -- amzn.to/3Ro6GYl
21 inch Monitor-- amzn.to/3TvCE7E
27 inch Monitor-- amzn.to/47QzXlA
iPad Pencil:-- amzn.to/4aiJxiG
iPad 9th Generation:-- amzn.to/470I11X
Boom Arm/Swing Arm:-- amzn.to/48eH2we
My PC Components:-
intel i7 Processor:-- amzn.to/47Svdfe
G.Skill RAM:-- amzn.to/47VFffI
Samsung SSD:-- amzn.to/3uVSE8W
WD blue HDD:-- amzn.to/47Y91QY
RTX 3060Ti Graphic card:- amzn.to/3tdLDjn
Gigabyte Motherboard:-- amzn.to/3RFUTGl
O11 Dynamic Cabinet:-- amzn.to/4avkgSK
Liquid cooler:-- amzn.to/472S8mS
Antec Prizm FAN:-- amzn.to/48ey4Pj

ความคิดเห็น • 66

@sachinjalwadi1213 6 วันที่ผ่านมา
This guy is underrated in the field of Data Engineering!! Crisp & Clear explaination of RDD
@rajat1999gupta หลายเดือนก่อน ⁺¹
Man, "How to and what to" point you mentioned in the video was magical, cleared the whole picture
@biswadeepdatta5218 หลายเดือนก่อน ⁺¹
Best explanation. You are sent by God.
@laboni8359 ปีที่แล้ว ⁺⁴
best explaination of RDD i found on youtube, and no doubt one of the best teachers !
@vishaljare1284 9 หลายเดือนก่อน ⁺³
the way you are teaching is excellent and easy manner.
@PriyankShekhar-y3j 22 วันที่ผ่านมา
very nice explanation !!
@sekirrom4169 3 หลายเดือนก่อน
best one sir , you showed us the real process behind the scenes. Thanks
@sabarnaghosh1658 ปีที่แล้ว ⁺²
please be consistent,and post video ,,i am depending on you to switch my career to data engineering within 2 months
@puneetmaddy 4 หลายเดือนก่อน
Hi Manish,
Your lectures are amazing and full of knowledge. Thanks a lot for uploading such beautiful conceptual videos.
@rishav144 ปีที่แล้ว ⁺²
very well explained
@codingwithanonymous890 11 หลายเดือนก่อน ⁺¹
Every where rdd had same meaning but no proper explanation thank you sar
@ig2947 6 หลายเดือนก่อน
Brilliant..!! Thank you so much for your efforts in teaching
@amarzond5253 5 หลายเดือนก่อน
Love you sir thank you so much......❤❤❤ बोहोत अछा सिखाते हो
@Abhinavkirtisingh 2 หลายเดือนก่อน
mast samjhaya bhai maza aa gya!! Thank you
@HanuamnthReddy 8 หลายเดือนก่อน
Best yotube channel....for learning 😊
@ChandanKumar-xj3md ปีที่แล้ว ⁺¹
hard concept but you made a very easy explanation.👍
@SaljuCodes 3 หลายเดือนก่อน
Thank you sir for best tutorial ❤
@DnyaneshwariTayade-e5v 7 หลายเดือนก่อน
Best explained, the way to teaching is very clear. thanks for providing such content on youtube. I have one request, could you please create a video on difference between RDD, dataframe and Datasets
@kartikgupta2299 หลายเดือนก่อน
Pehele aapne kaha features of RDD me ki optimization or disadvantage me No optimization
Mtlb kya hai
Or full control on our data chahie to rdd use krna chahie ye point clear kro
@harshalunhale4369 5 หลายเดือนก่อน ⁺¹
at 12:15, I think it should be age < 18 for RDD1 and then again age < 10 for RDD2 🙂
@rajun3810 6 หลายเดือนก่อน
excellent content #Love you Manish Bhai
@vijayvhanale8749 6 หลายเดือนก่อน
really amazing explanation, thanks :)
@siddharthdang1521 5 หลายเดือนก่อน
hi Manish, your explaination is very good. Keep us teaching and inspiring always!!! Do we deal with unstructered data on a frequent basis, if yes then is it through RDD always? If possible can you give some example or make a video on it.
@08rajdeepsonawane94 8 หลายเดือนก่อน
Nice explanation sir
@CctnsHelpdesk 6 หลายเดือนก่อน
well explained
@mayankkandpal1565 10 หลายเดือนก่อน
nice explanation.
@sankuM ปีที่แล้ว ⁺⁴
Another good one, @MANISH KUMAR!! 🙌 By any chance, do you still work with them at Jio? 😅😅 Also, can you share what type of architecture are you working with along with data volume?
Keep continuing this series! I'm preparing for #spark for my next switch with your series!! :) Thanks a lot again on behalf of all DEs!!! 👍👏👏
@nitilpoddar 9 หลายเดือนก่อน
Hi @sankuM it has been 8 months, can I ask you about your progress and how has it been going with your switch ?
@sankuM 9 หลายเดือนก่อน
Hey@@nitilpoddar, it is going good only! I switched in May last year only!
Knowing about fundamentals and optimizations thereof really help in day-to-day work I feel now!!!
@satyamkumarjha4185 3 หลายเดือนก่อน
Hi Manish, when is the RDD originally created? Is it created just after JVM code is generated by spark engine? Thanks for all the work you do.
@anshukumari6616 ปีที่แล้ว
Waiting for the next video !!
@ChandanKumar-xj3md ปีที่แล้ว ⁺¹
Q. When you say 500MB/128MB = 4 Partition, is this means that every single partioned data will be stored over each node ?
@manish_kumar_1 ปีที่แล้ว
Yes this partitioned data is distributed over the network
@adityaverma4770 10 หลายเดือนก่อน
@@manish_kumar_1 each node may have different capacity, right? it is not necessary that each node should accommodate only one partition. say we have each node with capacity of 300 mb so in this case it can accommodate 2 partitions of 128 mb each.please confirm?
@pravinyadav8372 9 หลายเดือนก่อน
@@adityaverma4770 yes it can execute two partitions as well in a single node but the executor should have multiple cores. If it is only having 1 core then one executor will only execute one partition. Hope this helps
@wayzonic 5 หลายเดือนก่อน
Node and executer both are different. Single node can build multiple executers base on node capacity
@akumar2575. 5 หลายเดือนก่อน
i started watching lecture from last one week and now from today i am going to complete the lecture within 10 days // target started from today let's support me for this and i know i am a slow learner but i can do it 🙂// counting started from toady date 20/04/24
@manish_kumar_1 5 หลายเดือนก่อน ⁺¹
Isko jaldi complete karne me bahaduri nhi hai. Take your time and try to understand in great depth.
@akumar2575. 5 หลายเดือนก่อน
@@manish_kumar_1 ok and thank you
@prachideokar7639 6 หลายเดือนก่อน
Hello Manish.. practical wale videos ki link mil sakti hai kya?!
@helloanalyst 8 หลายเดือนก่อน
Thanks for sharing valuable information,👍
जब भी कोई RDD के बेस पर नया RDD बनता है तो वो जो नया RDD बनेगा वो क्या same क्लस्टर मे बेनग़ा या फिर किसी दूसरे मे.
@wayzonic 5 หลายเดือนก่อน
On same cluster
@sabarnaghosh1658 ปีที่แล้ว ⁺¹
That means every single dataframe will eventually transformed into RDD right?
@manish_kumar_1 ปีที่แล้ว
Yes
@riyakhandelwal8154 ปีที่แล้ว
In previous videos, it was mentioned by you that a DAG will be created only when it encounters any action(Job--> DAG). In this video you have mentioned a DAG will be created after every transformation. DAG created as an when it scans every line of code ? Or is it created when any action is encountered
@manish_kumar_1 ปีที่แล้ว ⁺¹
DAG is created when an action is encountered.
@Vinodkumar_19879 6 หลายเดือนก่อน
Love you guru ji
@adityasaini8437 10 หลายเดือนก่อน
Here dataframe code means pyspark code right???
@codetechguru1 8 หลายเดือนก่อน
@manish_kumar_1 bahi can you explain it?.
Q1. DataFrames in Spark are built on top of RDDs, but they use a more efficient execution engine called Tungsten.
Q2. RDD don't have compile-time checking. In Vise versa Data frame have Compile-time and runtime checking.
@shreyanshrathod2007 5 หลายเดือนก่อน
Thanks for the explanation, Manish.
One doubt, when we apply multiple transformations and generate new RDD's, are all the previous RDD stored in the memory until an action is called?
Thanks in advance
@camelCasePhantom 27 วันที่ผ่านมา
Whole spark works on the principal of lazy transformation. What that essentially means is, don't process any data unless an output activity is called on it. so if you called 1000 transformations , but never called an action, which is like an output activity of sorts , nothing will be executed. that means , spark won't have to create or store any rdd anywhere.
@akumar2575. 5 หลายเดือนก่อน
done👍
@mariakausar2477 7 หลายเดือนก่อน
🎉👍
@iftekharkhan3254 9 หลายเดือนก่อน
Nice explanation sir.
Red color is not proper visible , better to use other color.
@manish_kumar_1 9 หลายเดือนก่อน
Ok. Thanks
@Vinodkumar_19879 6 หลายเดือนก่อน
Guru ji good morning
@raghudeep8873 ปีที่แล้ว
👍👍👍
@akashghadage5377 ปีที่แล้ว
What's your source of learning Spark? Btw explanation is good 👍
@manish_kumar_1 ปีที่แล้ว ⁺¹
Book spark the definitive guide,
Blogs,
TH-cam videos,
Company real time projects
@nitilpoddar2697 9 หลายเดือนก่อน
done
@manish_kumar_1 ปีที่แล้ว
Directly connect with me on:- topmate.io/manish_kumar25
@anubhavrai5612 ปีที่แล้ว
Which app are you using for making notes?
@manish_kumar_1 ปีที่แล้ว
One note
@Vinodkumar_19879 6 หลายเดือนก่อน
Guru ji bhar bhar ke like subscribe karo sab log
@hell_king_ZoroJuro ปีที่แล้ว
Bhai like kr diya kro dekhne ke bad
@pramod3469 ปีที่แล้ว
very well explained

ต่อไป

เล่นอัตโนมัติ