Column vs Row Oriented Databases Explained

  • Published on 16 Jun 2024
  • In this video, I explain the differences between column-oriented and row-oriented database storage, how efficient each method is, and their pros & cons. We will also see how different queries perform against both types of databases.
    0:00 Intro
    2:50 Row-Oriented Database
    15:30 Column-Oriented Database
    26:30 Pros & Cons
    Download slides here
    payhip.com/b/JVHIM
    🎙️Listen to the Backend Engineering Podcast
    husseinnasser.com/podcast
    🏭 Backend Engineering Videos
    • Backend Engineering (B...
    💾 Database Engineering Videos
    • Database Engineering
    🏰 Load Balancing and Proxies Videos
    • Proxies
    🏛️ Software Architecture Videos
    • Software Architecture
    📩 Messaging Systems
    • Message Queues & PubSu...
    Become a Member
    / @hnasr
    Support me on PayPal
    bit.ly/33ENps4
    Stay Awesome,
    Hussein
  • Science & Technology
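
The row-versus-column comparison in the description can be sketched with a toy model that counts block reads for an aggregate query. This is only an illustration under stated assumptions: the table contents, the block sizes, and every name below (`ROWS`, `row_blocks`, `col_blocks`, and the two functions) are made up for the sketch and are not taken from the video or from any real database engine.

```python
# Toy model: storage is split into fixed-size blocks and we count how many
# blocks a query has to read. All data and sizes here are illustrative.
ROWS = [
    # (row_id, first_name, ssn, salary)
    (1001, "John", 111, 101_000),
    (1002, "Kary", 222, 102_000),
    (1003, "Norman", 333, 103_000),
    (1004, "Nola", 444, 104_000),
    (1005, "Rose", 555, 105_000),
    (1006, "Rick", 666, 106_000),
]
ROWS_PER_BLOCK = 2    # row store: each block holds 2 complete rows
VALUES_PER_BLOCK = 6  # column store: each block holds 6 values of one column


def chunk(items, size):
    """Split a list into consecutive blocks of at most `size` items."""
    return [items[i:i + size] for i in range(0, len(items), size)]


# Row-oriented layout: blocks of complete rows.
row_blocks = chunk(ROWS, ROWS_PER_BLOCK)

# Column-oriented layout: separate blocks per column, each value tagged
# with its row_id so the row can be reassembled later.
COLUMNS = ("first_name", "ssn", "salary")
col_blocks = {
    name: chunk([(row[0], row[i + 1]) for row in ROWS], VALUES_PER_BLOCK)
    for i, name in enumerate(COLUMNS)
}


def sum_salary_row_store():
    """SUM(salary) on the row layout: every block holds some salaries."""
    total = sum(row[3] for block in row_blocks for row in block)
    return total, len(row_blocks)


def sum_salary_column_store():
    """SUM(salary) on the column layout: only salary blocks are read."""
    blocks = col_blocks["salary"]
    total = sum(value for block in blocks for _, value in block)
    return total, len(blocks)


print(sum_salary_row_store())     # (total, blocks read) for the row layout
print(sum_salary_column_store())  # (total, blocks read) for the column layout
```

In this toy setup the row layout has to touch all three blocks to add up the salaries, while the column layout reads a single salary block. Real engines add indexes, compression, and caching, which the sketch deliberately ignores.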

Comments • 122

  • @DannyPhantumm
    @DannyPhantumm 2 years ago +49

    Clearly, you're a naturally gifted teacher. Great content.

  • @SrivatsanVenkatesan
    @SrivatsanVenkatesan 1 year ago +1

    I understand this is one of your older videos, but wanted to mention that your content is first class! Thank you!

  • @PrashantZombade
    @PrashantZombade 1 year ago +2

    Most interesting way of teaching I have ever found, learning can't be more fun than this!

  • @kharlcutie4242
    @kharlcutie4242 3 months ago +2

    Love the way you teach. I almost didn't want the video to end.

  • @hnasr
    @hnasr 3 years ago +22

    #savetheducks

  • @richboiii17
    @richboiii17 8 months ago +2

    Incredible video. I wanted to solidify my understanding of the concept of columnar databases vs row-based, and this video not only made it easy to understand, but enjoyable too!

  • @adrianasensio4298
    @adrianasensio4298 3 years ago +9

    The content of this channel is superb.

  • @foxtrotbr
    @foxtrotbr 3 years ago +2

    One of the best explanations I have seen. Thanks man

  • @murugesanrm
    @murugesanrm 2 years ago

    Thank you very much for making this video with a real-time example. Much appreciated

  • @williamfelippedeschamps7050
    @williamfelippedeschamps7050 3 years ago +2

    Great video, well explained, fun and informative. I loved that, thanks dude!

  • @videosforthegoodlife2253
    @videosforthegoodlife2253 1 year ago +3

    Great video man. I like a mix of column (for logging, source of truth) and table-based RDBMS, and also document-oriented (which could be either row or column) for quick trashy dirty data that makes you blush when you look at it too long.
    But I've seen columns used for quick trashy data, where sums or map-reduce is the highest priority and it blows everything else away. I am digging ScyllaDB lately.

  • @HopeMlebe
    @HopeMlebe 1 year ago +1

    Visually clear, funny and interesting explanations, you are greatly talented.

  • @haoyuanhuang5098
    @haoyuanhuang5098 2 years ago +4

    Gives concrete examples of when column database operations are faster or slower than a row database. Thank you!

  • @harshitagrawal8556
    @harshitagrawal8556 2 years ago

    Thank you so much for explaining this concept so beautifully and in such a great depth...I am a fan of your teaching.....

  • @RooftopDuvet
    @RooftopDuvet 1 year ago +1

    Great work here! So many explanations of this are too high level, and miss the key differentiator: i.e. the way in which the data is accessed. You did a great job and did it at your own pace. Hope you find success with this style.

  • @OmarBela12
    @OmarBela12 3 years ago +1

    this dude is good, the channel is underrated

  • @nishantgarg7497
    @nishantgarg7497 3 years ago +4

    Really awesome content. A really good source for me who is looking to improve the backend concepts. Really thank you for such good content. Just subscribed for updates

    • @hnasr
      @hnasr 3 years ago +2

      Glad it was helpful! and welcome to the community

  • @muustafaaahmed9782
    @muustafaaahmed9782 1 year ago

    I really loved your method of describing this topic.

  • @antonsizou9075
    @antonsizou9075 3 years ago +1

    Thanks. Good info. Never knew how column DBs work

  • @michaelgeorge6765
    @michaelgeorge6765 2 years ago

    Thank you Hussein. Really simple, good and funny.

  • @prateekraj1084
    @prateekraj1084 3 years ago

    awesome explanation for both row and column oriented db's

  • @azureabhik
    @azureabhik 1 year ago

    Amazing visualisation of concept keeping the technicalities agnostic along with equally simplified narration.
    The quality of your material and narration is inversely proportional to the jokes :)

  • @vilewalker234
    @vilewalker234 3 years ago +2

    You're very entertaining to watch, listen to, and learn from

    • @hnasr
      @hnasr 3 years ago

      ❤️❤️❤️

  • @ap207
    @ap207 4 months ago

    Absolutely amazing video. Thank you

  • @muhammadfarhanmughal6101
    @muhammadfarhanmughal6101 2 years ago

    Your Accent and voiceovers make it more attractive to learn.

  • @sujojohn2948
    @sujojohn2948 2 years ago

    Thank you so much for this explanation!!

  • @luckyboy1986
    @luckyboy1986 3 years ago

    Thank you! That's very clear !

  • @gabriellegall8278
    @gabriellegall8278 3 years ago

    Thank you so much ! it was so clear

  • @abderrahmanemabroukmerabet9274
    @abderrahmanemabroukmerabet9274 3 years ago

    Really good video bro, I like what you do

  • @anikettiwari6885
    @anikettiwari6885 2 years ago +1

    @Hussein I love your database videos. Could you create a video on how to alter large tables that have millions or maybe billions of records without downtime in Postgres?

  • @prem912490022
    @prem912490022 3 years ago +1

    Waiting for your Udemy course. Great stuff as usual.

  • @spyros5528
    @spyros5528 10 months ago

    Great video and explanation, thank you.

  • @srb1855
    @srb1855 2 years ago

    this was excellent it cleared up the fud👍 thanx!

  • @virendrabhati6685
    @virendrabhati6685 3 years ago +1

    Great information in such a simple way... It clears up the concept in the best way 👍. I love all your videos...

    • @hnasr
      @hnasr 3 years ago

      Thanks Virendra 🙏

  • @darrenz5557
    @darrenz5557 3 years ago

    hi! can you do a vid with indexes? the visuals are so helpful!

  • @OfferoC
    @OfferoC 3 years ago +1

    nice explanation thank you

  • @adityasethi9794
    @adityasethi9794 3 years ago

    This just shows how much he loves what he does.

  • @andreabondanini2509
    @andreabondanini2509 9 months ago

    Amazing content!

  • @vinny142
    @vinny142 3 years ago +2

    13:35 Aggregates read more than you need. Only if you don't have indexes on the columns you query, and if your core business is querying that data then you will have that indexed anyway. Also: if the amount of reads becomes a problem, the first thing you do is de-normalize that value into a separate table.
    This is where database monitoring becomes essential, a nice topic for a ten-part series that will blow your viewers' minds :-)
    I'd change the pros and cons to "what kind of applications benefit from this", because every point you mention has some serious caveats, related issues and known workarounds.

    • @hnasr
      @hnasr 3 years ago

      Correct, that is why I didn’t include indexes in the mix. Thanks for the feedback as usual

  • @tirthpatel8364
    @tirthpatel8364 2 years ago

    Great explanation.

  • @darthmelbius
    @darthmelbius 2 years ago

    8:56 I'm assuming that in your example the query will continue beyond the record (and block) it found as there is no index or keys? So it will scan all "blocks" even if it finds only one match?

  • @MohammedOmarBalousha
    @MohammedOmarBalousha 19 days ago

    Great explanation

  • @mohammedkandelhassan
    @mohammedkandelhassan 3 years ago

    Great Video!

  • @vanigs8726
    @vanigs8726 1 year ago

    ur awesome man. great explanation

  • @pesetskyps
    @pesetskyps 2 years ago

    very clear. bravo!

  • @bigdata_journey
    @bigdata_journey 7 months ago

    Thanks bro, very useful

  • @chunheguo9230
    @chunheguo9230 3 years ago +1

    Great vid. I am working with both data structure types :) Using the postgres as a row base to prepare it for a transformation into columnar for gpus to process :)

    • @hnasr
      @hnasr 3 years ago +1

      Nice! You're going HTAP

    • @chunheguo9230
      @chunheguo9230 3 years ago

      @hnasr No, not going hybrid transactional, as the columnar data is being used as runtime in-memory data until a bulk update changes it.

    • @minscj
      @minscj 3 years ago

      @chunheguo9230 Hi, could you please give me more details on how I can do the same? Please reply

    • @chunheguo9230
      @chunheguo9230 3 years ago

      @minscj Hi, the solution we went with is proprietary, so I can't really go into details. I can however suggest that you take a look at the concept of Apache Arrow. www.dremio.com/announcements/introducing-apache-arrow/ has a nice diagram. We went very low level and didn't use many of the existing open-source abstraction layers. It all came down to understanding how the GPU's processing cycle works and the alignment of the columnar data to said cycle.

  • @Nicoblabla
    @Nicoblabla 3 years ago

    great video!

  • @madanmohanpachouly6135
    @madanmohanpachouly6135 2 years ago

    Nicely explained

  • @oah8465
    @oah8465 3 years ago +1

    Dude that was sweet. Any chance of doing a video on file systems and mapping them to DB OPERATIONS

    • @rujhanarora7892
      @rujhanarora7892 2 years ago

      yup, I wonder if I increase the text value in a column or add a new column, how does it map to disk i/o

  • @therealdrag0
    @therealdrag0 2 years ago

    This column store sounds similar to the inverted indexes that search engines (e.g. Elasticsearch) use. Are there key differences there?

  • @saadowain3511
    @saadowain3511 3 years ago

    Hussein
    Can you please make a short video on the different kinds of DBs, who the providers are, and what the ideal uses are?

  • @swarajgupta3087
    @swarajgupta3087 1 year ago +2

    You opened my eyes :D

  • @praveenX
    @praveenX 10 months ago

    Funny and effective, loved it 👍

  • @ameyapatil1139
    @ameyapatil1139 3 years ago +8

    "Lets confused everybody by new names" : hahahaha well said ! great video thanks

  • @umapathybabu8397
    @umapathybabu8397 3 years ago

    nice demonstration

  • @KaranRaina95
    @KaranRaina95 2 years ago

    Hey Hussein, wouldn't it be fair to say that to get the advantages of column db in row db, we end up making indexes in row db?

  • @yxw9104
    @yxw9104 2 years ago

    Thank you so much!

  • @rajbansal3663
    @rajbansal3663 1 year ago

    Cassandra (NoSQL) uses LSM Tree which makes it a better choice for heavy writes in comparison to SQL databases, any thoughts on this?

  • @RandomShowerThoughts
    @RandomShowerThoughts 1 year ago

    another banger of a video

  • @timbui5556
    @timbui5556 2 years ago +3

    Could you please teach us how to do columnar partition in Postgres? It's easy to find lessons on horizontal partitioning, but I can't find writings on how to do vertical. Thank you!

    • @MercedeX7
      @MercedeX7 1 year ago

      it's called horizontal partitioning. read about it on net

  • @eric7758
    @eric7758 3 years ago

    Great video Hussein. When are you doing WebRTC?

    • @hnasr
      @hnasr 3 years ago +2

      I am working on the slides. Once that's done I'll work on the demos, so maybe a week or two

    • @eric7758
      @eric7758 3 years ago

      @hnasr Thanks

  • @md.imrulhasan8757
    @md.imrulhasan8757 2 years ago

    Sir, here you said that when searching for first_name it automatically loads the final block and skips the first block of first_name. How can it find it? Is it because the row_number is indexed in the DB table?
    If not, then why not find the final block using ssn?

  • @sexypradeepkumar
    @sexypradeepkumar 2 years ago

    Which databases store both row-based and column-based structures?

  • @mrluismartinezzz
    @mrluismartinezzz 3 years ago +2

    Hussein, thanks for the videos. Today imma try & figure out how to download a YouTube video with vanilla Node.js. If I don’t figure it out imma ask you guys for help

  • @Firatayrilik
    @Firatayrilik 2 years ago

    Perfect!

  • @aminebenkhouya7295
    @aminebenkhouya7295 3 years ago +1

    u r the best

  • @ganeshbb1
    @ganeshbb1 2 years ago

    Thank you..

  • @natem889
    @natem889 3 years ago

    How do you work on 1 or less column table?? 🤔

  • @dylanngo4454
    @dylanngo4454 6 months ago

    Thanks bro 🎉

  • @maheshnarayan6440
    @maheshnarayan6440 7 months ago

    Great video. QQ: For columnar DB, if DB stores all the metadata about which block has 1006, won't it also store metadata about social security number 666? So we would need only 2 jumps instead of 3 jumps right?

    • @reactorscience
      @reactorscience 7 months ago

      1006 is the row id (internal to the database). The DB only knows which blocks these internal ids live in. It doesn't store any such metadata for the other columns
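
A minimal sketch of the distinction made in this reply, assuming the column store keeps per-block metadata only for the internal row id (here, the first row id in each block). The data and the names `FIRST_NAME_BLOCKS`, `BLOCK_START_IDS`, `lookup_by_row_id`, and `lookup_by_value` are hypothetical and not taken from any particular database.

```python
from bisect import bisect_right

# Hypothetical blocks of one column (first_name). Each entry is a
# (row_id, value) pair, and row ids increase from block to block.
FIRST_NAME_BLOCKS = [
    [(1001, "John"), (1002, "Kary"), (1003, "Norman")],
    [(1004, "Nola"), (1005, "Rose"), (1006, "Rick")],
]

# The only metadata kept: the first row id stored in each block.
BLOCK_START_IDS = [block[0][0] for block in FIRST_NAME_BLOCKS]


def lookup_by_row_id(row_id):
    """Use the row-id metadata to jump to one block.

    Returns (value, blocks_read).
    """
    index = bisect_right(BLOCK_START_IDS, row_id) - 1
    if index < 0:
        return None, 0  # row id is smaller than anything stored
    for rid, value in FIRST_NAME_BLOCKS[index]:
        if rid == row_id:
            return value, 1
    return None, 1


def lookup_by_value(value):
    """No metadata exists for values, so scan block by block.

    Returns (row_id, blocks_read).
    """
    for reads, block in enumerate(FIRST_NAME_BLOCKS, start=1):
        for rid, stored in block:
            if stored == value:
                return rid, reads
    return None, len(FIRST_NAME_BLOCKS)


print(lookup_by_row_id(1006))   # direct jump using row-id metadata
print(lookup_by_value("Rick"))  # sequential scan over the column
```

Finding a row id costs one block read because the block boundaries are known, while finding a value such as an SSN has to scan the column unless an index is added on top.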

  • @gyhuj1235
    @gyhuj1235 2 years ago

    Why can't we just do SELECT Salary from emp? will that be efficient or will it result in the entire row read and then it will be filtered? The table can be indexed for ssn or name.

  • @user-tc8tp9xn8p
    @user-tc8tp9xn8p 1 year ago

    Awesomeeeeee

  • @Juasbhu
    @Juasbhu 2 years ago

    Question: is column-oriented the same as a column-family DB?

    • @hnasr
      @hnasr 2 years ago +1

      Yes same name. Columnar and column store are other names.

  • @muhammaddavatgar4973
    @muhammaddavatgar4973 2 years ago

    love you sir

  • @instantshinramen
    @instantshinramen 2 years ago

    "The devil!"
    "Save the ducks guys save the ducks"
    Now I understand databases.

  • @ramadhan6273
    @ramadhan6273 3 years ago

    Hussein I wanna know how you had that level of curiosity machallah? is it something gained by training?

    • @hnasr
      @hnasr 3 years ago +2

      It is pure curiosity and asking why and having the humility to learn takes time.

  • @peterisawesomeplease
    @peterisawesomeplease 3 years ago +1

    Good video, but a couple of small things. I think the video was slower than it needed to be: too many tangents and too much repetition. We can pause and go back and forth, so no need to artificially slow it down. Also, I think leaving out indexes does not make sense for this topic. Almost no one is going to choose a column-oriented DB before trying indexes.

  • @benevans1377
    @benevans1377 3 years ago

    Hey bit of an off topic question why did you change your name from igeometry?

    • @hnasr
      @hnasr 3 years ago

      Moving from GIS to personal brand so I get to cover multiple topics mainly.

  • @ianokay
    @ianokay 6 months ago

    We generally want all the columns, that's what a record or document is

  • @jasont6421
    @jasont6421 2 years ago

    Lmao, what's the reference to every time you write to a disk that a duck dies? 😹

  • @yanfung7299
    @yanfung7299 1 year ago

    Let's say in a row-oriented DB, from your explanation, the commas do not exist and are just for display. How will the engine know where to start looking for the first name, etc.?

    •  10 months ago

      For instance, PostgreSQL stores these sequences of values in tuple storage, one for each column in the table. The values are serialised and packed together to form the tuple. When querying data from a table, PostgreSQL uses the stored column names in the system catalogs to interpret the tuples' content correctly. The column names are used by the query planner and executor to map the data values from the tuple storage to their respective columns based on their positions in the tuple.
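
The idea in this reply, that a schema kept outside the row tells the engine how to slice raw bytes into columns, can be sketched as follows. This is a simplified illustration and not PostgreSQL's actual on-disk format: real tuples also carry a header, a null bitmap, and variable-length fields, all of which are left out. The names `EMP_COLUMNS`, `EMP_FORMAT`, `encode_row`, and `decode_row` are hypothetical.

```python
import struct

# Hypothetical "catalog" entry for a table: column names in storage order
# plus a struct format describing each column's fixed width.
EMP_COLUMNS = ("id", "ssn", "salary")
EMP_FORMAT = "<iiq"  # two 4-byte ints and one 8-byte int, no padding


def encode_row(values):
    """Pack the values back to back; no commas or names are stored."""
    return struct.pack(EMP_FORMAT, *values)


def decode_row(raw):
    """Use the catalog's column order and widths to interpret the bytes."""
    return dict(zip(EMP_COLUMNS, struct.unpack(EMP_FORMAT, raw)))


raw = encode_row((1006, 666, 106_000))
print(len(raw))         # size of the stored row in bytes
print(decode_row(raw))  # the same bytes, interpreted through the schema
```

The stored row is just 16 bytes with no delimiters. The position of each column follows from the widths of the columns before it, which is why the schema has to be known before the bytes can be read.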

  • @carlitoz450
    @carlitoz450 3 years ago

    Interesting video. Shouldn't data in a column-oriented DB be stored sorted?

    • @hnasr
      @hnasr 3 years ago

      Not necessarily. The table data usually isn't stored sorted, otherwise writing becomes difficult. Indexes, on the other hand, are sorted

  • @pdeezz
    @pdeezz 2 years ago

    You’re hilarious 😂 and offer a great explanation. Thanks!
    #savetheducks

  • @vinitsunita
    @vinitsunita 1 year ago

    Lesson learnt from the video, Save the ducks :p

  • @sudharshantr8757
    @sudharshantr8757 2 years ago

    at 21:58 1006 was found directly using some "tricks".. Then why can't we use the same tricks to find 666:1006 in the first try?

  • @stormcorexz
    @stormcorexz 2 years ago

    I can't imagine how locking works in a column-oriented database. It must be a nightmare unless it has its own different techniques

  • @07rohitpd
    @07rohitpd 2 years ago

    "...they have all this meta-data, mumbo-jumbo"
    -Hussein

  • @shubhamjaiswal7608
    @shubhamjaiswal7608 1 year ago

    Nasser, great video. But one observation, clearly you were High while making this video.🤣

  • @Juasbhu
    @Juasbhu 2 years ago

    Hahahahha you're so funny. Good video. Thanks

  • @07rohitpd
    @07rohitpd 2 years ago

    "SAVE THE DUCK", guys, "SAVE THE DUCK".

  • @hectorge753
    @hectorge753 3 years ago

    Six Six Six, the devil... SUBSCRIBED

  • @icbm7
    @icbm7 3 years ago

    🦆🦆🦆🦆 Great video!

  • @therealdrag0
    @therealdrag0 2 years ago

    Clarification: “column stores” and “wide column stores” are quite different! I watched this expecting to learn about BigTable/Cassandra. But they have key differences so this video doesn’t apply to them. TIL

    • @hnasr
      @hnasr 2 years ago

      Correct, wide column is different. It groups columns into a column family. Best of both worlds

  • @vivekshah1664
    @vivekshah1664 2 months ago

    save the ducks guys!

  • @sbylk99
    @sbylk99 3 years ago

    omg, why is it that only you can explain complicated problems in easy words!
    Tutorials always say "NoSQL is good for fast writes, scalable, not suitable for complicated queries",
    but no one explains it as clearly as you! Column-based NoSQL is just for simple data writes and AGGREGATE queries. One example is the number of likes of a video.
    Just define a simple table, (video_id, user_like_id), then sum(user_like_id); this scenario is the best for NoSQL.
    Or sensor data: not complicated (can tolerate slow writes), but lots of aggregate queries, like min(), max(), average().

  • @julianzhai6321
    @julianzhai6321 1 year ago

    Awe man but ducks are delicious

  • @kumailn7662
    @kumailn7662 2 years ago

    Hussein, kindly be straightforward in the videos. You are too talkative. I like that, but I am more of an information-centric seeker. If you got to the point it would be appreciated a lot. Second, don't mix or drag the words while talking.

  • @GerardBeaubrun
    @GerardBeaubrun 2 years ago

    666 thank you for the laughter my friend

  • @vraymond2048
    @vraymond2048 2 years ago +1

    "Lets confused everybody by new names", make them look like a fool who can not understand things, thus makes us more "professional" and "experts"!