Querying 100 Billion Rows using SQL, 7 TB in a single table

  • Published on 20 Jan 2025

Comments • 38

  • @TheElementFive, 1 year ago, +10

    The first question you should always ask when working with a 100 billion row database: “Why do I have a 100 billion row database?”

    • @davidlean8674, 1 year ago, +7

      And the answer would be "because I work with a multinational enterprise customer". If you have a large market share in China (1 billion people), India (1 billion people), Europe (0.75 billion people), and the USA (350 million people), it doesn't take long to get to 100 billion transactions.
      If you want to do financial year-on-year comparisons, you need to keep at least 24 months of data, usually 36 months.
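A quick back-of-the-envelope check of the arithmetic in the reply above. The population figures are the commenter's round numbers; the 10% customer share and the function name are assumptions made up for illustration, not figures from the video.

```python
# Round population figures quoted in the reply above (number of people).
MARKETS = {
    "China": 1_000_000_000,
    "India": 1_000_000_000,
    "Europe": 750_000_000,
    "USA": 350_000_000,
}

TARGET_ROWS = 100_000_000_000   # 100 billion transaction rows
RETENTION_MONTHS = 36           # the "usually 36 months" retention window
MARKET_SHARE = 0.10             # ASSUMPTION: hypothetical share of people who are customers


def rows_per_customer_per_month(target_rows, customers, months):
    """Transactions each customer must generate per month to reach target_rows."""
    return target_rows / (customers * months)


total_population = sum(MARKETS.values())
customers = total_population * MARKET_SHARE
needed = rows_per_customer_per_month(TARGET_ROWS, customers, RETENTION_MONTHS)

print(f"Population covered: {total_population:,}")
print(f"Assumed customers:  {customers:,.0f}")
print(f"Transactions per customer per month needed: {needed:.2f}")
```

Under these assumptions, roughly nine transactions per customer per month over 36 months is enough to reach 100 billion rows, which is consistent with the commenter's point that the number is not exotic at that scale.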

    • @leksetengah, 1 month ago

      eBay? Amazon store?

  • @alok5253, 3 years ago, +9

    Simple and concise, thank you!

  • @vaibhavis1, 3 years ago, +4

    Thanks for the explanation. I am curious: is it just scaling of the systems, or does BigQuery also do query optimization to reduce the latency?

  • @Hrzzz1, 1 year ago

    Can we download this database to do some tests?
    A nice idea for the next video would be to compare this same situation with a NoSQL database.

  • @JunaidKhan-gq8nw, 4 months ago, +1

    Great, Thanks a lot, sir.

  • @mathteacher5670, 1 year ago

    Excellent, sir, thank you so much. Highly motivational for a passionate person.

  • @WanderWisdom731, 2 years ago

    Wow, this experiment was a really amazing way to benchmark BigQuery.

  • @houssem25000, 7 months ago

    So I don't have to care about performance when I make projects?!

  • @rajakumarkeelu9449, 26 days ago

    Hi bro, what if I apply a FULLTEXT INDEX (view) before running the query?

  • @ashitoshthakur9402, 3 years ago

    Wow, what a great video, sir ji. Please make a video on SQL with ML as well, sir.

  • @PradeepMishra-qs2hz, 2 years ago

    Awesome. Keep it up.

  • @vipulkumar7938, 3 years ago, +1

    Well Explained, Thanks a lot

  • @Mju98, 10 months ago

    Hello sir. I tried to import 400k records into the BigQuery sandbox, but it ended with many errors. Is it possible to import that data? Please, can anyone help me? It's urgent (interview assignment).

  • @Rpskmr, 11 months ago

    Nice video, but during the voice-over it would be better to expand the screen than to show the videos side by side.

  • @AamirKhan-vu2om, 3 years ago

    Hey, very informative. I came here while searching for how to process big data in seconds. I have a question: I would like to build a system where I import terabytes of data into a single table with keys, and I want to perform all the DML operations so that they take very little execution time, as shown. Please help me out with how I can achieve this. I'm stuck.

    • @SK-rl3wu, 28 days ago

      Hi,
      I have a similar requirement. Could you please share your analysis or solution if you found one? Thank you.

  • @toxiclife, 1 year ago

    What should I do when I want to overwrite 100 million rows into a new table within minutes?
    I am using df.write.mode("overwrite").saveAsTable("FINAL"). Could you please help with this?

  • @nfacundot, 1 year ago

    Hello, can I connect to it from PHP?

  • @skill-learning, 3 years ago

    I appreciate your effort. Could you share the link to the Google Cloud project you used?

  • @merhaiakshay9625, 3 years ago

    Please organize the videos and make playlists. Great video, very informative and helpful, which led me to subscribe. Thanks 😊

  • @MDDM03, 1 year ago, +1

    A marketer for Google Cloud. Nothing states what to improve.

  • @prathivenkatasaipavan9909, 3 years ago

    Great explanation

  • @arthurrodrigues5382, 2 years ago

    Amazing!

  • @visva2005, 3 years ago

    @Arpit Agrawal, good. Could you let me know what database is behind this console?

    • @elastiqai, 3 years ago

      Google Cloud BigQuery 😁

  • @himanish2006, 2 years ago

    This is good...

  • @ungeedh, 3 years ago

    Nicely explained.

  • @Helloimtheshiieet, 2 years ago

    I'm confused, were these indexes?

    • @elastiqai, 2 years ago

      BigQuery doesn't have indexes. It has partitions and clustering.
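The reply above points to partitions and clustering. As a rough illustration of why partitioning helps, here is a toy in-memory sketch of partition pruning: a filter on the partition column lets whole partitions be skipped without reading their rows. The class and its methods are hypothetical and say nothing about BigQuery's actual storage format; clustering is not modeled.

```python
from collections import defaultdict
from datetime import date


class PartitionedTable:
    """Toy table partitioned by day (illustrative only, not BigQuery internals)."""

    def __init__(self):
        self.partitions = defaultdict(list)  # partition key (a date) -> list of rows
        self.rows_scanned = 0                # counts rows actually read by queries

    def insert(self, day, row):
        self.partitions[day].append(row)

    def query(self, start, end, predicate=lambda row: True):
        """Return rows with start <= day <= end, reading only matching partitions."""
        result = []
        for day, rows in self.partitions.items():
            if not (start <= day <= end):
                continue  # pruned: this partition's rows are never touched
            for row in rows:
                self.rows_scanned += 1
                if predicate(row):
                    result.append(row)
        return result


# Usage: 30 daily partitions of 1,000 rows each; query a single day.
table = PartitionedTable()
for d in range(1, 31):
    for i in range(1000):
        table.insert(date(2024, 1, d), {"amount": i})

one_day = table.query(date(2024, 1, 5), date(2024, 1, 5))
print(len(one_day), "rows returned;", table.rows_scanned, "of 30000 rows scanned")
```

Without the date filter, the same query would have to read all 30 partitions, which is the difference the partition column is meant to exploit.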

  • @muhamadridwan4766, 2 years ago

    wow!

  • @aminremiiii, 2 years ago

    Please, I have been looking for this for 50 days. I want to create 2000 users in MySQL and set the phone number as the user name and password. Can you tell me how I can create that many users with a default password? That's

  • @abhijayrajvansh, 5 months ago, +1

    it's always an Indian guy!

  • @MdRakib-rc6ub, 2 years ago

    I need your help

  • @sconnell194, 3 years ago

    👍

  • @davidlean8674, 1 year ago

    This is nice but not that impressive. Obviously, the table is being stored using columnstore compression techniques, so you only need to read the columns in the select list. The values are typically grouped in blocks of 1 million rows or more, and the block header pages keep row count values. So you are not reading every row, just the block headers of a single column.
    If your query forced a scan of all rows in the block, requiring them to be combined with other fields in the same row or in other tables before you could filter, you would no longer be in the columnstore sweet spot, and the difference in query speed would be more striking.
    Still good though, as that is a common use case.
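The point in the comment above can be sketched with a toy column store: if each block of a column carries a header with its row count, a plain count never touches the values. The min/max fields in the header are an added assumption (the comment only mentions row counts), and all names here are hypothetical rather than any engine's real layout.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Block:
    values: List[int]  # the column values stored in this block
    row_count: int     # header: number of rows in the block
    min_value: int     # header (assumed): smallest value in the block
    max_value: int     # header (assumed): largest value in the block


def make_blocks(values, block_size):
    """Split one column into fixed-size blocks and compute each block's header."""
    blocks = []
    for i in range(0, len(values), block_size):
        chunk = values[i:i + block_size]
        blocks.append(Block(chunk, len(chunk), min(chunk), max(chunk)))
    return blocks


def count_rows(blocks):
    """COUNT(*) answered from block headers alone; no values are read."""
    return sum(b.row_count for b in blocks)


def count_greater_than(blocks, threshold):
    """Return (matching row count, number of blocks whose values had to be read)."""
    matches = 0
    blocks_read = 0
    for b in blocks:
        if b.max_value <= threshold:
            continue                 # no value in this block can match
        if b.min_value > threshold:
            matches += b.row_count   # every value matches; the header is enough
            continue
        blocks_read += 1             # mixed block: its values must be scanned
        matches += sum(1 for v in b.values if v > threshold)
    return matches, blocks_read


# Usage: 10,000 sorted values in blocks of 1,000.
blocks = make_blocks(list(range(10_000)), 1_000)
print(count_rows(blocks))                # 10000
print(count_greater_than(blocks, 4500))  # (5499, 1)
```

Only one of the ten blocks has to be scanned here because the data is sorted. A filter that needs other columns from the same row, or a join before filtering, would force reading the values themselves, which is the slower case the comment describes.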