Would love a full Udemy Course on DuckDB & dbt from you!
I would buy it
Great tutorial, thanks for the inspiration and your interesting lessons.
This is a great intro video, thank you.
Thank you ❤️
That's game changing, very nice video, Marc, Thank you.
Dude.....I like your style! Subscribed and hit the bell....
Thank you 🙏
I smashed the like "booton", thanks brother, very helpful content!
Thank you 🙏
Brilliant tutorial, appreciate your work.
Awesome content! Thanks a lot!
Thank you! Great content as always!
The link in the description only provides one dataset. How do I get all the CSVs?
Please make more videos
do you have a video on your vscode config setup?
thx a lot, I learned a lot
Thanks Marc
Thank you!
Hi! This is the best video on DuckDB atm. How do you get resources on it? I heard you can connect to a Postgres DB using the Postgres scanner. How does that work?
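If it helps, I think it looks roughly like this. The connection string and table name below are just placeholders, and depending on your DuckDB version the extension may still be called postgres_scanner instead of postgres:
import duckdb

conn = duckdb.connect()
conn.execute("INSTALL postgres")
conn.execute("LOAD postgres")
# attach a running Postgres database so its tables become queryable from DuckDB
conn.execute("ATTACH 'host=localhost dbname=mydb user=me' AS pg (TYPE postgres)")
# query a Postgres table as if it were a local DuckDB table (catalog.schema.table)
print(conn.execute("SELECT * FROM pg.public.orders LIMIT 5").df())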
Great, thanks!
You’re welcome ☺️
Thanks as always for the great content, Marc! Just as a sanity check for myself: OLAP databases don't necessarily need to be columnar-based, correct? Recent big-data databases use columnar storage, but any database that processes analytical workloads is considered an OLAP database, even if it is row-based, correct?
Would love a dbt + DuckDB course from you on Udemy. Big fan of yours 🎉
Awesome ❤
Thank you 🙏
Thanks, great lesson! Could you help me fix an error? It occurs when I try to query the sales table that we've created. The error is "Conversion Error: Could not convert DATE (0019-04-19) to nanoseconds". I don't understand why it's trying to get nanoseconds with the date format. Thanks!
Why not use the pandas SQL functions?
For example, I don't think it supports the nice extended syntax like EXCLUDE. But the main reason is speed: DuckDB will execute way faster; pandas SQL is still pandas in the end. (Correct me if I'm wrong.)
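Quick illustration of what I mean, on a tiny made-up DataFrame (EXCLUDE is DuckDB-specific syntax that the pandas SQL tools don't have, as far as I know):
import duckdb
import pandas as pd

df = pd.DataFrame({"a": [1, 2], "b": [3, 4], "c": [5, 6]})

# SELECT * EXCLUDE (...) keeps every column except the ones listed
# (duckdb.sql needs a fairly recent DuckDB; older versions use duckdb.query)
print(duckdb.sql("SELECT * EXCLUDE (b) FROM df").df())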
I'm sorta stuck... your VSCode is running like a Jupyter notebook... how so? What did I miss? Your screen shows "Run Cell", "Run Above", "Debug Cell"... where does that come from?
Thanks.
I am getting this error:
Conversion Error: Could not convert DATE (0019-04-19) to nanoseconds
when I create the sales table and run conn.execute("from sales").df(). Not sure if you have seen it?
Let me double check that
I'm getting this error too!
I was able to solve it by dropping the non-date values:
df = df.loc[df['Order Date'] != 'Order Date']
df['Order Date'] = pd.to_datetime(df['Order Date'], format='%m/%d/%y %H:%M')
In the SQL block I made this change to match the df format: strptime("Order Date", '%Y-%m-%d %H:%M:%S')
Looks like the TRY_CAST is not working, or it's some bug.
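In case it helps anyone else, here is the whole workaround as one runnable sketch (the CSV glob path is just a placeholder for wherever you put the Kaggle files):
import glob

import duckdb
import pandas as pd

# read all the monthly CSVs into one DataFrame (placeholder path)
df = pd.concat(pd.read_csv(f) for f in glob.glob("data/sales/*.csv"))

# drop the repeated header rows that come from concatenating the CSVs
df = df.loc[df["Order Date"] != "Order Date"]

# parse the dates explicitly so DuckDB doesn't have to guess the format
df["Order Date"] = pd.to_datetime(df["Order Date"], format="%m/%d/%y %H:%M")

conn = duckdb.connect()
# DuckDB can see the local DataFrame by name, so this materializes it as a table
conn.execute("CREATE OR REPLACE TABLE sales AS SELECT * FROM df")
print(conn.execute("FROM sales LIMIT 5").df())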
Like always... way too few likes for such a great video! 🎉
Oh man, oooofff! I tried following your video here and your blog with the stock analysis data. I've run into too many errors, unfortunately. That's the thing about these dependencies: there are too many moving parts where errors are persistent. I can sort through them, but it would take up too much time. We all got work to do :-).
For data analysis work, my go-to at this point is still to use an AWS S3 bucket to load the raw data, use AWS Glue to create a database and an AWS Glue Crawler to upload the data while creating a table within that database. I can also use AWS Glue Studio to convert the data from .csv format to .parquet format. From there, I can use AWS Athena to query the data.
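For anyone curious, the Athena end of that pipeline can also be driven from Python with boto3, roughly like this (the database, table, and bucket names are placeholders for whatever Glue created):
import time

import boto3

athena = boto3.client("athena")

# start the query; Athena writes the results to the given S3 location
resp = athena.start_query_execution(
    QueryString="SELECT product, SUM(quantity) AS qty FROM sales GROUP BY product",
    QueryExecutionContext={"Database": "my_glue_database"},
    ResultConfiguration={"OutputLocation": "s3://my-athena-results-bucket/queries/"},
)
query_id = resp["QueryExecutionId"]

# poll until the query finishes, then fetch the first page of results
while True:
    state = athena.get_query_execution(QueryExecutionId=query_id)["QueryExecution"]["Status"]["State"]
    if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
        break
    time.sleep(1)

if state == "SUCCEEDED":
    rows = athena.get_query_results(QueryExecutionId=query_id)["ResultSet"]["Rows"]
    print(rows[:5])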
Here is the important information: the Jupyter notebook extension needs to be added to VSCode. How about using the StackOverflow principle and avoiding the worthless thank-you comments?
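Concretely, once the Jupyter extension is installed you just put "# %%" markers in a plain .py file and VSCode shows the Run Cell / Run Above / Debug Cell links above each one, something like:
# %% first cell: the marker above is what makes VSCode show Run Cell / Run Above / Debug Cell
import duckdb

conn = duckdb.connect()

# %% second cell: runs in the same interactive session as the first one
print(conn.execute("SELECT 42 AS answer").df())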
The files are not available anymore :(
What files?
@@MarcLamberti To work through this tutorial, you provided the sales dataset in the materials; there is a link in the description. On that page, there is a link to the Kaggle dataset that you used in the video. However, that one got removed, so it is not possible anymore to follow this tutorial :(
oh oh, let me check if I can fix that
@@MarcLamberti Thank you! Let us know if it works again
Here we go: www.kaggle.com/datasets/kushagra1211/usa-sales-product-datasetcleaned
At 4:28 it is written "redhshit" :D Please don't mind. Thanks for the video.
🫣🥹🥹🥹🥹
Except that baziz, everything is perfectly fine.😀
When I ran this Python/DuckDB snippet, it only returned shape (186862, 6):
# with duckdb
cur_time = time.time()
df = conn.execute("""
    SELECT *
    FROM '/kaggle/input/sales-product-data/*.csv'
""").df()
print(f"time: {(time.time() - cur_time)}")
print(df.shape)