Surfalytics TV
Just enough DuckDB for Data Analyst | Module 2 | Surfalytics
In this video, Dmitry Anoshin from Surfalytics dives deep into DuckDB, exploring its features, use cases, and benefits for data professionals.
He starts with a brief overview of DuckDB and its different editions, then walks you through a practical tutorial on installing and getting started with it. Dmitry demonstrates how to:
* **Interact with DuckDB:** Learn basic commands and how DuckDB simplifies querying compared to traditional SQL.
* **Manage persistent storage:** Understand how to store your data for future sessions and the advantages it offers.
* **Work with external files:** Effortlessly query data directly from CSV, Parquet, and other formats.
* **Leverage display modes:** Output your results in various formats like JSON and Markdown for easy sharing and documentation.
* **Extend functionality:** Explore the world of DuckDB extensions, including those for HTTP resources and Google Sheets.
* **Connect to external data:** Query data from cloud storage like AWS S3 using readily available plugins.
* **Understand OLAP workloads:** Learn about DuckDB's analytical capabilities and where it has the edge over traditional OLTP systems for analytical queries.
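To make the OLAP versus OLTP distinction from the last bullet concrete, here is a minimal plain-Python sketch. It is not DuckDB code and is not taken from the video: the three-row orders table and the helper names `total_by_region` and `find_order` are hypothetical, chosen only to show why a column-oriented layout suits aggregations while a row-oriented layout suits single-record lookups.

```python
# Illustrative only: plain-Python stand-ins for row and column layouts.
# The table and its values are hypothetical, not taken from the video.

# Row-oriented layout (typical of OLTP): each record is stored together,
# which suits reading or updating one order at a time.
rows = [
    {"order_id": 1, "region": "west", "amount": 120.0},
    {"order_id": 2, "region": "east", "amount": 80.0},
    {"order_id": 3, "region": "west", "amount": 50.0},
]

# Column-oriented layout (typical of OLAP engines such as DuckDB): each
# column is stored together, so an aggregate reads only the columns it needs.
columns = {
    "order_id": [r["order_id"] for r in rows],
    "region": [r["region"] for r in rows],
    "amount": [r["amount"] for r in rows],
}


def total_by_region(cols):
    """Sum `amount` per `region`, touching only those two columns."""
    totals = {}
    for region, amount in zip(cols["region"], cols["amount"]):
        totals[region] = totals.get(region, 0.0) + amount
    return totals


def find_order(records, order_id):
    """OLTP-style point lookup: fetch one whole record by its key."""
    for record in records:
        if record["order_id"] == order_id:
            return record
    return None


print(total_by_region(columns))  # {'west': 170.0, 'east': 80.0}
print(find_order(rows, 2))
```

The aggregate never looks at `order_id`, which is the property analytical engines exploit on wide tables; the point lookup, by contrast, wants every field of one record at once.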
Throughout the video, Dmitry shares practical tips and tricks, real-world examples, and insightful explanations to help you grasp the concepts effectively. He also provides homework assignments to reinforce your learning and encourages you to join the Surfalytics community for more hands-on practice.
Whether you're a data analyst, data engineer, or analytics engineer, this video will equip you with valuable knowledge and skills to enhance your data processing and analysis capabilities using DuckDB.
Resources used:
github.com/surfalytics/analytics-course/tree/5b6c52f0ba66d63eb46dd508543e88e5f7b2c9df/02_getting_started_with_databases/07_duckdb
duckdb.org/
aws.amazon.com/compare/the-difference-between-olap-and-oltp/
Timecode:
00:00 - Introduction
01:44 - Lesson begins
06:00 - Open DuckDB and create a table
09:45 - Cool features of DuckDB
18:00 - Different settings and configurations of DuckDB
20:00 - A little hands-on with DuckDB
22:00 - API in DuckDB
24:00 - Homework assignment
25:00 - A few use cases
Thank you for watching “Just enough DuckDB for Data Analyst | Module 2 | Surfalytics” on Surfalytics_TV!
Click the link to watch previous videos in Module 2: th-cam.com/video/GxLpe97h0tY/w-d-xo.html&pp=iAQB
Subscribe for more content: www.youtube.com/@SurfalyticsTV?sub_confirmation=1
#duckdb #database #sql #dataanalysis #dataengineering #analyticsengineering #bideveloper #surfalytics #tutorial #learnduckdb #dataprocessing #dataanalytics #etl #olap #datapipeline
=================
What is Surfalytics?
Inspired by West Coast surfing spots 🏖️ and Pacific Ocean vibes 🌊. Created to help you start a new career in the data analytics space, and develop data engineering and analytics skills through coaching. It will teach you not just dry skills, but will keep your focus on delivering significant value to businesses in the analytics realm as well as help you get fair compensation 💰 for the work you’re passionate about ❤️‍🔥.
The goal of Surfalytics is to assist you in achieving one of the following:
🏄‍♂️ Land your first job in the data industry with literally zero experience. I have accomplished this many times across the globe.
🏄 Advance from a middle-level role to a senior position (as an Analyst or Engineer).
🏄‍♀️ Transition from a non-technical Analyst role to a technical Engineer role.
Moreover, we will focus on creating a highly competitive CV and securing top job offers. We will not consider any lowball offers, focusing only on top-tier companies and well-paid opportunities.
Finally, Surfalytics is a results-driven community with a very narrow focus, resulting in a high return on investment (ROI). Here, ‘investment’ does not mean money but your time. I am literally fighting for your attention to encourage you to study and work hard, instead of watching Netflix or playing video games.
This is the best YouTube channel for Data Analytics and Engineering. You will fill gaps in your knowledge and pick up new experience and tips for building your own Data Analyst or Data Engineer roadmap.
Want to be part of our growing community? Join us at Surfalytics.com
#surfalytics #dmitryanoshin #datacommunity #freecourses #dataanalysis #dataengineering #roadmap #careerpath #mindmaps #tools #overview #dataanalysttips
Views: 106

Videos

Real Interview Q&A for Staff Analytics Engineer #1 with Nikita Volynets | Surfalytics
79 views, 14 hours ago
To watch the full video, join the Surfalytics community with a 7-day free trial at surfalytics.com/ and find the link in our Discord channel! This is part of the interview that Nikita Volynets had for a Staff Analytics Engineer role. It is a conversation with a team lead who shared a lot about working processes and culture. Nikita talked about his experience, asked a lot of questions, and commented on the them...
NoSQL Databases for Data Analyst and Data Engineer | Module 2 | Surfalytics
334 views, 21 hours ago
In this video, we'll cover the topics related to NoSQL databases and their role in the Data Analytics stack. We’ll walk through the essentials of MongoDB, one of the most popular document databases in full-stack development. From understanding the differences between relational and document databases to exploring MongoDB’s core components (such as collections, documents, and indexes), you’ll gain ins...
Data Engineering Project | Google Big Query Overview
109 views, a day ago
In this lesson, we’ll dive into Google BigQuery’s essentials, exploring how it’s designed for big data analytics and compares to other tools like Snowflake. You’ll learn practical techniques to optimize your queries with partitioning and clustering, keeping performance high and costs low. Plus, we’ll guide you on getting started with BigQuery for free, so you can start running queries immediate...
Spark UI and Query plan review project | Surfalytics
98 views, a day ago
This video provides a practical guide to using Apache Spark, focusing on running Spark locally with Docker and Databricks Community Edition. Dmitry Anoshin demonstrates how to interact with the Spark UI to understand job execution, analyze performance metrics like data input size and resource usage, and troubleshoot potential issues. The tutorial covers core Spark concepts like transformations,...
Real Interview Q&A for Senior Data Engineer #4 with Nikita | Surfalytics
222 views, 14 days ago
This video presents an interview with Nikita, a senior data engineer with 8 years of experience, for a senior data engineer role. Nikita discusses his experience building data pipelines and environments, utilizing technologies like Airflow, Terraform, AWS, DBT, and Snowflake. He details a recent project involving AWS infrastructure deployment, Airflow orchestration, and DBT data transformation,...
Mock Interview | SQL technical round | Episode 7 | Surfalytics
170 views, 14 days ago
This video features a mock SQL interview between Nikita and Paul. It was Paul's first mock interview, and while he was nervous, he showed a good grasp of SQL fundamentals. Nikita provided helpful feedback and guidance throughout the interview, offering tips on how to approach SQL problems, optimize queries, and demonstrate clear thinking to interviewers. This video is a valuable resource for an...
Amazon Data Engineer requirements | BigTech data jobs | Surfalytics
464 views, 21 days ago
To get your free trial, visit surfalytics.com/. In this video, Dmitry Anoshin from Surfalytics provides insights into the role of a data engineer at Amazon, based on his five years of experience working there. He covers the following topics: Job requirements and expectations: Dmitry reviews real Amazon data engineer job postings, highlighting the skills and experience needed. He emphasizes the im...
Visualisation elements for Databases | Module 2 | Surfalytics
111 views, a month ago
To get your free trial, visit surfalytics.com/. Unlock the power of your data through visualization! This video demystifies business intelligence and shows you how to turn raw data into compelling stories. Learn to connect to databases, create stunning visuals, and build interactive dashboards that will impress stakeholders and drive business decisions. Resources mentioned in the video: app.diagr...
Adzuna AWS Project | Surfalytics
192 views, a month ago
Mock Interview | SQL technical round | Episode 6 | Surfalytics
224 views, a month ago
Real Interview Q&A for Data Engineer #2 | Surfalytics
245 views, a month ago
How to manage workloads and determine appropriate cluster size? | Surfalytics
90 views, a month ago
Real Interview Q&A for Senior Data Engineer #3 | Migration project | Surfalytics
278 views, a month ago
Databases in a cloud | Module 2 | Surfalytics
99 views, 2 months ago
What about data administrator role? | Surfalytics
68 views, 2 months ago
Data Models in Databases | Module 2 | Surfalytics
311 views, 2 months ago
Will Data Roles be replaced by AI in 10 years? | Surfalytics
249 views, 2 months ago
Introduction video | Module 5 | Surfalytics
54 views, 2 months ago
Introduction video | Module 4 | Surfalytics
80 views, 2 months ago
Dmitry Anoshin's Surfalytics: Engineer your career!
78 views, 2 months ago
Real Interview Q&A for Senior Data Engineer #2. Second round. | Surfalytics
666 views, 2 months ago
Real Interview Q&A for Senior Data Engineer #2 | Surfalytics
469 views, 3 months ago
Introduction video | Module 3 | Surfalytics
78 views, 3 months ago
Real Interview Q&A for Data Engineer #1 | Surfalytics
156 views, 3 months ago
Connecting to Databases | Module 2 | Surfalytics
247 views, 3 months ago
Real Interview Q&A for Senior Data Engineer #1 | Surfalytics
6K views, 3 months ago
What is Database? | Module 2 | Surfalytics
96 views, 3 months ago
Introduction video | Module 1 | Surfalytics
228 views, 3 months ago
Introduction video | Module 2 | Surfalytics
138 views, 3 months ago

Comments

  • @veradanilova436
    @veradanilova436 a day ago

    Really cool how DuckDB can query data directly from cloud storage like S3.

  • @MaksimGabdeev-g8g
    @MaksimGabdeev-g8g a day ago

    This is awesome, Dmitry! Thanks for breaking down DuckDB in such a clear way.

  • @45825243m
    @45825243m 2 days ago

    DuckDB can be very useful for small projects! thanks.

  • @dmitryanoshin8004
    @dmitryanoshin8004 2 days ago

    DuckDB is becoming more and more popular!

  • @SurfalyticsTV
    @SurfalyticsTV 3 days ago

    Join our community surfalytics.com/

  • @veradanilova436
    @veradanilova436 6 days ago

    I liked the conversational approach

  • @MaksimGabdeev-g8g
    @MaksimGabdeev-g8g 6 days ago

    It's a great reminder of the importance of clear documentation.

  • @SurfalyticsTV
    @SurfalyticsTV 6 days ago

    Thanks for watching! Join our community! surfalytics.com/

  • @dmitryanoshin8004
    @dmitryanoshin8004 6 days ago

    Insightful!

  • @pavlosemeniuk
    @pavlosemeniuk 6 days ago

    Love the calmness and confidence. Thank you for sharing.

    • @SurfalyticsTV
      @SurfalyticsTV 6 days ago

      Thanks for watching!

  • @sio80orel
    @sio80orel 6 days ago

    a tip for the future is a bigger presentation size

  • @dogucanelci4923
    @dogucanelci4923 9 days ago

    Thats amazing bro, but i have a question, why you do not prefer to use created_timestamp , i do not understand the drawbacks of it

    • @SurfalyticsTV
      @SurfalyticsTV 9 days ago

      Good luck with job searching!

    • @dogucanelci4923
      @dogucanelci4923 9 days ago

      @SurfalyticsTV i did not ask for job and i have by the way :D, you do not have to be like stackoverflow user, keep to cut your answers and stick as a non-stop interview, good luck to you

  • @MaksimGabdeev-g8g
    @MaksimGabdeev-g8g 14 days ago

    I liked how he explained his approach to optimization and troubleshooting.

  • @veradanilova436
    @veradanilova436 16 days ago

    Nikita comes across as a very competent and experienced data engineer.

  • @dmitryanoshin8004
    @dmitryanoshin8004 16 days ago

    Very helpful! Now I know what to expect from the DE interview!

  • @SurfalyticsTV
    @SurfalyticsTV 16 days ago

    Interested in learning more about data engineering best practices and overcoming real-world challenges? Join our community at surfalytics.com/ to connect with other data professionals and stay up-to-date on the latest trends in the field.

  • @victorg5056
    @victorg5056 17 days ago

    Wonderful - thanks. Just a question, can we use self join in the second task for finding the users retention?

    • @SurfalyticsTV
      @SurfalyticsTV 9 days ago

      Whatever works for you and returns the right result!

  • @victorg5056
    @victorg5056 17 days ago

    Fantastic - nice job guys!

    • @SurfalyticsTV
      @SurfalyticsTV 16 days ago

      Glad you enjoyed it!

  • @dmitryanoshin8004
    @dmitryanoshin8004 20 days ago

    Amazing experience!

  • @veradanilova436
    @veradanilova436 20 days ago

    +1 to what Nikita said about formatting - makes a huge difference in readability. 👍

  • @SurfalyticsTV
    @SurfalyticsTV 20 days ago

    Become a member surfalytics.com/#membership -> get your own mock interview and much more!

  • @MaksimGabdeev-g8g
    @MaksimGabdeev-g8g 20 days ago

    Paul, that was a solid effort for your first mock interview!

  • @pavlosemeniuk
    @pavlosemeniuk 20 days ago

    Thank you for watching me struggle. Hope you learned something new )

  • @johnsmith21123
    @johnsmith21123 20 days ago

    Don't you think answers like "just double the compute/costs" are wrong?

    • @dmitryanoshin8004
      @dmitryanoshin8004 18 days ago

      it is wrong from the expectations, but this is what happening in the real life=)

    • @studentinfomovie
      @studentinfomovie 8 days ago

      Some companies would rather see you move faster and avoid premature optimizations.

  • @victorg5056
    @victorg5056 24 days ago

    I hope the book makes it back to its owner.

    • @SurfalyticsTV
      @SurfalyticsTV 24 days ago

      It didn't 😭

  • @rasimsalimov
    @rasimsalimov 26 days ago

    Hi Dmitry, do you have any book recommendations for Data Analysts who aspire to be Data Engineers? Cheers from Boston!

    • @SurfalyticsTV
      @SurfalyticsTV 24 days ago

      I will think on that.

  • @pavlosemeniuk
    @pavlosemeniuk 26 days ago

    Thank you for covering this topic.

  • @romanvolozhanin4192
    @romanvolozhanin4192 26 days ago

    Thank you so much for the video! It gave me a lot of ideas on what to prepare for

  • @SurfalyticsTV
    @SurfalyticsTV 27 days ago

    Use the timecode to navigate the video:
    00:00 - Intro
    00:59 - Beginning
    08:18 - Payment levels in Amazon
    14:48 - AWS Architectures
    16:39 - AWS Architect Pillars
    19:23 - AWS Purpose-built analytics
    22:52 - Network analytics
    24:00 - Amazon Leadership Principles

  • @MaksimGabdeev-g8g
    @MaksimGabdeev-g8g 27 days ago

    Thanks for sharing your experience and insights about the data engineer role at Amazon.

  • @veradanilova436
    @veradanilova436 27 days ago

    Thanks for the informative video!

  • @HumilityAlways
    @HumilityAlways 27 days ago

    Thanks you so much Dmitry! Brazil watching

  • @maxkozlove
    @maxkozlove 27 days ago

    Great breakdown with valuable insights!

    • @SurfalyticsTV
      @SurfalyticsTV 27 days ago

      Appreciate it!

  • @cllearlyservice6973
    @cllearlyservice6973 28 days ago

    Oh, I don't have Mac, so I improvised with pgadmin instead postgressapp

  • @PowerBiWithVivek
    @PowerBiWithVivek 28 days ago

    In business, they don't care about the number but why this number.

    • @SurfalyticsTV
      @SurfalyticsTV 27 days ago

      Totally agree!

  • @rasimsalimov
    @rasimsalimov 29 days ago

    Actually the best channel on Data Analytics/Engineering I have found.

    • @SurfalyticsTV
      @SurfalyticsTV 27 days ago

      I'm really happy to hear it!

  • @victorg5056
    @victorg5056 a month ago

    Thank you - the perfect lesson, especially about the pre-commits and GitHub actions

    • @SurfalyticsTV
      @SurfalyticsTV a month ago

      Glad it was helpful!

  • @veradanilova436
    @veradanilova436 a month ago

    I need to try those sql-ex task. Seems cool.

  • @MaksimGabdeev-g8g
    @MaksimGabdeev-g8g a month ago

    Smart visualisation always touches a nerve.

  • @dmitryanoshin8004
    @dmitryanoshin8004 a month ago

    SQL is a king!

  • @MaxZdor
    @MaxZdor a month ago

    Super helpful, tnx

    • @SurfalyticsTV
      @SurfalyticsTV a month ago

      Yeah, tasks were really interesting!

  • @carlos99junior
    @carlos99junior a month ago

    wow very nice, thank you for sharing! How many years have you been working in the field?

    • @SurfalyticsTV
      @SurfalyticsTV a month ago

      15 years. Check out this video, where I talk more about it: th-cam.com/video/JAHmy1EjAnc/w-d-xo.htmlsi=EYUjpDr5EaNKbnkU

  • @anna10son
    @anna10son a month ago

    Thanks for the recording, looking forward to part 3!

  • @45825243m
    @45825243m a month ago

    It's a really good project! You learn a lot from this! Don't forget to make notes during the process.

  • @MaximKazartsev
    @MaximKazartsev a month ago

    Awesome! Thank you for recording a video for this project! Part 3 is ahead :)

    • @SurfalyticsTV
      @SurfalyticsTV a month ago

      Thank you too!

  • @veradanilova436
    @veradanilova436 a month ago

    This is awesome, Maks!

  • @MaksimGabdeev-g8g
    @MaksimGabdeev-g8g a month ago

    This was super helpful for understanding the process.

  • @dmitryanoshin8004
    @dmitryanoshin8004 a month ago

    Very insightful, great to see AWS in action!

  • @dmitryanoshin8004
    @dmitryanoshin8004 a month ago

    So helpful!

  • @SurfalyticsTV
    @SurfalyticsTV a month ago

    This is the recording of the second day. Sorry for the quality, we will work on that. The project's Medium page, if you want to do the first part yourself: medium.com/@kazarmax/from-api-to-dashboard-building-an-end-to-end-etl-pipeline-with-aws-3c1f4048676d Join surfalytics.com/ to learn with us or lead your own project!