duckplyr: Tight Integration of duckdb with R and the tidyverse - posit::conf(2023)

แชร์
ฝัง
  • เผยแพร่เมื่อ 7 ก.ค. 2024
  • Presented by Kirill Müller
    The duckplyr R package combines the convenience of dplyr with the performance of DuckDB. Better than dbplyr: Data frame in, data frame out, fully compatible with dplyr.
    duckdb is the new high-performance analytical database system that works great with R, Python, and other host systems. dplyr is the grammar of data manipulation in the tidyverse, tightly integrated with R, but it works best for small or medium-sized data. The former has been designed with large or big data in mind, but currently, you need to formulate your queries in SQL.
    The new duckplyr package offers the best of both worlds. It transforms a dplyr pipe into a query object that duckdb can execute, using an optimized query plan. It is better than dbplyr because the interface is "data frames in, data frames out", and no intermediate SQL code is generated.
    The talk first presents our results, a bit of the mechanics, and an outlook for this ambitious project.
    Materials: github.com/duckdblabs/duckplyr/
    Presented at Posit Conference, between Sept 19-20 2023,
    Learn more at posit.co/conference.
    --------------------------
    Talk Track: Databases for data science with duckdb and dbt.
    Session Code: TALK-1100

ความคิดเห็น • 4

  • @aswanijehangeer
    @aswanijehangeer 4 หลายเดือนก่อน

    Great Presentation, gonna try duckplyr!

  • @mubangansofu7469
    @mubangansofu7469 6 หลายเดือนก่อน +1

    Exellent development I must say

  • @My-NaMeS_jEfF
    @My-NaMeS_jEfF 6 หลายเดือนก่อน +2

    It’s got a drake meme I’m sold

  • @haraldurkarlsson1147
    @haraldurkarlsson1147 6 หลายเดือนก่อน +2

    Can duckdb work with shape files (maps)?