For further learning, here are the links from the next to last slide: Arrow cheatsheet: raw.githubusercontent.com/rstudio/cheatsheets/master/arrow.pdf video intro: th-cam.com/video/O42LUmJZPx0/w-d-xo.html full workshop from useR!: arrow-user2022.netlify.app DuckDB website: duckdb.org R package: cran.r-project.org/web/packages/duckdb/index.html data.table website: rdatatable.gitlab.io/data.table dtplyr (a data.table translator): dtplyr.tidyverse.org
A neat question to answer. I'm using the duckplyr library and it's nice to not have to think about anything. It does make a strong argument for having a fast hard drive (an SSD is an order of magnitude faster than a traditional HDD, an M2 is an order of magnitude faster than that, and modern nvme drives are even faster).
Really useful presentation, and timely for me. Personally I find data.table statements are greatly improved with just a little whitespace.
This talk blew my mind. Thank you very much!
For further learning, here are the links from the next to last slide:
Arrow
cheatsheet: raw.githubusercontent.com/rstudio/cheatsheets/master/arrow.pdf
video intro: th-cam.com/video/O42LUmJZPx0/w-d-xo.html
full workshop from useR!: arrow-user2022.netlify.app
DuckDB
website: duckdb.org
R package: cran.r-project.org/web/packages/duckdb/index.html
data.table
website: rdatatable.gitlab.io/data.table
dtplyr (a data.table translator): dtplyr.tidyverse.org
A neat question to answer.
I'm using the duckplyr library and it's nice to not have to think about anything. It does make a strong argument for having a fast hard drive (an SSD is an order of magnitude faster than a traditional HDD, an M2 is an order of magnitude faster than that, and modern nvme drives are even faster).
Masterful deployment of the "Kobayashi Maru" reference. 🖖
Wow, thank you for this illuminating presentation.
Great presentation.
Perfectly summarizes my big data journey. Really good!
Great talk. Concise and to the point.
Loved your presentation. Well done Sir!😊
You are a genius! Fantastic video! Thanks!
Excellent presentation!
great video and interesting analysis use case!
How about just using duckdb and SQL?