Polars: Blazingly Fast DataFrames in Rust and Python

แชร์
ฝัง
  • เผยแพร่เมื่อ 23 ม.ค. 2025

ความคิดเห็น • 29

  • @JeremyChone
    @JeremyChone 2 ปีที่แล้ว +28

    Great video. I have been playing with Polars, Python, and Rust, and it is a fantastic technology.
    Also, I think Polars' Python facade with Rust backend / clean API is a great model for other data-centric technologies that need performance and scripting facade interfaces.
    Big thanks to the Polars team for this gem.

  • @ControlTheGuh
    @ControlTheGuh 2 ปีที่แล้ว +4

    Learn rust, python and rust synergies will be all over the place in data science soon. Great work with Polars

    • @jmnunezd1231
      @jmnunezd1231 ปีที่แล้ว

      Why do you think that? Can you elaborate more please? Thanks!

    • @ptarslatinum2806
      @ptarslatinum2806 ปีที่แล้ว

      Can you please provide some insight? I am thinking of learning rust and I wish to enter this field.

  • @АнатолийБугаков-е9г
    @АнатолийБугаков-е9г 2 ปีที่แล้ว +15

    Polars is just great! After trying it I won't go back to pandas.

  • @incremental_failure
    @incremental_failure ปีที่แล้ว +2

    Haven't been so excited about data analysis for a while. Now comes the chore of converting pandas to polars in a 15k line project.

  • @saaddahmani1870
    @saaddahmani1870 2 ปีที่แล้ว +2

    Really great crate and great video, thanks.

  • @Josh-wb7ii
    @Josh-wb7ii 2 ปีที่แล้ว +4

    For a lot of my job its not possible to use eagerly evaluated DFs, polars is a fantastic option.

  • @swalsh1123
    @swalsh1123 2 ปีที่แล้ว

    I'm going to try this out. Thanks

  • @milo4433
    @milo4433 2 ปีที่แล้ว +2

    I wonder if this could ever be used with multiple machines similar to spark. Polars looks very fast and with the addition of distributed compute it would be very powerful. Perhaps it was not designed to allow that though.

    • @АнатолийБугаков-е9г
      @АнатолийБугаков-е9г 2 ปีที่แล้ว

      Polars uses all logical cores of your machine in parallel. Doesn't that mean distributed compute?

    • @milo4433
      @milo4433 2 ปีที่แล้ว +1

      @@АнатолийБугаков-е9г I am asking specifically about cross machine distribution. For example if we had multiple 8-core machines (like VMS in aws) available, can we distribute polars work across those vms instead of being limited to a single machine and its cores.

    • @АнатолийБугаков-е9г
      @АнатолийБугаков-е9г 2 ปีที่แล้ว

      @@milo4433 aah I see the difference now

    • @matthiasdebernardini3388
      @matthiasdebernardini3388 2 ปีที่แล้ว

      ​@@milo4433 yea you totally could but would need to do that yourself, not sure the crate supports it out of the box

    • @kopodgorski
      @kopodgorski 2 ปีที่แล้ว

      Wouldn't be a load balancer be enough to make it happen?

  • @headbangingidiot
    @headbangingidiot 2 ปีที่แล้ว +1

    This is great, can we use polars on a databricks cluster?

    • @rakeshjamdare2049
      @rakeshjamdare2049 ปีที่แล้ว

      Sure, it terms of python, its just another library so can use it very conveniently!!!

  • @Birdsneverfly
    @Birdsneverfly 2 ปีที่แล้ว

    Does polars with with scikit?

    • @jmnunezd1231
      @jmnunezd1231 ปีที่แล้ว +2

      I think they work most of the time. The good think in that if something fails, you can pass polars to numpy very easily

  • @josedejesuslopezdiaz
    @josedejesuslopezdiaz 2 ปีที่แล้ว +1

    great API

  • @samuelemmettbray
    @samuelemmettbray 2 ปีที่แล้ว

    Super confused. I thought Polars was deprecating in favor of Arrow2?

  • @hiongun
    @hiongun ปีที่แล้ว

    need pandas compatible api, just get fast conversion from pandas

    • @headder7802
      @headder7802 ปีที่แล้ว

      It has methods to read and export to Pandas out of the books. It can also write or read Delta tables (locally or in S3).

  • @therendra
    @therendra ปีที่แล้ว

    Nice presentation