Stephan T. Lavavej “Floating-Point <charconv>: Making Your Code 10x Faster With C++17's Final Boss”

  • Published on 5 Jan 2025

Comments • 28

  • @NonTwinBrothers
    @NonTwinBrothers 1 year ago +3

    I admire this channel's ability to put < and > symbols in the title even when YouTube disallows it

  • @AminAramoon
    @AminAramoon 5 years ago +80

    This guy always gives some of the best talks at CppCon

  • @isitanos
    @isitanos 4 years ago +14

    Would be nice if this library provided you with constants for the right size of the buffer to accommodate the worst case for each numeric type.
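
A minimal sketch of how a caller might approximate such a constant today. The name `kDoubleBufferSize` and its value of 32 are assumptions made for illustration; `<charconv>` provides no such constant. The sketch relies on the error code returned by `std::to_chars` rather than on the guessed size being right.

```cpp
#include <charconv>
#include <cstddef>
#include <string>
#include <system_error>

// Hypothetical constant: <charconv> itself provides nothing like this.
// 32 is a deliberately generous guess, not a documented worst case.
inline constexpr std::size_t kDoubleBufferSize = 32;

// Writes the shortest round-trip form of `value`; returns an empty string
// if to_chars reports that the buffer was too small.
std::string shortest(double value) {
    char buf[kDoubleBufferSize];
    const std::to_chars_result result =
        std::to_chars(buf, buf + sizeof buf, value);
    if (result.ec != std::errc{}) {
        return {};
    }
    // to_chars does not null-terminate; result.ptr marks the end of output.
    return std::string(buf, result.ptr);
}
```

Checking `result.ec` means that an undersized guess shows up as an empty string instead of a buffer overrun.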

  • @grahambest3809
    @grahambest3809 5 years ago +7

    Wow! Stephan is a great speaker!
    Awesome talk. I did learn quite a bit of new things :-)

  • @mkg4215
    @mkg4215 5 years ago +14

    Great talk, as always. Ryu seems to be a real game changer.

  • @TheEmT33
    @TheEmT33 4 years ago +2

    Packed content, concise explanation, great talk!

  • @gast128
    @gast128 5 years ago +6

    Fast, locale-independent, round-trip serialization of numbers is exactly what we needed a long time ago. The additional speed improvements are always welcome. Pity that it doesn't support wchar_t. It might be an option for us to work around it and add the conversion from string to wstring ourselves.

    • @StephanLavavej
      @StephanLavavej 5 years ago +1

      It would be fairly easy to template the code on character type (crucially, the lookup tables aren't affected; I use a lookup table to convert char to digit which wouldn't be possible for wider types, but that's fine), so I could imagine a proposal being accepted.

    • @kuhluhOG
      @kuhluhOG 4 years ago +3

      Besides using a (weird/legacy) library or using the Windows API, why do you use wchar_t?
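
The workaround discussed in this thread can be sketched as follows. The helper name `to_wstring_shortest` and the 32-byte buffer are assumptions for illustration, not part of `<charconv>`. The sketch also assumes that every character `std::to_chars` emits for a `double` (digits, sign, `.`, `e`, and the letters of `inf`/`nan`) can be widened one character at a time.

```cpp
#include <charconv>
#include <string>
#include <system_error>

// Hypothetical helper: formats with the narrow-char to_chars, then widens.
std::wstring to_wstring_shortest(double value) {
    char buf[32];  // assumed to be large enough; checked via result.ec below
    const std::to_chars_result result =
        std::to_chars(buf, buf + sizeof buf, value);
    if (result.ec != std::errc{}) {
        return {};
    }
    std::wstring wide;
    for (const char* p = buf; p != result.ptr; ++p) {
        // Per-character widening; no locale or encoding conversion involved.
        wide.push_back(static_cast<wchar_t>(*p));
    }
    return wide;
}
```

For example, `to_wstring_shortest(1.5)` yields `L"1.5"`. The extra copy costs something compared with a natively templated implementation.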

  • @movax20h
    @movax20h 4 years ago +5

    Great talk. Going to watch Ryu stuff now. As for vectorizing for 32-bit, don't bother. Using 64-bit you can assume SSE2 and use it always.

  • @ReaperUnreal
    @ReaperUnreal 4 years ago +2

    Wish I'd had the "plain" to_chars mode a few years ago. I was working on a compiler and the compiler itself would crash when outputting the trace of a specific compiler test. It turns out the sprintf of a long double was overflowing the buffer and stomping the return pointer. Didn't have access to snprintf because of the platform. This is how I discovered general mode, but "plain" was really what I wanted.
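
A small sketch of how `std::to_chars` behaves in the overflow situation described above: it never writes past the end of the range it is given and reports `std::errc::value_too_large` instead. The helper name `try_fixed` is hypothetical, and fixed notation is used here only because it makes a large value easy to overflow with.

```cpp
#include <charconv>
#include <cstddef>
#include <system_error>

// Hypothetical helper: tries to write `value` in fixed notation into
// [buf, buf + size) and returns the error code reported by to_chars.
std::errc try_fixed(double value, char* buf, std::size_t size) {
    const std::to_chars_result result =
        std::to_chars(buf, buf + size, value, std::chars_format::fixed);
    return result.ec;  // std::errc{} on success
}
```

With a 16-byte buffer, `try_fixed(1e300, ...)` returns `std::errc::value_too_large`, since the fixed form needs 301 digits, while `try_fixed(1.5, ...)` succeeds.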

  • @IndellableHatesHandles
    @IndellableHatesHandles 1 year ago

    Charconv has warnings within it according to VC++, which is funny but also makes it impossible to interpret warnings as errors.

  • @zvxcvxcz
    @zvxcvxcz 4 years ago +6

    Can we please get arbitrary (multiple) precision in the standard library though? GMP may be awesome but there is a real shortage of well supported higher level libraries utilizing multiple precision because adding the implementation is often not very straightforward. I suspect that it would be a lot easier for higher level libraries to add support if these types were available in the standard library and could be easily dropped in.

  • @goshisanniichi
    @goshisanniichi 5 years ago +26

    lol. Ryu is Japanese for dragon. Gotta love programmer humor.

    • @guiorgy
      @guiorgy 3 years ago +2

      Original algorithm: Dragon4 (1990)
      New algorithm: Ryu (2018)
      35:40

  • @AlwinMao
    @AlwinMao 2 years ago +4

    Link to Ulf's 2018 PLDI talk: th-cam.com/video/kw-U6smcLzk/w-d-xo.html

  • @Astfresser
    @Astfresser 2 years ago

    How does gcc implement it and how does it perform for RISC-V processors? This would be crucial to know for embedded use.

  • @10100rsn
    @10100rsn 2 years ago +2

    Great! And Ryu means Dragon in Japanese... Full circle now.

  • @VioletGiraffe
    @VioletGiraffe 4 years ago +1

    So what exactly is wrong with multiplying and dividing by a power of 10? I often use this to trim the value to N digits after the decimal point.

    • @sander_bouwhuis
      @sander_bouwhuis 4 years ago +4

      I guess because log2(10)≈3.32192809489? When you divide by 10 you lose precision and it takes many more cycles. Also, he didn't say that dividing by a power of 10 in one step (e.g. div 10e5) was wrong, only that repeatedly dividing (div 10 div 10 div 10 div 10 div 10) to build up the number is.
      PS: Thanks for the CppCheck addin! (I presume you are the same guy/girl?)
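
If the goal is N digits after the decimal point in the printed text rather than a changed stored value, one alternative is to let `std::to_chars` do the rounding in fixed notation. This is a sketch under that assumption, not something stated in the talk; the helper name `format_fixed` and the 512-byte buffer are hypothetical.

```cpp
#include <charconv>
#include <string>
#include <system_error>

// Hypothetical helper: formats `value` with exactly `digits` digits after
// the decimal point, leaving the double itself untouched.
std::string format_fixed(double value, int digits) {
    char buf[512];  // assumed large enough; failure is reported via result.ec
    const std::to_chars_result result = std::to_chars(
        buf, buf + sizeof buf, value, std::chars_format::fixed, digits);
    if (result.ec != std::errc{}) {
        return {};
    }
    return std::string(buf, result.ptr);
}
```

For example, `format_fixed(3.14159, 2)` gives `"3.14"` and `format_fixed(0.5, 3)` gives `"0.500"`, with no intermediate multiplication or division on the value.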

  • @jxsl13
    @jxsl13 5 years ago +1

    Great talk.

  • @Carutsu
    @Carutsu 5 years ago +4

    Any links to Ulf's talk?

    • @tsafin
      @tsafin 5 years ago +3

      I guess this one: th-cam.com/video/kw-U6smcLzk/w-d-xo.html

    • @nielsdegroot9138
      @nielsdegroot9138 4 years ago +1

      Check the links slide @51:00

  • @DrGreenGiant
    @DrGreenGiant 3 years ago +3

    Yikes, that's a lot of memory required for this then. RIP embedded use :(

    • @Astfresser
      @Astfresser 2 years ago +3

      I thought the same. Well, I'll test it anyway; I hope there are a lot of constexpr paths that can be compiled out.