CppCon 2018: Peter Sommerlad “Sane and Safe C++ Classes”

CppCon 2018: Nir Friedman “Understanding Optimizers: Helping the Compiler Help You”

CppCon 2016: Nicholas Ormrod “The strange details of std::string at Facebook"

ท่อนบน คนกาฝาก 2 ( ตอนเดียวจบ ) | Endoparasitic 2

🥊 LIVE : RWS ราชดำเนิน เวิลด์ ซีรีส์ | 23พ.ย. 67

ทุกคนต้องไม่เชื่อแน่ๆ ว่าเธอจะใช้มีดอีโต้เฉาะมะพร้าวให้ออกมาเป็นแบบนี้ #negi #coconut

CppCon 2018: Bob Steagall “Fast Conversion From UTF-8 with C++, DFAs, and SSE Intrinsics”

CppCon

มุมมอง 13 336

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 23 พ.ย. 2024

ความคิดเห็น • 18

@OperationDarkside 6 ปีที่แล้ว ⁺³⁰
As dry as the topic is, this talk was amazing. It was clearly structured, most topics built on the previos ones and the wording was easy to understand. I watched it at a far too late hour, but understood most of it, although struggeling at the DFA explanation.
If someone asks me about utf-8 and parsing, I will recommend this talk.
@MatthijsvanDuin 6 ปีที่แล้ว
Well, the first thing you should always do when someone asks about parsing utf-8 is check whether they're not doing so for a bogus reason. Decoding utf-8 into codepoints is something that's rarely needed, instead you can usually simply treat utf-8 strings as byte-strings that satisfy a certain grammar (which needs to be validated for untrusted inputs, but otherwise isn't too important).
@gabrielaubut-lussier1668 6 ปีที่แล้ว ⁺⁵
As far as error handling is concerned, invalid code units cannot simply be dropped. They must be substituted with U+FFFD or the conversion must halt. The security implications are covered here : unicode.org/reports/tr36/#Ill-Formed_Subsequences
Thank you for the great presentation!
@soonts 6 ปีที่แล้ว ⁺⁶
mm_set and mm_set1 intrinsics compile to RAM references. You should use _mm_setzero_si128 to zero registers.
@YSoreil 6 ปีที่แล้ว ⁺⁹
Skip to somewhere around 14:00 if you already know how UTF-8 and friends work to get straight to talking about the code.
@robinmoussu 6 ปีที่แล้ว ⁺¹⁷
Extremely interesting presentation. IMHO it's C (but really high C quality) and not C++, but this doesn't remove anything to the quality of the talk.
@bit2shift 6 ปีที่แล้ว ⁺²
Actually, the C version of the code shown would be a lot uglier considering the enums would leak out of struct scopes.
@pskocik 5 ปีที่แล้ว
@@bit2shift It wouldn't matter. The enums don't need to be in public headers so they don't need to be scoped anyway. But even if you do scope your enums in C, foo__bar is hardly any different from foo::bar. Sure you would always need to use foo__bar and could never just do bar (actually, you could have a macro to bring unscoped variants into a function scope), but some consider such context sensitivity ugly. Best to forget about such petty "what's ugly and what's not" discourse and just focus on generating fast assembly.
@bit2shift 5 ปีที่แล้ว
@@pskocik yes, let the compiler do what it knows pretty well.
Doesn't matter if the enums are hidden from the public interface or not. The point is that unscoped enums can very easily lead to subtle bugs.
@ZiggyGrok 6 ปีที่แล้ว ⁺¹
I would've liked to see some kind of analysis of why his code is so much more efficient -- especially compared to the other DFA implementation. I suspect his performance advantage may evaporate once he handles the cases that the other implementations deal with.
@urisimchoni3936 4 ปีที่แล้ว
if the code handles majority of cases and reliably identify exceptions without losing speed, then the default implementation can be used as fallback. It's done all the time when letting one (non-CPU) piece of hardware handle the fast path, and punt the exceptional cases to the CPU. In this case - if the UTF-8 is well formed, it can be handled fast.
@3bdo3id หลายเดือนก่อน
00:15:00
The conditions of the if-else ladder here confused me and after some digging, I found it is just to check the first 1, 3 , 4 and 5 bits and it would be more unerstandable to do this with just an increasing last points of
@yyny0 3 ปีที่แล้ว
I actually tested many 'optimized' UTF-8 decoders the other day, unfortunately, many could not correctly handle overlong or incorrect codepoints (even if they claimed they could), report error positions for invalid/corrupted bytes accurately, or beat a naive UTF-8 decoder when decoding mostly ASCII (Due to branch mispredicts). I like the presentation of this topic, but unfortunately, most of these 'optimized' decoders just aren't practical for production software.
@llothar68 6 ปีที่แล้ว ⁺⁷
I'm surprised that Microsoft had the best results among the competition.
@mrmodtube 6 ปีที่แล้ว ⁺²
Want to have strlen() results to feel the speed.
@dragdu 6 ปีที่แล้ว ⁺¹
"DFAs can recognize simple regular expressions" thats kinda backwards, regular expressions were first defined as a way to describe regular languages, which are, by definition, accepted by DFAs. The problem is that Perl then implemented regexes via backtracking and started adding features that are not regular, but easy to implement when you have a backtracking solver...
@bernadettetreual 6 ปีที่แล้ว
Energy consumption should also be compared.
@davidjohnston4240 3 ปีที่แล้ว ⁺¹
Energy consumption of SIMD instructions are typically going to be lower than for equivalent SISD instructions over the same data.

ต่อไป

เล่นอัตโนมัติ

CppCon 2018: Peter Sommerlad “Sane and Safe C++ Classes”

CppCon 2018: Peter Sommerlad “Sane and Safe C++ Classes”

CppCon 2018: Nir Friedman “Understanding Optimizers: Helping the Compiler Help You”

CppCon 2018: Nir Friedman “Understanding Optimizers: Helping the Compiler Help You”

CppCon 2016: Nicholas Ormrod “The strange details of std::string at Facebook"

CppCon 2016: Nicholas Ormrod “The strange details of std::string at Facebook"

ท่อนบน คนกาฝาก 2 ( ตอนเดียวจบ ) | Endoparasitic 2

ท่อนบน คนกาฝาก 2 ( ตอนเดียวจบ ) | Endoparasitic 2

🥊 LIVE : RWS ราชดำเนิน เวิลด์ ซีรีส์ | 23พ.ย. 67

🥊 LIVE : RWS ราชดำเนิน เวิลด์ ซีรีส์ | 23พ.ย. 67

ทุกคนต้องไม่เชื่อแน่ๆ ว่าเธอจะใช้มีดอีโต้เฉาะมะพร้าวให้ออกมาเป็นแบบนี้ #negi #coconut

ทุกคนต้องไม่เชื่อแน่ๆ ว่าเธอจะใช้มีดอีโต้เฉาะมะพร้าวให้ออกมาเป็นแบบนี้ #negi #coconut

“คลัง” ปรับแผนแจก 10,000 เร็วขึ้น ลุ้นประกาศผล ธ.ค.นี้ | โฟกัสเศรษฐกิจ | 21 พ.ย. 67

“คลัง” ปรับแผนแจก 10,000 เร็วขึ้น ลุ้นประกาศผล ธ.ค.นี้ | โฟกัสเศรษฐกิจ | 21 พ.ย. 67

A Crash Course in Unicode for C++ Developers - Steve Downey - [CppNow 2021]

A Crash Course in Unicode for C++ Developers - Steve Downey - [CppNow 2021]

CppCon 2018: Geoffrey Romer “What do you mean "thread-safe"?”

CppCon 2018: Geoffrey Romer “What do you mean "thread-safe"?”

*(char*)0 = 0; - What Does the C++ Programmer Intend With This Code? - JF Bastien - C++ on Sea 2023

*(char*)0 = 0; - What Does the C++ Programmer Intend With This Code? - JF Bastien - C++ on Sea 2023

Unicode in C++ - James McNellis - Meeting C++ 2016

Unicode in C++ - James McNellis - Meeting C++ 2016

CppCon 2018: Alan Talbot “Moving Faster: Everyday efficiency in modern C++”

CppCon 2018: Alan Talbot “Moving Faster: Everyday efficiency in modern C++”

Interesting Characters (UTF-16, utf-8, Unicode, encodings)

Interesting Characters (UTF-16, utf-8, Unicode, encodings)

CppCon 2018: Michael Caisse “Modern C++ in Embedded Systems - The Saga Continues”

CppCon 2018: Michael Caisse “Modern C++ in Embedded Systems - The Saga Continues”

Parsing JSON Really Quickly: Lessons Learned

Parsing JSON Really Quickly: Lessons Learned

Characters, Symbols and the Unicode Miracle - Computerphile

Characters, Symbols and the Unicode Miracle - Computerphile

หนุ่ม กรรชัย ลั่น! ไม่อยากได้ยินคำขอโทษ สิ่งที่คุณทำเป็นความเสื่อมทรามในสังคม (คลิปจัดเต็ม)

หนุ่ม กรรชัย ลั่น! ไม่อยากได้ยินคำขอโทษ สิ่งที่คุณทำเป็นความเสื่อมทรามในสังคม (คลิปจัดเต็ม)

หนังเต็มเรื่อง | ฝ่ามือยูไล | หนังแอคชั่น หนังกำลังภายใน หนังกังฟูจีน | พากย์ไทย HD

หนังเต็มเรื่อง | ฝ่ามือยูไล | หนังแอคชั่น หนังกำลังภายใน หนังกังฟูจีน | พากย์ไทย HD

7383273011413077287.mp4

7383273011413077287.mp4

ไวรัลหนักมาก! “สุกี้พรศิริ” ขายดีจนต้องเพิ่มเตา เปิดเพียง 3 ชม. พีกสุด 130 คิว | เส้นทางเศรษฐี

ไวรัลหนักมาก! “สุกี้พรศิริ” ขายดีจนต้องเพิ่มเตา เปิดเพียง 3 ชม. พีกสุด 130 คิว | เส้นทางเศรษฐี

Bibib sudah tahu rahasianya!! 🤢😭 #funnyvideo #funny #funnyanimals #cuteanimals ##cute #pets

Bibib sudah tahu rahasianya!! 🤢😭 #funnyvideo #funny #funnyanimals #cuteanimals ##cute #pets

كم بصير عمركم عام ٢٠٢٥😍 #shorts #hasanandnour

كم بصير عمركم عام ٢٠٢٥😍 #shorts #hasanandnour

I Meet MrBeast To Break The Internet!!

I Meet MrBeast To Break The Internet!!

หาทำ EP.54 : ลาบปลาทับทิมทอดครั้งแรก ของ "เจ๊มิ่ง" | จือปาก

หาทำ EP.54 : ลาบปลาทับทิมทอดครั้งแรก ของ "เจ๊มิ่ง" | จือปาก