Data-Oriented Design Revisited: Type Safety in the Zig Compiler - Matthew Lugg

  • Published on Nov 21, 2024

Comments • 24

  • @edgeeffect
    @edgeeffect 2 months ago +17

    In the intro to this talk... I was thinking "is he going to put the shirt on or not?!" ... I wasn't disappointed.

  • @galaxygur
    @galaxygur 2 months ago +5

    00:05:40 - Optimize your code by focusing on data layout and memory usage for better performance.
    00:09:40 - Optimize memory usage in compilers by using indices and separating data into arrays, making your data easily serializable and saving space.
    00:14:01 - Efficient data encoding in Zig reduces wasted space and optimizes memory usage (a minimal sketch of this kind of encoding follows after this list).
    00:18:34 - Understanding how extra data is used in if-else and switch statements is key to mastering efficient memory representation in compilers.
    00:22:49 - Use unions and enums in Zig to manage data types and improve type safety in your code.
    00:27:17 - Simplify your API by separating internal data representation from user-facing types for better efficiency and usability.
    00:31:38 - Keep your APIs clean and type-safe by hiding the messy implementation details.
    00:36:41 - Keeping your API implementation hidden lets you leverage your language's strengths while optimizing for hardware performance.
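
    A minimal Zig sketch of the encoding these chapters describe: a one-byte tag, a fixed-size payload, 32-bit indices instead of pointers, and a structure-of-arrays container. All names here are illustrative, not the compiler's actual definitions.

    ```zig
    const std = @import("std");

    /// Illustrative node encoding: a small tag, a fixed 8-byte payload, and
    /// 32-bit indices in place of pointers.
    pub const Node = struct {
        tag: Tag,
        main_token: u32, // index into the token list
        data: Data, // interpretation depends on `tag`

        /// Nodes refer to each other by index, not by pointer.
        pub const Index = u32;

        pub const Tag = enum(u8) {
            add,
            sub,
            call,
            // ...many more in a real compiler
        };

        pub const Data = struct {
            lhs: u32,
            rhs: u32,
        };
    };

    pub const Ast = struct {
        /// MultiArrayList stores each field of `Node` in its own contiguous
        /// array, so a pass that only inspects tags touches one byte per node.
        nodes: std.MultiArrayList(Node),
        /// Variable-length payloads spill into `extra`; `Data.lhs`/`rhs` then
        /// hold indices into this array.
        extra: []u32,
    };

    test "the per-node pieces stay small" {
        try std.testing.expectEqual(@as(usize, 1), @sizeOf(Node.Tag));
        try std.testing.expectEqual(@as(usize, 8), @sizeOf(Node.Data));
    }
    ```

    Compared with a pointer-based tree, the 32-bit indices halve each reference on 64-bit targets and make the whole structure trivially serializable, which is the point the 00:09:40 chapter makes.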

  • @N1c0_792
    @N1c0_792 2 months ago +1

    Great talk, interesting and very understandable throughout

  • @Bvngee
    @Bvngee 2 months ago +2

    Great talk! very engaging, enjoyed

  • @kallebalik
    @kallebalik 2 months ago +1

    Nice talk! Very well done.

  • @sadhlife
    @sadhlife 2 months ago +10

    Where can I buy the shirt?

    • @CoolestPossibleName
      @CoolestPossibleName 2 months ago +6

      Are you planning on doing a talk on dod too?

  • @KooShnoo
    @KooShnoo 2 months ago +1

    fascinating

  • @SimGunther
    @SimGunther 2 months ago +1

    There's CPU local performance, which these DOD talks discuss pretty well, and then there's inter-coprocessor and inter-computer performance.
    While DOD can help with data layout to make the most of each cache line, it can also get people stuck in the weeds. It's best saved for when you know the problem requirements and the bare minimum data that must be returned at each step, from the DB all the way down to the (co)processors. Then you can judge whether there's too little data to make good use of coprocessors that make bulk data processing very fast, whether the loops make a more "OOP" layout sensible, or whether SoA/AoSoA is more practical for maintenance and runtime performance.
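
    For anyone who hasn't seen the layouts being contrasted here, a minimal Zig sketch of an "OOP"-style array-of-structs versus struct-of-arrays and AoSoA; the types are made up purely for illustration.

    ```zig
    /// Array-of-structs ("OOP"-style layout): every element carries every
    /// field, so a pass that only reads `alive` still drags `pos` and `vel`
    /// through the cache.
    const ParticleAos = struct {
        pos: [3]f32,
        vel: [3]f32,
        alive: bool,
    };
    const ParticlesAos = []ParticleAos;

    /// Struct-of-arrays: each field is contiguous on its own, so a pass over
    /// one field fills cache lines with only that field and feeds bulk/SIMD
    /// processing dense input.
    const ParticlesSoa = struct {
        pos: [][3]f32,
        vel: [][3]f32,
        alive: []bool,
    };

    /// AoSoA: fixed-size blocks that are SoA internally, keeping related
    /// fields near each other while still giving dense per-field runs.
    const ParticleBlock = struct {
        pos: [8][3]f32,
        vel: [8][3]f32,
        alive: [8]bool,
    };
    const ParticlesAosoa = []ParticleBlock;
    ```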

  • @ukyoize
    @ukyoize 2 months ago

    13:38 one could have just a "flag" tag, with data describing what kind of flag it is

    • @mlugg5499
      @mlugg5499 2 months ago +4

      Do you mean collapse `plus` and `minus` into one tag which uses `data` to encode which "actual" tag it is? If so, yep, that's sometimes a helpful strategy! We don't do it for the AST, since we have enough tags from 1 byte anyway, so there's no point in it. However, for ZIR, one similar thing we do (which I didn't cover in the talk for simplicity / time reasons) is we have a tag called `extended`, which essentially sacrifices 2 bytes of our 8-byte payload for a second tag enum. The reason for this is that, broadly speaking, a small number of ZIR instructions are very frequently used, and a lot of instructions are very rarely used. So, for frequently used ones, it's helpful to have the whole 8 byte payload to minimize the need for `extra`; but for the rarely used ones (think instructions for things like inline asm), we can happily sacrifice a bit of memory.
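
      A minimal sketch of the `extended` idea described above; the field and type names are made up, not the actual ZIR definitions.

      ```zig
      const std = @import("std");

      /// Frequently-used instructions each get their own primary tag and the
      /// full 8-byte payload.
      const Tag = enum(u8) {
          add,
          sub,
          block,
          // ...
          /// Every rarely-used instruction is funneled through this one tag.
          extended,
      };

      /// The payload is always 8 bytes; which field is active depends on `Tag`.
      const Data = extern union {
          /// Ordinary instructions keep all 8 bytes for operands.
          bin: extern struct { lhs: u32, rhs: u32 },
          /// `extended` spends 2 of those bytes on a second, much larger
          /// opcode enum, leaving 6 bytes for the rare instruction's data.
          extended: extern struct {
              opcode: Extended,
              small: u16,
              operand: u32,
          },
      };

      /// Secondary opcode space for the rarely-used instructions.
      const Extended = enum(u16) {
          inline_asm,
          // ...
      };

      comptime {
          // Collapsing the rare instructions never grows the payload.
          std.debug.assert(@sizeOf(Data) == 8);
      }
      ```

      The trade-off is exactly the one described above: the handful of hot instructions keep the whole payload, while the long tail pays a 2-byte opcode tax it can easily afford.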

    • @Bobbias
      @Bobbias 2 months ago

      @@mlugg5499 I was wondering why you guys did that. Wasn't expecting to just stumble into the answer in a YouTube comment, but I'm happy I now know.

  • @morglod
    @morglod 2 months ago

    Do you have a comparison of how many cache misses you get with SoA versus without? Because in my mind it should cause more cache misses: usually you check the tag and then go to the data, so you first load tags into cache, then load data into cache, then load tags again, etc.

    • @mlugg5499
      @mlugg5499 2 months ago +5

      It's not a problem that the accesses jump back and forth between those regions, because the CPU has lots of cache. If you imagine a hypothetical CPU with two cache lines for data accesses, then one line could store a block of tags, the other a block of data; neither gets evicted until we move onto the next block of tags/data. Of course, in reality, CPUs have orders of magnitude more cache lines than this -- so there's no problem here at all, the chance of those 2 lines getting evicted while you're e.g. in a hot loop over all your instructions/nodes/etc is basically zero. So, data fragmentation of this form isn't really an issue; it only matters once your data is fragmented enough that you can't make full use of a cache line before it's evicted.
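
      To make that access pattern concrete, here is a small illustrative loop (not the compiler's real pass structure) over a structure-of-arrays instruction list: the tags and the payloads live in two separate contiguous regions and are walked in lockstep, so each region streams through its own cache lines.

      ```zig
      const std = @import("std");

      const Tag = enum(u8) { add, sub, call };
      const Data = struct { lhs: u32, rhs: u32 };

      /// Reads the 1-byte tag first and then the 8-byte payload for each
      /// instruction: the "jumping back and forth" discussed above.
      fn sumAddOperands(tags: []const Tag, datas: []const Data) u64 {
          var total: u64 = 0;
          for (tags, datas) |tag, data| {
              switch (tag) {
                  .add => total += @as(u64, data.lhs) + data.rhs,
                  else => {},
              }
          }
          return total;
      }

      test "tags and payloads are walked in lockstep" {
          const tags = [_]Tag{ .add, .call, .add };
          const datas = [_]Data{
              .{ .lhs = 1, .rhs = 2 },
              .{ .lhs = 0, .rhs = 0 },
              .{ .lhs = 3, .rhs = 4 },
          };
          try std.testing.expectEqual(@as(u64, 10), sumAddOperands(&tags, &datas));
      }
      ```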

    • @morglod
      @morglod 2 months ago

      @@mlugg5499 okay, thank you!

    • @morglod
      @morglod 2 months ago

      @@mlugg5499 Can't find any info about "two L1" caches anywhere. Everywhere it's like L3 has many L2s, which have many L1s.
      Do you have any link about it? Like something very detailed.

    • @hemerythrin
      @hemerythrin 2 months ago +1

      @@morglod Comment doesn't say anything about L1 caches, so I assume there's some sort of misunderstanding.
      Caches are separated into "lines" (usually 64 bytes), where each line shares a single tag and is loaded/evicted together.

    • @morglod
      @morglod 2 months ago +1

      @@hemerythrin okay
      So it actually has L1 misses a lot
      But L2 is ok
      Understood, thank you

  • @VoyivodaFTW1
    @VoyivodaFTW1 2 months ago

    First!!!
    Jk. Good talk. Great language
