Advanced Topics: Hardware Memory Barriers

CoffeeBeforeArch

6,628 views

Published on Nov 26, 2024
Science & Technology

Comments • 19

@MiriBenNissan 4 months ago ⁺¹
Such a great lecture! This is the best explanation of HW barriers I've heard. Thanks for that!
@blipman17 3 years ago ⁺⁵
As always, awesome content! Also very informative on how and why the cpu reorders loads and stores.
@CoffeeBeforeArch 3 years ago ⁺³
Thanks - glad you found it informative!
@markusbuchholz3518 3 years ago ⁺²
Thanks Nick for sharing your passion and knowledge. As I mentioned some time ago about your video performance: how you "discuss with the audience" is probably taken directly from Broadway theatres (or you are even far beyond). Amazing for all the senses. It's great that you present advanced topics, since it pushes the community to pick up new knowledge. Programming concepts and techniques keep advancing, so the complexity has to grow. Great to see Intel, which develops state-of-the-art compilers and libraries (I mean the latest releases of oneAPI TBB). Thanks && have a nice day!
@archanasampath4809 1 year ago ⁺¹
That's the best explanation for barriers!
@abhishekpandey71 2 years ago ⁺¹
awesome, exactly what i was looking for.
@__karthikkaranth__ 3 years ago ⁺²
1) Why does the store buffer have 56 entries? Is this just some heuristic chosen by Intel?
2) Would it make sense to have more granular fences? E.g. mfence(0x456688ff) to flush just that one write? Or is that too granular to be efficient?
Thanks for making these videos, I've learnt so much from them!
@CoffeeBeforeArch 3 years ago ⁺²
1. Like the size/configuration of any hardware structure, it'll be determined by some sort of "common case" analysis. 56 entries is probably just "good enough" for most cases.
2. Depending on the specifics of what you mean, that would break the x86 processor ordering memory model. If the write you want to flush is not next to be drained to the L1$ in what would logically be a FIFO store buffer, you would be reordering that write past earlier writes in program order that have not become globally visible yet. There are more relaxed memory models that allow reordering of more than older writes with later reads, but x86's does not allow this.
Glad you are enjoying the videos!
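The store-then-load reordering described in this reply (a later read passing an older write that is still sitting in the store buffer) is usually demonstrated with a "store buffering" litmus test. The following is a minimal sketch in standard C++, not code from the video; the function name `count_both_zero` and its parameters are hypothetical. It assumes that `std::atomic_thread_fence(std::memory_order_seq_cst)` is lowered to a hardware fence on the target, which is typically the case on x86.

```cpp
#include <atomic>
#include <cassert>
#include <thread>

// Store-buffering litmus test (hypothetical helper, for illustration only).
// Each thread stores to its own variable, then loads the other one.
// Returns how many of `iterations` runs ended with BOTH loads reading 0,
// i.e. each load was performed before the other thread's store was visible.
int count_both_zero(int iterations, bool use_fence) {
    assert(iterations >= 0);
    int both_zero = 0;
    for (int i = 0; i < iterations; ++i) {
        std::atomic<int> x{0}, y{0};
        int r1 = -1, r2 = -1;

        std::thread t1([&] {
            x.store(1, std::memory_order_relaxed);
            // Hardware barrier: the store must be ordered before the load.
            if (use_fence) std::atomic_thread_fence(std::memory_order_seq_cst);
            r1 = y.load(std::memory_order_relaxed);
        });
        std::thread t2([&] {
            y.store(1, std::memory_order_relaxed);
            if (use_fence) std::atomic_thread_fence(std::memory_order_seq_cst);
            r2 = x.load(std::memory_order_relaxed);
        });
        t1.join();
        t2.join();

        if (r1 == 0 && r2 == 0) ++both_zero;
    }
    return both_zero;
}
```

With `use_fence == true` the C++ memory model forbids the both-zero outcome, so the count is always 0. With `use_fence == false` the outcome is allowed but not guaranteed to show up; spawning a thread per iteration makes the race window small, so a real experiment would normally synchronize the two threads' start more tightly.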
@goobensteen 3 years ago ⁺¹
Great content, as always. Do you have a discord server or something similar for questions? It's kinda hard to elaborate in the comment section here.
@CoffeeBeforeArch 3 years ago
Nothing set up at the moment. Easiest way to chat is by email (coffeebeforearch@gmail.com) or to schedule a meeting through google for something like a video call
@jankeshchakravarthy9389 1 year ago ⁺¹
Thanks Nick for the very informative videos. I wonder why the software memory barrier did not work? Thanks
@CoffeeBeforeArch 1 year ago
Software barriers ensure that the compiler does not reorder memory accesses, but they make no guarantees about what the hardware does at runtime (it's free to execute some operations out of program order).
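The distinction drawn in this reply can be sketched in standard C++ with the two fence flavours the language provides. This is an illustrative sketch, not the code used in the video: `flag_a`, `flag_b`, and both function names are hypothetical. `std::atomic_signal_fence` only restricts compiler reordering and emits no fence instruction, while `std::atomic_thread_fence` also orders the accesses at the hardware level.

```cpp
#include <atomic>

// Hypothetical shared flags, standing in for the variables two threads
// would use in a Peterson-style handshake.
std::atomic<int> flag_a{0};
std::atomic<int> flag_b{0};

// Software (compiler-only) barrier: the compiler may not move the load
// above the store, but no CPU fence is emitted, so the hardware may
// still let the load complete while the store waits in the store buffer.
int store_then_load_compiler_barrier() {
    flag_a.store(1, std::memory_order_relaxed);
    std::atomic_signal_fence(std::memory_order_seq_cst);
    return flag_b.load(std::memory_order_relaxed);
}

// Hardware barrier: in addition to constraining the compiler, this
// fence orders the store before the load as seen by other threads.
int store_then_load_hardware_barrier() {
    flag_a.store(1, std::memory_order_relaxed);
    std::atomic_thread_fence(std::memory_order_seq_cst);
    return flag_b.load(std::memory_order_relaxed);
}
```

Run from a single thread, both functions behave identically; the difference only matters when another thread performs the mirrored store and load at the same time.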
@93Mosfet 2 years ago
Really good video. Thanks!
@archanasampath4809 11 months ago
Instead of the hardware barrier instruction, we can also read back the value we just wrote (from the same address). This will force the CPU to flush the writes before the read.
@leonwoestenberg6001 5 months ago
Not in the general case, as the compiler might not do the actual read from memory, as it already has the value in a register. Remember that the compiler only sees one thread of execution and will optimize that. (With volatile, this might work, but at a cost.)
@nisachannel7077 3 years ago
Awesome, awesome!!
@qubasaqube1112 3 years ago ⁺²
Uhh ohh that’s something that nobody knows about. God damn concurrency is hard. Thanks a lot!
@CoffeeBeforeArch 3 years ago
Never a dull moment in parallel programming :^)
@raghul1208 2 years ago
awesome!
