Neural Network Forward Pass | GPU Programming | Episode 3

What is the Smallest Possible .EXE?

Dynamic Programming isn't too hard. You just don't know what it is.

ตามล่าหาไอติมราเมงแดรี่ควีน ไอติมแปลก #ร็อคจะรีวิว #เช้งกับร็อค #luckytree #chengandrock

ผมเปิดร้านอาหารใหม่ เงินเดือน 100,000 บาท !

จริงหรือไม่ "นุ่น สุทธิภา" ชอบถ่ายรูป...ให้ผู้ชายดู | ซุป'ตาร์ พาตะลุย

Kernel Grid | GPU Programming | Episode 2

Simon Oz

มุมมอง 1 852

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 16 ก.ย. 2024
Support this channel at:
buymeacoffee.c...
More on Matrix Multiplication:
• Matrix multiplication ...
en.wikipedia.o...
Code for animations and examples:
github.com/Szy...

ความคิดเห็น • 5

@gowiththeflo59 12 วันที่ผ่านมา
This is a great series, thank you!
@dimanft6160 หลายเดือนก่อน ⁺⁷
How does this have only 165 views, it's so good
@vastabyss6496 หลายเดือนก่อน
ikr! Even 3 weeks later, it's not even at 1k :(
@bhavindhedhi 6 วันที่ผ่านมา
equations at 2:24 are incorrect
@Stefan-td1pw หลายเดือนก่อน ⁺¹
Hi, I've been watching these videos in addition to reading the Programming Massively Parallel Processors,
My take on the exercise: (for the sake of brevity, I will not include assigning memory or memcpy for now)
```c
// Kernel Function for Array Summing
__global__ void sumArrays_Kernel(float *A, float *B, float *C, float *D, int Width, int Height, int Depth) {
int x = blockIdx.x * blockDim.x + threadIdx.x;
int y = blockIdx.y * blockDim.y + threadIdx.y;
int z = blockIdx.z * blockDim.z + threadIdx.z;
if (x < Width && y < Height && z < Depth) {
int index = x + y * Width + z * Width * Height; // Defined as index as used twice in next line
D[index] = A[index] + B[y * Width + x] + C[x];
}
}
void sumArrays_Host(float *A, float *B, float *C, float *D, int X, int Y, int Z) {
float *A_d, *B_d, *C_d, *D_d;
// Malloc and Memcpy vars (i.e A -> A_d)
dim3 block(2, 2, 2); // I'm not massively sure on good sizing here
dim3 grid((X + block.x - 1) / block.x,
(Y + block.y - 1) / block.y,
(Z + block.z - 1) / block.z);
sumArraysKernel(d_A, d_B, d_C, d_D, X, Y, Z);
// memcpy result back, and then free memory
}
```
General idea is that we're using a different index for each input vector, based on the logic you were mentioning earlier, the block and grid logic is just making sure we're in bounds

ต่อไป

เล่นอัตโนมัติ

Neural Network Forward Pass | GPU Programming | Episode 3

Neural Network Forward Pass | GPU Programming | Episode 3

What is the Smallest Possible .EXE?

What is the Smallest Possible .EXE?

Dynamic Programming isn't too hard. You just don't know what it is.

Dynamic Programming isn't too hard. You just don't know what it is.

ตามล่าหาไอติมราเมงแดรี่ควีน ไอติมแปลก #ร็อคจะรีวิว #เช้งกับร็อค #luckytree #chengandrock

ตามล่าหาไอติมราเมงแดรี่ควีน ไอติมแปลก #ร็อคจะรีวิว #เช้งกับร็อค #luckytree #chengandrock

ผมเปิดร้านอาหารใหม่ เงินเดือน 100,000 บาท !

ผมเปิดร้านอาหารใหม่ เงินเดือน 100,000 บาท !

จริงหรือไม่ "นุ่น สุทธิภา" ชอบถ่ายรูป...ให้ผู้ชายดู | ซุป'ตาร์ พาตะลุย

จริงหรือไม่ "นุ่น สุทธิภา" ชอบถ่ายรูป...ให้ผู้ชายดู | ซุป'ตาร์ พาตะลุย

Cute kitty gadget💛💚 #gadget

Cute kitty gadget💛💚 #gadget

CPU vs GPU | GPU Programming | Episode 1

CPU vs GPU | GPU Programming | Episode 1

Fast Inverse Square Root - A Quake III Algorithm

Fast Inverse Square Root — A Quake III Algorithm

TSMC FinFlex: How Chips are made Worse to get Better

TSMC FinFlex: How Chips are made Worse to get Better

Metaprogramming and JIT Compilers - A Brief Overview

Metaprogramming and JIT Compilers - A Brief Overview

Writing a game the hard way - from scratch using C. #1

Writing a game the hard way - from scratch using C. #1

Apple October Event LEAKS - 7 NEW Devices are COMING!

Apple October Event LEAKS - 7 NEW Devices are COMING!

How I installed the HARDEST operating system

How I installed the HARDEST operating system

these compression algorithms could halve our image file sizes (but we don't use them) #SoMEpi

these compression algorithms could halve our image file sizes (but we don't use them) #SoMEpi

OHANA บ้าพลัง EP.117 : เกมการ์ดโอฮาน่า x จ๊อบ เจง RUBSARB

OHANA บ้าพลัง EP.117 : เกมการ์ดโอฮาน่า x จ๊อบ เจง RUBSARB

#JasonDeruloTV // Interesting 🤩 #GotPermissionToPost From @jesse_martin_ #FromTheIslands

#JasonDeruloTV // Interesting 🤩 #GotPermissionToPost From @jesse_martin_ #FromTheIslands

เขมรเปิดฉากสาดกระสุนข้ามชายแดน เพราะตกใจเสียงไอพ่น F-16 ไทย... #historyworld #สงคราม

เขมรเปิดฉากสาดกระสุนข้ามชายแดน เพราะตกใจเสียงไอพ่น F-16 ไทย... #historyworld #สงคราม

ไฮไลท์ฟุตบอล พรีเมียร์ลีก 2024/25 สัปดาห์ที่ 4 : เซาธ์แฮมป์ตัน พบ แมนเชสเตอร์ ยูไนเต็ด

ไฮไลท์ฟุตบอล พรีเมียร์ลีก 2024/25 สัปดาห์ที่ 4 : เซาธ์แฮมป์ตัน พบ แมนเชสเตอร์ ยูไนเต็ด

ไฮไลท์ไทยลีก : บีจี ปทุม ยูไนเต็ด พบ หนองบัว พิชญ เอฟซี

ไฮไลท์ไทยลีก : บีจี ปทุม ยูไนเต็ด พบ หนองบัว พิชญ เอฟซี

สาวถูกทั้งครอบครัวเกลียดเพราะเธอท้อง แต่จู่ๆ พ่อของลูกก็เป็นบอสที่ซ่อนตัวตน

สาวถูกทั้งครอบครัวเกลียดเพราะเธอท้อง แต่จู่ๆ พ่อของลูกก็เป็นบอสที่ซ่อนตัวตน

การแข่งขัน RoV Pro League 2024 Winter | รอบเก็บคะแนน Week 5 Day 3

การแข่งขัน RoV Pro League 2024 Winter | รอบเก็บคะแนน Week 5 Day 3

เอก - รักไม่ช่วยอะไร - Blind Auditions -The Voice Thailand 2024 - 15 Sep 2024

เอก - รักไม่ช่วยอะไร - Blind Auditions -The Voice Thailand 2024 - 15 Sep 2024