All Things ViTs || CVPR 2023 Tutorial || Hila Chefer and Sayak Paul

Sayak Paul

Views: 23,236

Published on 29 Jan 2025

Comments • 14

@vi5hnupradeep 1 year ago ⁺¹
Thanks for sharing this, Sayak Paul. As always, amazing work in covering everything in so much detail.
@lorenzoleongutierrez7927 1 year ago ⁺¹
Thanks for sharing!
@НиколайНовичков-е1э 1 year ago
Thanks for sharing! It was very interesting!
@tydsuper3122 1 year ago ⁺¹
what a good presentation!
@aritraroygosthipaty3662 1 year ago ⁺¹
Amazing! Congratulations.
@ritwikraha 1 year ago ⁺²
This is fantastic! Congratulations Sayak da! 🎉
@sbeg-wv7fz 1 year ago
Thanks for sharing
@deepaksingh-vt2gq 1 year ago ⁺¹
I have a doubt, at 19:41 (From Self-Attention to Cross-Attention) slide, at the bottom, shouldn't we group Q and K for Text and V for Image?
@hoangminhnguyen435 1 year ago ⁺²
Cross Attention means you want to find the relevance between different objects inputs, so basically Q and K must come from different source, and K and V must come from same source because K and V are representation of same object with different scope.
@amitpareek4215 1 year ago ⁺¹
Congrats sir
@soumyasarkar4100 1 year ago
congrats !
@reloto5665 1 year ago ⁺¹
Is there a small mistake at 53:09?
Shouldn't it be 0.2+0.05+0.04 = 0.31 instead of 0.11?
@SphereofTime 1 year ago
1:43:16
@mikhailandreev1595 1 year ago ⁺²
Sayak is the backbone of half the global ML community
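
Editor's note on the cross-attention question at 19:41: the reply in the thread (Q from one source, K and V together from the other) can be illustrated with a minimal sketch. This is not code from the tutorial. The function names, the toy token values, and the identity projection matrices are all assumptions made for illustration, and the choice of image tokens as the query source and text tokens as the context source is likewise an assumption, not a statement about what the slide shows.

```python
import math


def softmax(row):
    # Numerically stable softmax over one row of scores.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    # Plain-Python matrix product of a (n x d) and b (d x m).
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def cross_attention(query_src, context_src, w_q, w_k, w_v):
    # Hypothetical helper: Q is projected from one input,
    # while K and V are both projected from the other input.
    q = matmul(query_src, w_q)
    k = matmul(context_src, w_k)
    v = matmul(context_src, w_v)
    scale = math.sqrt(len(q[0]))
    scores = [[s / scale for s in row] for row in matmul(q, transpose(k))]
    # One weight per (query token, context token) pair.
    weights = [softmax(row) for row in scores]
    # Each output row is a weighted mix of context values.
    return matmul(weights, v), weights


# Toy data (assumed): 3 image tokens act as queries, 2 text tokens as context.
image_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
text_tokens = [[1.0, 2.0], [3.0, 4.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]  # stand-in for learned projections

output, weights = cross_attention(image_tokens, text_tokens, identity, identity, identity)
print(len(output), len(weights[0]))
```

The attention weights have one row per query token and one column per context token, so K has to index the same tokens as V for the weighted sum over V to be defined. That is why K and V are paired, rather than Q and K.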
