The Math behind Transformers | Srijit Mukherjee | Computer Vision | Natural Language Processing

  • Published on 21 Feb 2023
  • In this video, I explain, as clearly as possible, the mathematics behind the Transformer, the architecture widely used for natural language processing tasks. Recently, transformers have also been used to solve computer vision problems. The explanation is based on the paper "Attention is all you need" [arxiv.org/abs/...] by Ashish Vaswani et al.
    There are some errors in the video, which are mentioned in the comments as well as on the following page. Many other learning resources are shared on the page below.
    May you find joy in learning about the science of data.

Comments • 26

  • @arshadkazi4559
    @arshadkazi4559 1 year ago +3

    Great Video! Liked the simplicity and honesty in the explanation.

  • @BiprojitNath
    @BiprojitNath 1 year ago +2

    This is amazing. Keep making more such tutorials.

  • @sayantan336
    @sayantan336 1 year ago +4

    Just a point: the embeddings (512-dim vectorized representations) of the input/output words are also learned as a by-product of the training process, i.e. the embedding layers on both the encoder side and the decoder side are part of the learnable parameters of the model.
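
    A minimal sketch of the point above (assuming PyTorch; the vocabulary size is arbitrary, d_model = 512 as in the paper): the embedding tables are ordinary trainable parameters, so they are updated by backpropagation together with the rest of the model.

      # Embedding layers as learnable parameters (PyTorch assumed, toy vocabulary).
      import torch
      import torch.nn as nn

      vocab_size, d_model = 10000, 512                 # d_model = 512 as in the paper
      src_embed = nn.Embedding(vocab_size, d_model)    # encoder-side embedding table
      tgt_embed = nn.Embedding(vocab_size, d_model)    # decoder-side embedding table

      token_ids = torch.tensor([[5, 42, 7]])           # a toy batch of token ids
      vectors = src_embed(token_ids)                   # shape: (1, 3, 512)
      print(vectors.shape, src_embed.weight.requires_grad)  # gradients flow into the table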

  • @annyd3406
    @annyd3406 1 year ago +4

    Please make more videos, you are great, thank you for this!!!
    Make INDIA proud....

    • @mukherjeesrijit
      @mukherjeesrijit  1 year ago

      Thank you! I will try my best.

    • @michaelestrinone2111
      @michaelestrinone2111 1 year ago

      India or America? He is a PhD student at an American(!) university. If he wanted to make India proud, he would stay in India.

  • @sayantan336
    @sayantan336 1 year ago +4

    Well explained ... Thanks Srijit ... Enjoyed quite a lot..

  • @Rahulsircar94
    @Rahulsircar94 4 months ago +1

    lol transformers for Bengalis. Loved it.

  • @harishravi9936
    @harishravi9936 1 year ago +1

    At 55:44, will the mask matrix be lower triangular with all -inf?
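
    For reference, a common way the decoder ("look-ahead") mask is built in code, not taken from the video itself (a minimal PyTorch sketch): the -inf entries sit strictly above the diagonal and 0 everywhere else, and the mask is added to the score matrix before the softmax.

      # Causal mask sketch (PyTorch assumed): -inf strictly above the diagonal, 0 elsewhere.
      import torch

      seq_len = 4
      mask = torch.triu(torch.full((seq_len, seq_len), float("-inf")), diagonal=1)
      print(mask)
      # tensor([[0., -inf, -inf, -inf],
      #         [0.,   0., -inf, -inf],
      #         [0.,   0.,   0., -inf],
      #         [0.,   0.,   0.,   0.]])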

  • @harishnayak976
    @harishnayak976 8 months ago

    What is the equation for the whole process in one mathematical form?
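
    For reference, the core building block from "Attention is all you need" can be written in one line (the full encoder-decoder model is a composition of many such attention and feed-forward blocks, so it does not reduce to a single closed-form expression):

      \mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{QK^{\top}}{\sqrt{d_k}}\right) V
      \qquad
      \mathrm{MultiHead}(Q, K, V) = \mathrm{Concat}(\mathrm{head}_1, \ldots, \mathrm{head}_h)\, W^{O},
      \quad
      \mathrm{head}_i = \mathrm{Attention}(Q W_i^{Q},\, K W_i^{K},\, V W_i^{V})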

  • @hari8568
    @hari8568 1 year ago

    At 31:51, if we are finding similarities between two words, shouldn't the QK^T matrix be symmetric? Because the relation score between any two words should be the same.
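
    A toy numeric check of this point (PyTorch assumed, random weights, not taken from the video): because the query and key projections are two different learned matrices, the score matrix QK^T is generally not symmetric, so score(i, j) need not equal score(j, i).

      # QK^T symmetry check (PyTorch assumed, toy sizes).
      import torch

      torch.manual_seed(0)
      X = torch.randn(3, 8)      # 3 tokens with a toy embedding dimension of 8
      W_q = torch.randn(8, 8)    # query projection
      W_k = torch.randn(8, 8)    # key projection (different learned weights)

      scores = (X @ W_q) @ (X @ W_k).T
      print(torch.allclose(scores, scores.T))   # False: the score matrix is asymmetric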

  • @hari8568
    @hari8568 1 year ago

    At 41:51, how exactly do we get Zi? Are we using PCA?
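
    A small sketch of how the z_i are formed in scaled dot-product attention (PyTorch assumed, toy sizes; this follows the formula in the paper rather than the video's notation): each z_i is a softmax-weighted sum of the value vectors, Z = softmax(QK^T / sqrt(d_k)) V, and no PCA is involved.

      # Scaled dot-product attention sketch (PyTorch assumed): rows of Z are z_1 ... z_n.
      import math
      import torch

      torch.manual_seed(0)
      n, d_k = 3, 8
      Q, K, V = torch.randn(n, d_k), torch.randn(n, d_k), torch.randn(n, d_k)

      weights = torch.softmax(Q @ K.T / math.sqrt(d_k), dim=-1)   # attention weights, rows sum to 1
      Z = weights @ V                                             # each row z_i is a weighted sum of value rows
      print(Z.shape)    # torch.Size([3, 8])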

  • @sbera07
    @sbera07 1 year ago

    Can you provide the notes you made in this video?

  • @user-zn8rz3hu2k
    @user-zn8rz3hu2k 1 year ago

    I would like to recommend a fantastic paper by DeepMind that provides comprehensive and detailed explanations regarding this topic.

  • @subhadipsarkar7692
    @subhadipsarkar7692 11 months ago

  • @gasun1274
    @gasun1274 1 year ago +1

    You have a tendency to overly raise your tone while you speak; I do this subconsciously too, but you need to know that it is very annoying.

    • @mukherjeesrijit
      @mukherjeesrijit  1 year ago

      Thank you for your suggestion. I will keep it in mind.

  • @vishruttalekar8626
    @vishruttalekar8626 1 year ago +2

    Disappointing!!

    • @mukherjeesrijit
      @mukherjeesrijit  1 year ago

      Let me know how it can be improved.

    • @emrahe468
      @emrahe468 2 months ago

      @@mukherjeesrijit I'm a bit confused by the diagram in your video at @6:00. The left side seems to show inputs in a foreign language being fed into an encoder, while the right side displays multiple sequences in English. Is this setup discussing a specific type of decoder model like GPT-2, or is it more about an encoder-decoder architecture used for translation? The background of the diagram makes it hard to determine the exact context. With that diagram things got messy for me, so I couldn't stand much of it, sorry.