
Vision Transformer (ViT) - An image is worth 16x16 words | Paper Explained

  • Published on 17 Aug 2024
  • ❤️ Become The AI Epiphany Patreon ❤️ ► / theaiepiphany
    ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
    In this video I do a (semi) deep dive into the "An Image is Worth 16x16 Words:
    Transformers for Image Recognition at Scale" paper, which introduced the Vision Transformer.
    The paper is very interesting because it showed that, with minimal modifications, transformers can give better results than CNNs on image classification when pre-trained on enough data.
    Until now transformers ruled the NLP world, and now they are coming for the CV world as well!
    You'll learn about:
    ✔️ How the vision transformer works
    ✔️ Main ideas in the paper
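To make the "16x16 words" idea in the title concrete, here is a minimal sketch of how an image can be cut into fixed-size patches that then play the role of tokens. It is an illustration only, not code from the video or the linked notebook: the function name `patchify` and the nested-list image layout are assumptions made for this example, and the learned linear projection, class token and positional encodings are left out.

```python
def patchify(image, patch_size):
    """Split an H x W x C nested-list image into flattened patches, row-major.

    Hypothetical helper for illustration; real implementations usually do
    this with tensor reshapes or a strided convolution.
    """
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image sides must be divisible by patch_size")
    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Flatten one patch_size x patch_size x C block into a single vector.
            patch = [
                channel
                for row in image[top:top + patch_size]
                for pixel in row[left:left + patch_size]
                for channel in pixel
            ]
            patches.append(patch)
    return patches


# Toy 224 x 224 RGB image filled with zeros.
image = [[[0.0, 0.0, 0.0] for _ in range(224)] for _ in range(224)]
tokens = patchify(image, 16)
print(len(tokens), len(tokens[0]))  # 196 768
```

With 16x16 patches, a 224x224 RGB image becomes a sequence of (224/16)^2 = 196 "words", each a vector of 16*16*3 = 768 numbers.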
    ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
    ✅ paper: arxiv.org/abs/...
    ✅ transformer Jupyter Notebook: github.com/gor...
    ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
    ⌚️ Timetable:
    0:00 Enter The Vision Transformer, Jupyter Notebook
    1:00 Deep dive intro
    3:14 How does Vision Transformer work?
    4:39 Let's go even deeper
    8:33 Positional encoding inductive bias
    9:50 Model variants and results
    11:50 VTAB benchmark results
    13:00 Performance vs. amount of pre-training data
    16:08 What does Vision Transformer learn? (attention span)
    19:35 Self-supervision vs Supervised learning
    21:10 Scaling the transformer (future research prediction)
    22:15 Positional Encodings details
    24:15 Logging out
    ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
    💰 BECOME A PATRON OF THE AI EPIPHANY ❤️
    If these videos, GitHub projects, and blogs help you,
    consider helping me out by supporting me on Patreon!
    The AI Epiphany ► / theaiepiphany
    One-time donation:
    www.paypal.com...
    Much love! ❤️
    ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
    💡 The AI Epiphany is a channel dedicated to simplifying the field of AI using creative visualizations and, in general, a stronger focus on geometric and visual intuition rather than algebraic and numerical "intuition".
    ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
    👋 CONNECT WITH ME ON SOCIAL
    LinkedIn ► / aleksagordic
    Twitter ► / gordic_aleksa
    Instagram ► / aiepiphany
    Facebook ► / aiepiphany
    👨‍👩‍👧‍👦 JOIN OUR DISCORD COMMUNITY:
    Discord ► / discord
    📢 SUBSCRIBE TO MY MONTHLY AI NEWSLETTER:
    Substack ► aiepiphany.sub...
    💻 FOLLOW ME ON GITHUB FOR COOL PROJECTS:
    GitHub ► github.com/gor...
    📚 FOLLOW ME ON MEDIUM:
    Medium ► / gordicaleksa
    ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
    #transformers #computervision #visiontransformer

Comments • 62