ไม่สามารถเล่นวิดีโอนี้
ขออภัยในความไม่สะดวก

Swin Transformer

แชร์
ฝัง
  • เผยแพร่เมื่อ 19 เม.ย. 2024
  • In this video, we continue the vision transformer series, covering Swin Transformer, a general-purpose transformer backbone for computer vision. Swin Transformer is based on two key ideas: (1) designing a multi-scale hierarchical backbone suitable for computer vision, and (2) a carefully designed Swin Block composed of two window-based attention for efficient self-attention computation, while still enabling the long-range interactions between visual tokens.
    Link to Relative Attention video: • Relative Self-Attentio...

ความคิดเห็น • 1