Stable Diffusion 3: Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

  • Published Dec 2, 2024

Comments • 8

  • @TTTrouble 8 months ago +3

    Man, I took a break from getting into the weeds of the AI papers, but I really appreciate that you’re still at it, man, and it inspires me to jump back into the jungle. You’ve definitely been a fantastic source of knowledge and helped me break down some of this stuff in a really meaningful way. Keep up the great work!

  • @kevinxu9562 8 months ago +1

    GOATED, damn, thank you so much for making a video on this! The timing is goated; I just started going through your diffusion series as I'm trying to build a diffusion model!

  • @VisionTang several months ago

    Thank you a lot for sharing this! It helps me a lot.

  • @vladandronik5711 2 months ago

    Thanks for sharing! Struggled a bit with understanding flows, but you explained everything really nicely

  • @alexalex-lz8sg 7 months ago

    Cool, what about a latent adversarial diffusion distillation (LADD) video?

    • @gabrielmongaras 7 months ago

      Oh yeah, that was a good paper. Lemme maybe make a video on that. This week seemed a bit lacking in terms of papers :/

  • @mathiasbang1999 7 months ago

    Hey, I was wondering if you could clarify something for me. You say that the [154; 4096] matrix holds the "fine-grained" information, but when explaining the MM-DiT block setup, Y is marked as the fine-grained information. In my opinion it does seem to make more sense for Y to be the fine-grained information, since it is post-reduction information, but as I am not entirely sure, I would love for you to correct me on that :). Really appreciate the video! It makes a lot of sense overall. [See the conditioning sketch after the comments.]

  • @zeogod100 2 months ago

    I stopped watching when you named the image height "L", BLASPHEMY!
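
On the fine-grained vs. coarse question raised by @mathiasbang1999: in the SD3 paper, the pooled CLIP embeddings form the coarse per-prompt vector y (used for modulation), while the [154; 4096] matrix is the fine-grained per-token context that enters the joint attention of the MM-DiT blocks. Below is a minimal PyTorch sketch of how those two signals are typically assembled, assuming precomputed encoder outputs with the shapes reported in the paper; the function name and argument names are illustrative, not taken from the paper or the video.

```python
# Minimal sketch (not the paper's or the video's code) of how SD3-style
# text conditioning is assembled. Encoder outputs are assumed precomputed.
import torch
import torch.nn.functional as F

def build_text_conditioning(clip_l_tokens,  # [B, 77, 768]   CLIP-L/14 token embeddings
                            clip_g_tokens,  # [B, 77, 1280]  OpenCLIP-bigG/14 token embeddings
                            clip_l_pooled,  # [B, 768]       pooled CLIP-L output
                            clip_g_pooled,  # [B, 1280]      pooled bigG output
                            t5_tokens):     # [B, 77, 4096]  T5-XXL token embeddings
    # Coarse conditioning y: one pooled vector per prompt, combined with the
    # timestep embedding and used to modulate each MM-DiT block.
    y = torch.cat([clip_l_pooled, clip_g_pooled], dim=-1)               # [B, 2048]

    # Fine-grained context c: per-token embeddings that the image tokens
    # attend to inside the joint attention of the MM-DiT blocks.
    clip_tokens = torch.cat([clip_l_tokens, clip_g_tokens], dim=-1)     # [B, 77, 2048]
    clip_tokens = F.pad(clip_tokens, (0, 4096 - clip_tokens.shape[-1])) # [B, 77, 4096]
    c = torch.cat([clip_tokens, t5_tokens], dim=1)                      # [B, 154, 4096]
    return y, c
```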