Congrats on writing a paper! I notice that another recent paper from NVIDIA (nGPT) uses unit vectors for attention, where the dot product is naturally equal to the cosine since the lengths are one. Are these two works related to each other in any way?
Thanks!! I only read through the nGPT paper briefly, but I think nGPT is trying to make softmax attention/transformers more expressive and efficient by changing a few things. They normalize before applying the softmax, so the logits are cosine similarities between -1 and 1. However, they keep the softmax operation, which forces the model to stay quadratic in complexity. The paper I worked on removes the softmax, which allows the attention mechanism to be rewritten as an RNN that is linear in complexity.
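In case it's useful, here's a rough toy sketch of the difference I mean (my own single-head numpy version with made-up names like phi, not code from either paper): the first function keeps softmax over cosine-similarity logits and builds the full T x T score matrix, while the second drops softmax and uses a positive feature map so the same kind of computation can be carried by a running recurrent state, i.e. linear in T.

```python
import numpy as np

def softmax_cosine_attention(Q, K, V):
    """Normalize q/k so logits are cosine similarities, but keep softmax:
    the (non-causal, for brevity) score matrix is T x T, quadratic in T."""
    Qn = Q / np.linalg.norm(Q, axis=-1, keepdims=True)
    Kn = K / np.linalg.norm(K, axis=-1, keepdims=True)
    logits = Qn @ Kn.T                              # cosine similarities in [-1, 1]
    weights = np.exp(logits - logits.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

def linear_attention_rnn(Q, K, V, phi=lambda x: np.maximum(x, 0) + 1e-6):
    """Softmax removed: with a feature map phi, causal attention can be
    computed step by step with a running state, so cost is linear in T."""
    d_v = V.shape[-1]
    S = np.zeros((Q.shape[-1], d_v))                # running sum of phi(k) v^T
    z = np.zeros(Q.shape[-1])                       # running sum of phi(k) for normalization
    out = []
    for q, k, v in zip(Q, K, V):                    # one step at a time, like an RNN
        S += np.outer(phi(k), v)
        z += phi(k)
        out.append(phi(q) @ S / (phi(q) @ z + 1e-9))
    return np.stack(out)

T, d = 8, 16
Q, K, V = (np.random.randn(T, d) for _ in range(3))
print(softmax_cosine_attention(Q, K, V).shape, linear_attention_rnn(Q, K, V).shape)
```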
great one, really liked it
thanks
How is the equality in the DDPM derivation at 17:49 established?
Looks like I forgot to write out the square root over the first term. As for the inner term that got turned into a fraction, I just multiplied sqrt{1-a_t} by the fraction sqrt{1-a_t}/sqrt{1-a_t}.
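Written out (using the same a_t notation as above for the alpha term; this is only the algebraic identity, not the full equation from 17:49), multiplying by that fraction is just multiplying by one:

\sqrt{1-a_t} \;=\; \sqrt{1-a_t}\cdot\frac{\sqrt{1-a_t}}{\sqrt{1-a_t}} \;=\; \frac{1-a_t}{\sqrt{1-a_t}}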