This is exactly the kind of breakdown of this paper I was looking for as I am just getting into the deep end of neural audio generation. The explanation of RVQ is very helpful. Thank you for making these videos!!
Hi, what I am still wondering is why we can't just use teacher forcing along the codebook dimension? Thanks.
What an amazing video, finally I fully understood the paper! Thank you so much!
Thank you so much! Your explanation is so clear, and it is exactly what I'm looking for.
Fantastic overview Gabriel, thanks for sharing
Glad you found it helpful!
Thanks! Just what I needed.
Great video! Is the transformer essentially predicting the indices of each codebook?
Amazing overview
Nicely Done!
nice illustration
@gabrielmongaras: Hi, thanks very much for the very detailed and easy-to-understand explanation. Just one thing: it isn't super clear to me from the video why the error compounds with the parallel pattern. Thanks again!
Great explanation of the paper and of the audio encoding and decoding. But where does the text prompt fit into this?
Where do the text tokens fit in?
MusicGen experiments with three different text encoders, but the method of conditioning the transformer is the same for all of them. Generally, when using a transformer, you can just condition it with a cross-attention block, as in the original "Attention Is All You Need" paper.
@@gabrielmongaras The model is supposed to accept text as input, right? When it generates music based on a text prompt, what is the model's input?
The text would be put through some sort of encoder transformer architecture, and the output of that encoder would then be used to condition the main audio generation model through cross-attention (see the short sketch after this thread). The audio is generated the same way, just with this extra conditioning.
@@gabrielmongaras OK, I understand the conditioning, but without an audio input in inference mode, what do we add the text-conditioning embedding to? Maybe a silly question, I don't know.
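To make the cross-attention conditioning described in this thread concrete, here is a minimal PyTorch sketch (my own illustration, not the official MusicGen code; the hidden size, head count, and tensor shapes are assumptions). The encoded text supplies the keys and values, the audio-token hidden states supply the queries, and at inference decoding starts from a start-of-sequence embedding rather than from any audio input, so the text embedding is consumed through attention rather than being added to an audio input.

import torch
import torch.nn as nn

d_model, n_heads = 512, 8                  # illustrative sizes, not the paper's

# Cross-attention block: audio hidden states attend to the encoded text prompt.
cross_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

# Stand-in for a frozen text encoder (e.g. T5): one vector per text token.
text_emb = torch.randn(1, 12, d_model)     # (batch, text_tokens, d_model)

# At inference there is no audio yet: decoding begins from a single
# start-of-sequence embedding and grows autoregressively from there.
audio_hidden = torch.randn(1, 1, d_model)  # (batch, generated_steps, d_model)

# Queries come from the audio stream, keys/values from the text encoding,
# so the text conditions generation without being added to the audio tokens.
conditioned, _ = cross_attn(query=audio_hidden, key=text_emb, value=text_emb)
print(conditioned.shape)                   # torch.Size([1, 1, 512])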
Hey, if the frame rate is equal to 50*64, does that mean that each second the model needs to predict 50*64 values (without counting the dim)?
The model works with the codebook indices, no?
Where do you get the 64? If we have e.g. 4 codebooks, and there are 50 time steps per one second of audio, we need to predict 4 * 50 values per second (4 codebook indices per time step), I believe.
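As a quick sanity check on the numbers in this thread, here is the per-second arithmetic using the illustrative values above (4 codebooks, 50 time steps per second); each predicted value is just an integer index into a codebook, not a full embedding vector.

codebooks = 4        # number of residual codebooks, as in the example above
frame_rate = 50      # frames (time steps) per second of audio

# One discrete index is predicted per codebook per time step,
# so the transformer predicts frame_rate * codebooks indices each second.
indices_per_second = frame_rate * codebooks
print(indices_per_second)   # 200, i.e. 4 * 50 per the reasoning above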