If you're completely new to GANs, I recommend you check out the GAN playlist, where there is an introduction video on how GANs work, and then watch this video where we implement the first GAN architecture from scratch. If you have recommendations on GANs that you think would make this an even better resource for people wanting to learn about GANs, let me know in the comments below and I'll try to do it :)
I learned a lot and was inspired to make these GAN videos by the GAN specialization on Coursera, which I recommend. Below you'll find both affiliate and non-affiliate links; the pricing for you is the same, but a small commission goes back to the channel if you buy it through the affiliate link.
affiliate: bit.ly/2OECviQ
non-affiliate: bit.ly/3bvr9qy
Here's the outline for the video:
0:00 - Introduction
0:29 - Building Discriminator
2:14 - Building Generator
4:36 - Hyperparameters, initializations, and preprocessing
10:14 - Setup training of GANs
22:09 - Training and evaluation
I think at 18:22 using detach is better. For one thing, retain_graph=True costs more memory, and for another, if we don't use detach, gradients also flow back into G's parameters when we train D.
If we use detach, what is the point of disc_fake?
disc_fake = disc(fake.detach()).view(-1), and if we do a backward() we get no grads out of it (because fake.detach() has requires_grad=False), which means no update happens here
Ah, my bad. fake.detach() won't get updated but disc()'s parameters will
Awesome video, you explain exactly what should be explained, I love it!
8:00 - transforms.Normalize((0.1307,), (0.3081,)) will not work because of the following:
* nn.Tanh() output of the Generator is (-1, 1)
* MNIST values are [0, 1]
* Normalize does the following for each channel: image = (image - mean) / std
* So transforms.Normalize((0.5,), (0.5,)) converts [0, 1] to [-1, 1], which is ALMOST correct, because the nn.Tanh() output of the Generator is (-1, 1), excluding exactly 1 and -1.
* transforms.Normalize((0.1307,), (0.3081,)) converts [0, 1] to ≈ (-0.42, 2.82). But the Generator cannot produce values greater than 0.9999... ≈ 1, so it will never output 2.82 for white pixels.
That is why transforms.Normalize((0.1307,), (0.3081,)) will not work.
P.S. To use transforms.Normalize((0.1307,), (0.3081,)) you would have to scale the nn.Tanh() output, e.g. nn.Tanh() * 2.83 ≈ (-2.83, 2.83)
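A quick numerical check of those two mappings (a minimal sketch; assumes torch and torchvision are installed):

import torch
from torchvision import transforms

x = torch.tensor([0.0, 1.0]).view(1, 1, 2)  # min and max MNIST pixel values, shape (C, H, W)

# mean=0.5, std=0.5 maps [0, 1] -> [-1, 1], matching the Tanh output range:
print(transforms.Normalize((0.5,), (0.5,))(x))         # tensor([[[-1., 1.]]])

# mean=0.1307, std=0.3081 maps [0, 1] -> roughly (-0.42, 2.82),
# a range that Tanh (bounded in (-1, 1)) can never reach for white pixels:
print(transforms.Normalize((0.1307,), (0.3081,))(x))   # ≈ tensor([[[-0.4242, 2.8215]]])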
This makes total sense, thanks for clarifying!
Thank you so much for explaining this... :)
Perfect explanation of the loss function and why we use the minimization instead of maximization of Discriminator.
Minimizing is generally simpler and faster computationally than computing maxima.
You Are Awesome😎😎. Please Continue This Series...Thanks For Awesome Video Series
Thanks a lot for the video. It really helped me in understanding the nuances of GANs and helped me write one from scratch as well. Keep on going, buddy!
Very nicely explained. Loved your clarity.
thanks a lot, now i have a better understanding of GAN
Can you made a video on Cyclic GAN ?
A bit late but it's finished now. Paper walkthrough is up and implementation from scratch will be up in a few days :)
At 15:33, and 19:14 why use disc.zero_grad() and gen.zero_grad() instead of using opt_disc.zero_grad(), and opt_gen.zero_grad() respectively ?
how can i include tensorboard features within the GAN ipynb file to visualize the log files?
If you're using Google Colab
Just add these lines:
%load_ext tensorboard # To load the tensorboard notebook extension
%tensorboard --logdir logs # before training your model
Nice intro to GANs, thanks!
Thanks for sharing , that's really helpful!
how do you get the images to show on tensorboard?
Hello, I love your videos as they are very precise, but how do I view the results when using Colab instead of TensorBoard?
I love the way your ide looks, what are you using/ what settings?
Dang Man! Love your videos, you're EPIC!!!
What changes would there be in the code if we used disc(fake).detach() instead? Would there be any changes to line 77 (at 18:34)?
Thanks Awesome and simple implementation :)
Hello. Thanks for the video. I tried this code exactly, except that I used 400 epochs, but the fake images still look like noise. How did you get these results on TensorBoard? Can you please share the hyperparameters you used?
What are your IDE settings btw? Theme and font
What is the song at 20:00 called? Sounds so chill, it made me move like on the dancefloor while learning GANs with you at my desk
Thank you very much! I learned a lot
This worked for me thanks, am enjoying this playist!
Thanks for the amazing content really helpful, Can we have some GAN stuff using audio data please? voice cloning maybe?
Thanks again
When computing lossD, what is the difference in practice between summing versus averaging lossD_real and lossD_fake? @15:20
Nice work!
Question: Why do we use zero_grad with disc and gen and not with opt_disc and opt_gen?
Great Tutorial.
Question:
18:18
Code line 77.
We have to compute disc(fake) twice? Can't we simply write: "output = disc_fake"?
(I thought we add retain_graph=True in order to avoid the computation of the disc(fake) twice)
We do retain_graph so that we don't have to compute fake twice; we can reuse the same image that has been generated. We send it through the discriminator again because we updated the discriminator, and the way I showed in the video is the most common setup I've seen when training GANs. Although it would probably also work if you reused disc_fake from before.
@@AladdinPersson Got you...! Thanks a lot
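For reference, here is a minimal runnable sketch of that setup (the tiny models are just placeholders so the snippet runs on its own; the real models, data loading, and hyperparameters are in the video):

import torch
import torch.nn as nn

# Toy stand-ins so the snippet is self-contained; the video's models are larger.
z_dim, img_dim, batch_size, device = 64, 784, 32, "cpu"
gen = nn.Sequential(nn.Linear(z_dim, 256), nn.LeakyReLU(0.01), nn.Linear(256, img_dim), nn.Tanh()).to(device)
disc = nn.Sequential(nn.Linear(img_dim, 128), nn.LeakyReLU(0.01), nn.Linear(128, 1), nn.Sigmoid()).to(device)
opt_gen = torch.optim.Adam(gen.parameters(), lr=3e-4)
opt_disc = torch.optim.Adam(disc.parameters(), lr=3e-4)
criterion = nn.BCELoss()
real = torch.rand(batch_size, img_dim).to(device)  # placeholder for a flattened real MNIST batch

# Train Discriminator: max log(D(real)) + log(1 - D(G(z)))
noise = torch.randn(batch_size, z_dim).to(device)
fake = gen(noise)
disc_real = disc(real).view(-1)
lossD_real = criterion(disc_real, torch.ones_like(disc_real))
disc_fake = disc(fake).view(-1)
lossD_fake = criterion(disc_fake, torch.zeros_like(disc_fake))
lossD = (lossD_real + lossD_fake) / 2
disc.zero_grad()
lossD.backward(retain_graph=True)   # keep the graph so `fake` can be reused below
opt_disc.step()

# Train Generator: min -log(D(G(z))), reusing the same `fake` without recomputing gen(noise)
output = disc(fake).view(-1)        # send fake through the (now updated) discriminator again
lossG = criterion(output, torch.ones_like(output))
gen.zero_grad()
lossG.backward()
opt_gen.step()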
Thanks, it was a very good video!
Why are we using 128 nodes in the Discriminator class? Isn't that kind of a random number?
And why 256 in the Generator?
The code runs great, but how did you make those images at 20:37 appear????
I've been trying to do that in Google Colab; the code works, but no images show up.
Same question.
@@ZOBAER496 %load_ext tensorboard
%tensorboard --logdir logs
run this in a separate cell
!python3 -c "import tensorflow as tf; print(tf.reduce_sum(tf.random.normal([1000, 1000])))" - what is the relevance of this to the GAN you have worked on in this video
Thank you, helped me alot
In my experience, mixing ReLU with tanh does not work super well. This is also a point you might add to your final list of possible improvements, e.g. only using tanh for the whole generator.
Why is your TensorBoard automatically updating with the new images? For me, I have to refresh the page in order for it to update
Great effort. Good tutorial
Thank you so much for the material, this is awesome! I have a small question. Why would it be `disc.zero_grad()` instead of `opt_disc.zero_grad()`? in general, are these 2 statements interchangeable?
yeah, I'm confused about that too, dude
they're the same
Both are the same since it optimizes the model parameters
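They're interchangeable here because the optimizer was built from the model's parameters, so both calls clear the same gradients. A tiny sketch (they would only differ if one optimizer managed parameters from several models, or only some of a model's parameters):

import torch
import torch.nn as nn

model = nn.Linear(4, 1)
opt = torch.optim.Adam(model.parameters(), lr=3e-4)

model(torch.randn(2, 4)).sum().backward()
model.zero_grad(set_to_none=False)   # zeroes the .grad of every parameter in `model`
print(model.weight.grad)             # tensor of zeros

model(torch.randn(2, 4)).sum().backward()
opt.zero_grad(set_to_none=False)     # zeroes the .grad of every parameter the optimizer was given
print(model.weight.grad)             # also zeros, since opt holds model.parameters()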
How did you get the tensorboard site to pop up?
Perhaps I didn't show it in the video but you have to run it through conda prompt (or terminal etc). I have more info on using tensorboard in a separate video so I was kind of assuming that people knew it but I could've been clearer on that!
@@AladdinPersson this is new for me so I’m still learning all the tools. Please keep doing tutorials btw!! You have been helping me learn AI so much faster due to your pytorch implementations.
@@privacywanted434 Thanks for saying that, I appreciate you 👊
awesome!! one request from me ...can you make a video on text to image using GAN's please !!!
Do you have this GAN code available for downloading?
how do we get the training accuracy at each epoch ?
Nice video, thanks. Can you please make a video on RCGAN?
Hey, so this simple GAN generates any number? What I mean is, the neural networks have not learnt the features of 0, 1, 2, 3, ... individually, they have learnt what features make up a number in general? Then when z, the random sample from a distribution, is plugged into the generator, it generates a random number because of the noise it was given? Hence, the results could be better if you created a GAN pair for each individual number, which would obviously take a lot more training time and the networks would be mutually exclusive and not random, so you'd have a GAN pair that generates a fake version of every digit.
Hey, please help me... we wrote down the same code but are still getting an error. What should we do?
You're like magic 🔥
how would it be different if we use AdamW instead of Adam?
Hello. What should be different for non-square image data?
CNNs instead of fc layers.
Hi can you explain why we would use BCE loss on the Generator as well and why we would compare it to a tensor of 1s? It makes sense to me to use it for the discriminator as it is a classifier, but is the generator not doing some form of regression?
The formula is log(1 - D(G(Z))). So we use it on the discriminator.
On line #66, why does the gen function only take in one argument, noise? I thought it must take in two arguments, z_dim and img_dim. Can you explain please?
Is anyone having trouble with the TensorBoard visualization? I am using PyCharm but the visualization part doesn't get executed
Is the intro equation the cross-entropy loss function?!
Is it possible to train this gan with a .CSV dataset?
what program do you do this in?
Thanks!
What notebook prompt are you using to call up that TensorBoard UI?
Figured it out, for anyone with the same question. In a separate cell run:
%load_ext tensorboard
%tensorboard --logdir logs
magic
@@Zeoytaccount Oh Babe it works ,such a sweety wish I could send you a thankuuuuuu
Please can someone tell which editor he is using?
can someone explain what z_dim is actually?
thank you~
What version of CUDA are you using?
The latest one always pretty much, which as of right now is cuda 11.1 I think
Really love this series man!! Just a quick question though: why did we use fixed_noise and noise differently? In the training part, could we not have used fixed_noise as input to the generator, because noise is noise, right? Does it matter if we start from the same point?
Fixed noise is used to display the images to track the progress of the GAN. Fixed means it doesn't change over time, so if you were to use this in training, you would be feeding the GAN the same vector over and over again, and the GAN would only be able to generate a single image, and the rest of the latent space would remain unexplored.
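In code terms, a small sketch of the difference (the generator call and the logging step are only indicated in comments; just the noise handling is shown):

import torch

z_dim, batch_size, num_epochs = 64, 32, 3

# Sampled once before training: used only for logging, so you can watch the
# *same* latent vectors turn into better and better digits as training goes on.
fixed_noise = torch.randn(batch_size, z_dim)

for epoch in range(num_epochs):
    # Fresh noise every training step, so the generator explores the whole latent space.
    noise = torch.randn(batch_size, z_dim)
    # fake = gen(noise)  # ...discriminator / generator updates as in the video...
    # At the end of the epoch, log gen(fixed_noise) to TensorBoard to track progress.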
At the discriminator
we want to max log(D(real)) + log(1 - D(G(z))). Since loss functions work by minimizing error, we can minimize
-(log(D(real)) + log(1 - D(G(z)))). The BCE loss is equivalent to minimizing the loss written above, so it works fine.
At the generator
we want to max log(D(G(z))). Could you please explain how criterion(output, torch.ones_like(output)) maximizes log(D(G(z)))? Because the loss function is l_n = -w_n * [y_n * log(x_n) + (1 - y_n) * log(1 - x_n)]. According to your code, aren't we trying to maximize -log(D(G(z)))? Because there is a negative in the loss function, shouldn't we add a negative in our training phase? Please explain, I am stuck here
Nevermind, I understood it. Thanks
@@madhuvarun2790 can you please elaborate? As I see it, on the discriminator side, loss_real = -log(D(real)) and loss_fake = -log(1 - D(G(z))), but it's still minimizing, right? I can't understand how that's maximizing the loss; the same doubt with the generator loss
@@asagar60 Yes, it is minimizing the loss. I was wrong. At the discriminator we are minimizing -(log(D(real)) + log(1 - D(G(z)))), and at the generator we are minimizing -log(D(G(z)))
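For anyone else stuck on the same point, a tiny numerical check (a sketch; criterion is nn.BCELoss() as in the video). With all-ones targets the BCE formula collapses to -log(x), so minimizing it is exactly maximizing log(D(G(z))):

import torch
import torch.nn as nn

criterion = nn.BCELoss()
output = torch.tensor([0.2, 0.7, 0.9])              # pretend these are D(G(z)) values

# With targets y = 1, BCE reduces to l = -[1*log(x) + 0*log(1-x)] = -log(x)
lossG = criterion(output, torch.ones_like(output))
print(lossG)                                        # mean of -log(output)
print((-torch.log(output)).mean())                  # same value

# So minimizing lossG pushes D(G(z)) toward 1, i.e. maximizes log(D(G(z))).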
This video was really helpful, but what if I don't want to use the MNIST dataset and want to use my own dataset from my local machine? How do I go about it?
I have separate videos on how to use custom datasets, for something written I highly recommend: pytorch.org/tutorials/beginner/data_loading_tutorial.html
Heyo, awesome vid as always! I wanted to ask you if you could do some variational autoencoders in pytorch & maybe also cover some of the mathematics of the special variants, if you are interested (i.e. as you're doing for GANs)? :)
I had confusion regarding lines 68 and 70: why are we creating ones and zeros in the criterion?
Please clarify this portion....
great work as always....
Roughly speaking, we want the discriminator to estimate the *probability that its input is real*. Therefore the desired output for disc(real) is 1, and 0 for disc(fake).
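That's exactly what the ones and zeros on lines 68 and 70 encode. A small illustrative sketch (made-up discriminator outputs; criterion = nn.BCELoss() as in the video):

import torch
import torch.nn as nn

criterion = nn.BCELoss()
disc_real = torch.tensor([0.8, 0.9])   # D(real): we want these pushed toward 1
disc_fake = torch.tensor([0.3, 0.1])   # D(G(z)): we want these pushed toward 0

lossD_real = criterion(disc_real, torch.ones_like(disc_real))    # = -log(D(real))
lossD_fake = criterion(disc_fake, torch.zeros_like(disc_fake))   # = -log(1 - D(G(z)))
lossD = (lossD_real + lossD_fake) / 2
print(lossD_real, lossD_fake, lossD)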
I can't see tensorboard. I am running the same code on colab. Please help me. Thank You
%load_ext tensorboard
%tensorboard --logdir logs
run this in a separate cell, it works
for batch_idx, (real, _) in enumerate(loader):
for this part it's giving an error:
TypeError: 'module' object is not callable
How do you open the TensorBoard???
I'm stuck there, someone link a vid
%load_ext tensorboard
%tensorboard --logdir logs
run this in a separate cell, it works
what does the parameter z_dim mean?
Very good explanation of each and every line of code. Can you please make a video on how to optimize GANs with the Whale Optimization Algorithm? I have to do my project on GANs and this is my base paper: "Automatic Screening of COVID‑19 Using an Optimized Generative Adversarial Network". I have searched a lot about how to optimize GANs with WOA but couldn't find any related results. Please help me, as you have detailed knowledge about GANs.
how can i run that tensorboard?
same doubt
why won't my TensorBoard open?
In GANs, should the generator loss decrease and the discriminator loss increase? Is that so? I am a little bit confused.
The losses in GANs don't really tell us anything (one will go up when the other goes down and vice versa). The only thing you want to watch out for is if the discriminator loss goes to 0 or something like that, which would be the case if one of them "takes over".
How can I flip the labels 1 for fake and 0 for real ? Thanks a lot this video is helping me a lot !!! 😍
Hello sir, can you tell me how to convert the GAN-generated dataset into .jpg format??? Please
Be careful with JPGs in your training set. JPG uses 8x8 blocks that introduce artifacts; either use very high quality JPGs or, even better, PNGs.
I have no idea what is happening but its soo interesting
How can I transfer this code to work with RGB images? It keeps printing lines as an output after learning instead of images :(
you need to use dcgan instead of gan.
Would you mind sharing the name of intro music? :D
SAME!
It is Straight Fuego by Matt Large
@@beizhou2488 Thankyou mah man.
How would I edit this if I wanted to use my own dataset?
If it's not a dataset included in PyTorch's torchvision, you could create a custom dataset class (it's not too difficult). I have a separate video on custom datasets in PyTorch you could take a look at. Here is also a great official tutorial from PyTorch: pytorch.org/tutorials/beginner/data_loading_tutorial.html
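If it helps, here is a rough sketch of what such a custom dataset class can look like for a plain folder of images (the folder layout, file extensions, and the 28x28 grayscale transform are assumptions for illustration, not something from the video):

import os
from PIL import Image
from torch.utils.data import Dataset, DataLoader
from torchvision import transforms

class FolderOfImages(Dataset):
    """Loads every image found directly inside root_dir (hypothetical layout)."""
    def __init__(self, root_dir, transform=None):
        self.root_dir = root_dir
        self.transform = transform
        self.files = [f for f in os.listdir(root_dir)
                      if f.lower().endswith((".png", ".jpg", ".jpeg"))]

    def __len__(self):
        return len(self.files)

    def __getitem__(self, index):
        img = Image.open(os.path.join(self.root_dir, self.files[index])).convert("L")  # grayscale like MNIST
        if self.transform is not None:
            img = self.transform(img)
        return img, 0  # dummy label so the (real, _) unpacking in the training loop still works

transform = transforms.Compose([
    transforms.Resize((28, 28)),
    transforms.ToTensor(),
    transforms.Normalize((0.5,), (0.5,)),
])
# dataset = FolderOfImages("path/to/your_images", transform=transform)  # hypothetical path
# loader = DataLoader(dataset, batch_size=32, shuffle=True)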
Wonderful intro to GANs, thank you very much! Actually I still feel a little confused about what z_dim is...
z_dim is the size of the latent noise vector you sample from a known distribution and feed to the generator to produce images. I guess 64 is way too high for MNIST; maybe you can use 10 so you can blend any of the digits.
thanks
why do we have (lossD_real + lossD_fake)/2
Ooh the jacobian
Is it normal that this easily takes 1-2 hours for 50 epochs?
I first ran it on my computer, which unfortunately has no NVIDIA GPU. Then I tried it on Google Colab, which originally had it running on its CPU too. So I changed the hardware acceleration to GPU, aaaaand... if it's faster, it's not by much. Is that normal? Does this not benefit significantly from GPUs?
Hey, how did you overcome this error in Colab?
TypeError                                 Traceback (most recent call last)
in ()
      1 for epoch in range(num_epochs):
----> 2     for batch_idx, (real, _) in enumerate(loader):
      3         real = real.view(-1, 784).to(device)
      4         batch_sz = real.shape[0]
      5
4 frames
/usr/local/lib/python3.7/dist-packages/torchvision/datasets/mnist.py in __getitem__(self, index)
    132
    133         if self.transform is not None:
--> 134             img = self.transform(img)
    135
    136         if self.target_transform is not None:
TypeError: 'module' object is not callable
@@drishtisharma3933 Hey Drishti! Yes, I was able to overcome this error but I do not remember the exact changes I made to the code.
I could share my colab notebook for your clarity.
Honestly, I didn't try your approach. I was following the video as a code-along.
Link: colab.research.google.com/drive/1l1Vt7mcoEQKFxxVbpQOeKZ-UiEHU9ggt?usp=sharing
can you please share the code
Shouldn't it be optimizer.zero_grad instead of model.zero_grad?
You can use both
11:25
You forgot to put right parenthesis..
Kidding :P
Thanks for the video bro
I'm admittedly a noob to all of this, but I keep getting this "TypeError: __init__() takes 1 positional argument but 2 were given" and I can't figure out how to resolve the issue, any advice would be appreciated
Difficult to say w/o code, in this case it seems like you're sending in too many arguments haha
If I had to guess, you might have a class method that doesn't have a "self" parameter
It seems like while defining the class method you originally wrote a method that takes one argument, but when calling that method you provided two arguments.
e.g.
def lets_solve(error):
    pass
# Instantiating an object now
solution = lets_solve(error, YOU PROVIDED ONE EXTRA ARGUMENT HERE)
YOU PROVIDED ONE EXTRA ARGUMENT HERE ----> denotes the extra argument which you shouldn't have provided, since the original code takes just one argument. Hope this makes sense. Good luck!
GREAT
Hello! I have been subscribed to you for a long time, though I haven't watched your videos other than the machine learning from scratch ones. What are the prerequisites to start learning from this series?
For those of you who are very well versed in deep learning: how did you learn? I am not intimidated by math... but it takes some time for me to understand. Please give me some helpful tips. In what order should I start learning deep learning?
This would be a great help.
Hey, I have a video, "How to learn deep learning", that I think answers your questions :)
I don't know much about PyTorch but I'll figure it out...
Why do we maximize the generator loss? Shouldn't the generator be good at identifying the fakes generated by the discriminator?
The generator doesn't identify; it only generates. Minimizing its loss means making the generator produce samples very close to the real ones so that they are not identified as fake by the discriminator.
7:45 bruh :D
nn.Linear for what, bro?
Please make a video about anime InfoGAN
Re: (I just wanted to make sure that people understand that this is a joke...) | on lr = 3e-4
Damn, I nearly had a heartbreak when I set the wrong values for transforms.Normalize
lr of 3e-4 was a joke by Karpathy
the karpathy constant was never a joke