Great explanation! Didn't understand it fully until I saw this video. Thanks!
Thanks! What about the pi policy of the network? Why do we never use it to make a move?
We use it in the U part of the choosing function: the prior probability there is evaluated by the NN.
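To make that reply concrete, here is a minimal sketch of how the network's prior can enter the U term. It assumes the video follows the AlphaGo Zero-style PUCT rule, U = c_puct * P * sqrt(sum of visits) / (1 + N), with the move chosen by maximizing Q + U. The function names, the list-based node layout, and the default c_puct value are illustrative assumptions, not taken from the video.

```python
import math

def puct_scores(priors, visit_counts, q_values, c_puct=1.5):
    """Return Q + U for every move at one tree node.

    priors       -- P(s, a): prior probabilities from the network's policy head
    visit_counts -- N(s, a): how often each move was visited during search
    q_values     -- Q(s, a): mean value backed up through each move
    c_puct       -- exploration constant (assumed value, tune per game)
    """
    total_visits = sum(visit_counts)
    scores = []
    for p, n, q in zip(priors, visit_counts, q_values):
        # U is large for moves the network likes (high p) that have been
        # visited rarely (low n), and shrinks as n grows.
        u = c_puct * p * math.sqrt(total_visits) / (1 + n)
        scores.append(q + u)
    return scores

def select_move(priors, visit_counts, q_values, c_puct=1.5):
    """Pick the index of the move with the highest Q + U."""
    scores = puct_scores(priors, visit_counts, q_values, c_puct)
    return max(range(len(scores)), key=scores.__getitem__)

if __name__ == "__main__":
    # Move 1 has the highest prior and few visits, so its U term dominates.
    print(select_move([0.2, 0.7, 0.1], [10, 1, 1], [0.1, 0.1, 0.1]))
```

So the policy is not used to pick a move directly; it biases which branches the search explores. In AlphaGo Zero-style setups the move actually played is then typically chosen from the visit counts at the root.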