Very interesting talk!
The shaping method you found seems surprisingly simple; that should be huge!
This is perhaps only indirectly related, but:
I wonder if we could somehow go from a "discrete", layer-by-layer evaluation to something akin to the various Monte Carlo techniques that just sample from the continuum solution.
Not sure how you'd take care of (or replace) the discrete parameters of an NN in such a setting, but I'm picturing the difference between "radiosity" and "path tracing" in rendering: in path tracing, if done correctly, you can directly, and without bias, approximate the continuum limit of the distribution of light in a scene, and it's all built on stochastic processes.
You can even handle "infinitely deep paths" correctly by stochastically terminating paths at a *finite* depth through a Russian Roulette procedure, and you can combine many sampling procedures near-optimally through multiple importance sampling. More recently, that has even become possible for a *continuum* of sampling methods, in the form of *stochastic* multiple importance sampling.
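To make the Russian Roulette part concrete (this is just textbook Monte Carlo, nothing specific to the talk): you can estimate an infinite sum by terminating at a random finite depth and reweighting the surviving terms, and the estimate stays unbiased:

```python
import random

def russian_roulette_estimate(term, cont_prob=0.9, rng=random):
    """Unbiased estimate of sum_{k=0}^inf term(k).

    At each depth we continue with probability `cont_prob` and divide the
    surviving contributions by that probability, so the expected value of
    the (always finite) sample equals the infinite sum.
    """
    total, weight, k = 0.0, 1.0, 0
    while True:
        total += weight * term(k)
        if rng.random() >= cont_prob:   # terminate the "path" here
            return total
        weight /= cont_prob             # compensate for having survived
        k += 1

# Example: sum_k 0.5**k = 2, estimated with finite-depth samples only.
term = lambda k: 0.5 ** k
samples = [russian_roulette_estimate(term) for _ in range(100_000)]
print(sum(samples) / len(samples))      # ~ 2.0
```

That same trick is what lets a path tracer handle arbitrarily long light paths with a finite amount of work per sample.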
I'd imagine something similar could be used for *actually* training and evaluating *"infinite"* (both in width and depth) NNs by only ever evaluating them to some finite, but stochastically chosen and task-dependent, depth.
The main question to me is how you'd even set or store the weights in such a setting in a finite amount of memory. I'm guessing you'd somehow have the weights be defined through something like a Gaussian (mixture) process, but it's probably much easier said than done.
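Purely to illustrate the "finite memory for a continuum of weights" part (my own toy construction, not anything from the talk): you could define the weights as a *function* of a continuous depth coordinate using a fixed number of random Fourier features (a cheap stand-in for sampling from a Gaussian process), and then evaluate the "infinitely deep" net at however many depths you feel like per forward pass:

```python
import numpy as np

rng = np.random.default_rng(0)

d, n_feat = 16, 64                        # hidden width, number of Fourier features (fixed memory)
omega = rng.normal(size=n_feat)           # random frequencies over the depth coordinate t
phase = rng.uniform(0, 2 * np.pi, n_feat)
coef  = rng.normal(size=(n_feat, d, d)) / np.sqrt(n_feat)

def weight_at(t):
    """Weight matrix W(t) for any continuous depth t in [0, 1].

    A finite random-Fourier-feature expansion, i.e. an approximate sample
    from a stationary Gaussian process over depth; the memory cost is fixed
    no matter how many depths we later query.
    """
    feats = np.sqrt(2.0) * np.cos(omega * t + phase)   # shape (n_feat,)
    return np.tensordot(feats, coef, axes=1)           # shape (d, d)

def forward(x, n_steps):
    """Evaluate the depth-continuous residual net with n_steps Euler steps."""
    dt = 1.0 / n_steps
    for k in range(n_steps):
        t = (k + 0.5) * dt
        x = x + dt * np.tanh(weight_at(t) @ x)
    return x

x0 = rng.normal(size=d)
print(forward(x0, 8))     # coarse, cheap evaluation
print(forward(x0, 512))   # finer evaluation approaches the continuum (ODE) limit
```

How you'd actually *train* something like this, and whether a Russian-Roulette-style random truncation of the depth can be made unbiased given that the layers compose nonlinearly, is exactly the part I'm hand-waving over.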