Q Learning Project

Deep Q-Networks Explained!

Q - Learning Algorithm - Reinforcement Machine learning- شرح عربي

Mache leckere Lutscher mit diesem PRO-Gadget! 🚽🍭

ถ้าม้าโดนแกล้งที่โรงเรียน ม้าจะฟ้องครูว่าอะไร #แต้มเซน #การ์ตูน #tamzen #ตลก #shortvideo #การ์ตูน

ตรวจหวยงวดวันที่ 16 ธันวาคม 2567 พร้อมรางวัล N3 รางวัลพิเศษ รางวัล 2 ตัว : Matichon Online

Q Learning Algorithm شرح

د. محمد زيدان

มุมมอง 6 924

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 30 ม.ค. 2025

ความคิดเห็น • 17

@roozy_313a2 7 หลายเดือนก่อน ⁺¹
رحم الله والديك دكتور 🌺
@jawaheralbaddawi ปีที่แล้ว ⁺¹
what we do if start from room 0 ?
@shahd.h6430 ปีที่แล้ว ⁺¹
episode 1:
state= room 4, action-room =0
Q(4,0)=100+ 0.1*0 =100
episode 2:
state=3, action=4 Q(3,4)=0+0.1*100=10
episode 3:
state=1, action=3
Q(1,3)=0+0.1*10=1
So the path will be from room 1 to room 3 to room 4 to room 0 which is the goal
@youssef-ns9ny ปีที่แล้ว
Youssef mohamed
The quiz consists of three episodes, where the goal is to determine the values of certain Q-functions. In episode 1, the Q-function for state (4,0) is determined to be 100, which signals the completion of the episode. In episode 2, the Q-function for state (5,4) is found to be 10, with the Q-function for state (4,0) still being 100, and the episode is completed. Similarly, in episode 3, the Q-function for state (3,4) is found to be 10, with the Q-function for state (4,0) still being 100, and the episode is completed. The final path to reach the goal room 0 involves moving from room 1 to room 3 to room 4 before reaching the goal in room 0.
@gamaladel9308 ปีที่แล้ว ⁺¹
❤❤
@yosrmahmod5473 ปีที่แล้ว
Episode 1:
Initial state at room 1
Q(1,3)=0+0.1*0=0
Q(3,4)=0+0.1*100=10
Episode 2:
Initial state at room 4
Q(4,0)=100+0.1*0=100
Episode 3:
Initial state at room 5
Q(5,4)=0+0.1*100=10
Path 5 to 4 to 0
@nadaabdo7830 ปีที่แล้ว
Nada Abdelregal
The answer of the quiz is:
Episode 1:
State:room4 , action: room 0
Q(4,0)=100 which is the goal
Episode 2:
State:room5 , action: room 4
Q(5,4)=0+0.1*(100 ,0 ,0)=10
Episode 3:
State:room3 , action: room 4
Q(3,4)=0+0.1*(100 ,0 ,0)=10
So the path will be from room 1 to room 3 to room 4 to room 0 which is the goal
@AbdallahAli-n3r ปีที่แล้ว
Abdallah Ali Ahmed
In episode 1, the quiz begins with the initial state at room 1. The Q-function for state (1,3) is calculated to be 0, where the reward obtained from moving from state (1,3) to state (3,4) is 0, and the discount factor is 0.1. On the other hand, the Q-function for state (3,4) is determined to be 10, where the reward obtained from moving from state (3,4) to terminal state (4,0) is 100, and the discount factor is 0.1.
In episode 2, the quiz starts with the initial state at room 4. The Q-function for state (4,0) is calculated to be 100, where the reward obtained from moving from state (4,0) to terminal state (4,0) is 0, and the discount factor is 0.1.
In episode 3, the quiz begins with the initial state at room 5. The Q-function for state (5,4) is determined to be 10, where the reward obtained from moving from state (5,4) to terminal state (4,0) is 100, and the discount factor is 0.1.
To reach the goal at room 0, the final path involves moving from state (5,4) to state (3,4) to state (4,0).
final path from 5 to 4 to 0,
Thank you,
@AhmedAli-sz9vi ปีที่แล้ว
Ahmed Ali :
the answer of quiz :
episode 1:
Q(4,0)=100 (goal)
Then finish the episode
episode 2:
Q(5,4)=10
Q(4,0)=100 (goal)
Then finish the episode
episode 3:
Q(3,4)=10
Q(4,0)=100 (goal)
Then finish the episode
and the path in the end will be
from room 1 to room 3 to room 4 to the goal room 0
@MaiMohamed-d5j ปีที่แล้ว
Mai Mohamed
The answer of the quiz is:
Episode 1:
State:room4 , action: room 0
Q(4,0)=100 which is the goal
Episode 2:
State:room5 , action: room 4
Q(state, action)=R(state, action)+Gamma*max(Q(next state)
Q(5,4)=0+0.1*(100 ,0 ,0)=10
Episode 3:
State:room3 , action: room 4
Q(state, action)=R(state, action)+Gamma*max(Q(next state)
Q(3,4)=0+0.1*(100 ,0 ,0)=10
So the path will be from room 1 to room 3 to room 4 to room 0 which is the goal
@SaraAhmed-t1f ปีที่แล้ว
Episode 1
Q(4,0) = 100+0.1*0 =100
Episode 2
Q(3,4) = 0 + 0.1*100= 10
Episode 3
Q(1,3) = 0+0.1*10 = 1
So from room 1 to 3 to 4 to 0 and that's our goal
@manarmohamed2923 ปีที่แล้ว
Manar Mohamad
The Answer Of the Quiz is :
Episode 1
Random Stata room 3 action 4
Update Q Matrix
Q(3,4)=0+0.1*100=10
4 not a goal
Then choose state 4 action 0
Q(4,0)=100+0.1*0=100
0 is a goal - > Episode 1 Finished
--------------------------
Episode 2
Random State room 5 action 4
Update Q Matrix
Q(5,4)=0+0.1*100=10
4 not a goal
Then choose state 4 action 0
Q(4,0)=100+0.1*0=100
0 is a goal - > Episode 2 Finished
--------------------------
Episode 3
Random State room 4 action 0
Update Q Matrix
Q(4,0)=100+0.1*0=100
0 is a goal - > Episode 3 Finished
--------------------------
Start From Stare 5 by choosing Max value
Optimal path is 5 to 4 Then From 4 to 0
[5-4-0]
@mouradfakhfakh4823 10 หลายเดือนก่อน
Would you please share the python code. Thanks!
@mirnasaied-x6t ปีที่แล้ว
episode 1:
state= room 4 , action=room =0
Q(4,0)=100 + 0.1*0 =100
episode 2:
state=5 , action=1
Q(5,1)= 0 + 0.1*0 =0
episode 3:
state=3 , action=4
Q(3,4)= 0 + 0.1*100 =10
Then finish the episode
@MoatazMEwis ปีที่แล้ว
Here is the quiz, prof. Mohamed:
drive.google.com/file/d/12cYcErLIkQWl3pyGn-ya5N6hSftSJBTp/view?usp=drivesdk
@MK-cu1se 7 หลายเดือนก่อน
❤❤❤

ต่อไป

เล่นอัตโนมัติ

Q Learning Project

Q Learning Project

Deep Q-Networks Explained!

Deep Q-Networks Explained!

Q - Learning Algorithm - Reinforcement Machine learning- شرح عربي

Q - Learning Algorithm - Reinforcement Machine learning- شرح عربي

Mache leckere Lutscher mit diesem PRO-Gadget! 🚽🍭

Mache leckere Lutscher mit diesem PRO-Gadget! 🚽🍭

ถ้าม้าโดนแกล้งที่โรงเรียน ม้าจะฟ้องครูว่าอะไร #แต้มเซน #การ์ตูน #tamzen #ตลก #shortvideo #การ์ตูน

ถ้าม้าโดนแกล้งที่โรงเรียน ม้าจะฟ้องครูว่าอะไร #แต้มเซน #การ์ตูน #tamzen #ตลก #shortvideo #การ์ตูน

ตรวจหวยงวดวันที่ 16 ธันวาคม 2567 พร้อมรางวัล N3 รางวัลพิเศษ รางวัล 2 ตัว : Matichon Online

ตรวจหวยงวดวันที่ 16 ธันวาคม 2567 พร้อมรางวัล N3 รางวัลพิเศษ รางวัล 2 ตัว : Matichon Online

LIVE🔴 : Cambodia vs Timor-Leste | ASEAN Championship 2024 | 17.12.24

LIVE🔴 : Cambodia vs Timor-Leste | ASEAN Championship 2024 | 17.12.24

Reinforcement Learning Series: Overview of Methods

Reinforcement Learning Series: Overview of Methods

What is Q-Learning (back to basics)

What is Q-Learning (back to basics)

Reinforcement Learning: Q-learning Algorithm in Python

Reinforcement Learning: Q-learning Algorithm in Python

Apprentissage par renforcement #5 : Introduction au Q-Learning

Apprentissage par renforcement #5 : Introduction au Q-Learning

A friendly introduction to deep reinforcement learning, Q-networks and policy gradients

A friendly introduction to deep reinforcement learning, Q-networks and policy gradients

Q-Learning: A Complete Example in Python

Q-Learning: A Complete Example in Python

Reinforcement Learning: Lecture 1 | Fundamental Concepts

Reinforcement Learning: Lecture 1 | Fundamental Concepts

Q-learning - Explained!

Q-learning - Explained!

شرح وحل مثال خوارزمية Bellman-Ford Algorithm

شرح وحل مثال خوارزمية Bellman-Ford Algorithm

ช่วยหนูด้วยคะ #shorts #แม่สุซูกัส

ช่วยหนูด้วยคะ #shorts #แม่สุซูกัส

Players vs Trophies 🤯

Players vs Trophies 🤯

ผู้หญิงแต่งงานกับขอทาน แต่กลับถูกดูหมิ่น ในที่สุดชายขเทานก็เผยตัวตย#ละครหวานๆ#ชอบ

ผู้หญิงแต่งงานกับขอทาน แต่กลับถูกดูหมิ่น ในที่สุดชายขเทานก็เผยตัวตย#ละครหวานๆ#ชอบ

ทัวร์สตรีมเมอร์ ROV รอบชิงชนะเลิศ | ชิงเงินรางวัลรวม 25,000 บาท

ทัวร์สตรีมเมอร์ ROV รอบชิงชนะเลิศ | ชิงเงินรางวัลรวม 25,000 บาท

"ทักษิณ" ยึดปราจีนฯ ลูกน้องโกทรแปรพักตร์| DAILYNEWSTODAY 17/12/67

"ทักษิณ" ยึดปราจีนฯ ลูกน้องโกทรแปรพักตร์| DAILYNEWSTODAY 17/12/67

ใครขยับไม่ได้เป็น!!

ใครขยับไม่ได้เป็น!!

人是不能做到吗？#火影忍者 #家人 #佐助

人是不能做到吗？#火影忍者 #家人 #佐助

#เดอะตุ๊ก !! เจาะเดือด ทีมชาติ ผ่าฟอร์ม !! ทีมชาติไทย มันส์ เปิด สาเหตุ !! ระบบ+แท็คติก

#เดอะตุ๊ก !! เจาะเดือด ทีมชาติ ผ่าฟอร์ม !! ทีมชาติไทย มันส์ เปิด สาเหตุ !! ระบบ+แท็คติก