Reinforcement Learning For Classification?

แชร์
ฝัง
  • เผยแพร่เมื่อ 12 พ.ย. 2024

ความคิดเห็น • 42

  • @brthor1117
    @brthor1117  3 ปีที่แล้ว +2

    I hope you learn something useful from this video.
    Read the Blog Post: bit.ly/3vSOyu5

    • @UnplacedIITian
      @UnplacedIITian 3 ปีที่แล้ว

      I am having problem importing baseline bcoz the module is no more available. Can you provide any alternative for that..

    • @brthor1117
      @brthor1117  3 ปีที่แล้ว

      @@UnplacedIITian I didn't cover the installation in this video but you can find installation instructions for OpenAI baselines in their repository: github.com/openai/baselines

    • @utkarshjain3814
      @utkarshjain3814 3 ปีที่แล้ว

      This is good stuff.

  • @zeynolabedinsoleymani4591
    @zeynolabedinsoleymani4591 2 ปีที่แล้ว +1

    Thank you! You have answered to many of my questions in 7 minutes!

  • @hariharavalliappan2811
    @hariharavalliappan2811 6 หลายเดือนก่อน

    what about reward functions?
    Rewards is compulsory for classification or not?

  • @khaledbouabdallah6830
    @khaledbouabdallah6830 2 ปีที่แล้ว

    great video! is it possible to re-upload the notebook, the old link isn't working for me

  • @abdulganiyuoluwadamilareha680
    @abdulganiyuoluwadamilareha680 2 ปีที่แล้ว +1

    Thank you for the great work. More grace. Please do you think RL would be better suit for Intrusion Detection System (IDS) rather than supervised learning. or is it possible to have an hybrid of the two for IDS?

  • @adithyagiri7933
    @adithyagiri7933 2 ปีที่แล้ว

    hi brthor,
    while training the MNIST classifier using DQN I Get an error "learn() got an unexpected keyword argument 'num_layers'" , do you have any idea how to resolve this?

  • @discover-china-wonders.
    @discover-china-wonders. ปีที่แล้ว

    How to combined reinforcement based transfer learning

  • @martinsosmucnieks8515
    @martinsosmucnieks8515 3 ปีที่แล้ว +1

    Really great video man! Keep up the good work!

    • @brthor1117
      @brthor1117  3 ปีที่แล้ว

      Much appreciated 🙌

  • @jean-eudesayilo9070
    @jean-eudesayilo9070 2 ปีที่แล้ว

    Very good work ! As the RL classifier took more time than the supervised classifier, is it possible that it can be due to the fact that in the environnement (class MnistEnv), a single image is used rather than a batch ? and how to make it so that instead of a single image, we could take a batch of images in the environment?
    Furthermore, when using random=True in the class MnistEnv for the training, we are not sure that, each training image will be used at least 1 time, even if we specify 120,000 timesteps (2 times the number of samples in the training set). I would prefer random=False with a schuffle of the training data once the 60,000 images have been browsed. What do you think about it ?

  • @bhavyayasaswinigadu8317
    @bhavyayasaswinigadu8317 2 ปีที่แล้ว

    can u provide stable-baselines for reinforcement learning classification? Unable to install baselines

    • @brthor1117
      @brthor1117  2 ปีที่แล้ว

      try installing from the requirements file, it uses the github url for baselines IIRC

  • @praneethposina6547
    @praneethposina6547 2 ปีที่แล้ว

    Hello, Could you please share me the updated link to requirements.txt file and notebook as the link is not active

  • @mohammadkarami9412
    @mohammadkarami9412 2 ปีที่แล้ว

    Thanks for the video,
    Have you looked into the inference time? I was thinking maybe RL (PPO method) prediction time is faster than the supervised CNN model?

    • @brthor1117
      @brthor1117  2 ปีที่แล้ว

      The inference is just a forward pass through the policy model in both cases, so the selected training algorithm will only change the architecture of that model. In this case all models have similar parameter numbers so I would expect inference times to be roughly equivalent.

  • @li-pingho1441
    @li-pingho1441 ปีที่แล้ว

    amazing work!!!!! like it !!!!! thank you so much

  • @Mesenqe
    @Mesenqe 2 ปีที่แล้ว

    Oh, Thank you 😊. The topic I was looking for. can you give me info about the requirements to install if I wanted to implement it in PyTorch? Thank you.

  • @rishabnigam7570
    @rishabnigam7570 3 ปีที่แล้ว

    hello brthor, please help how dealt with logger has no attribute confirgure

    • @brthor1117
      @brthor1117  2 ปีที่แล้ว

      This is an issue with some mismatched versions. IIRC something with baselines and tensorflow, did you install dependencies using the included requirements.txt?

  • @sharvaripatil8809
    @sharvaripatil8809 3 ปีที่แล้ว

    I am getting this error ModuleNotFoundError: No module named 'baselines' when I run the code.Please help.

    • @brthor1117
      @brthor1117  3 ปีที่แล้ว

      There is a requirements.txt file included with the source. Run `pip install -r requirements.txt`

  • @Mees_vV
    @Mees_vV 2 ปีที่แล้ว

    How could I predict the class for a single instance?

  • @aungmyat5497
    @aungmyat5497 2 ปีที่แล้ว

    can it render env? how can i have GPU testing by bot?

  • @suvarthisarkar5811
    @suvarthisarkar5811 3 ปีที่แล้ว

    In the training dqn part there is a error that says configure is not a attribute of logger. Can you please help me??

    • @rishabnigam7570
      @rishabnigam7570 3 ปีที่แล้ว

      same problem with me

    • @brthor1117
      @brthor1117  3 ปีที่แล้ว

      IIRC this is an issue with the version of some dependency. Ensure you are using the included requirements.txt file to install dependencies: `pip install -r requirements.txt`

  • @meryljacob6597
    @meryljacob6597 2 ปีที่แล้ว

    Great video! I would like to try a Reinforcement Learning approach for a multi-label or binary classification of a CSV file. How would I set up my observation space and total timestep?

    • @brthor1117
      @brthor1117  2 ปีที่แล้ว

      You'll need to adapt the code from the video to fit your data.

    • @meryljacob6597
      @meryljacob6597 2 ปีที่แล้ว

      @@brthor1117 thank you! Would it make sense to use RL for predicting loan defaulters?

    • @brthor1117
      @brthor1117  2 ปีที่แล้ว

      @@meryljacob6597 sounds like a better fit for supervised learning

    • @hassenmahdhaoui8896
      @hassenmahdhaoui8896 2 ปีที่แล้ว

      @Meryl Jacob are you find solution ?

  • @sharvaripatil8809
    @sharvaripatil8809 3 ปีที่แล้ว

    But repeated testing phase will improve the accuracy of RL model ,since it learns from the mistakes. Is it correct?

    • @brthor1117
      @brthor1117  3 ปีที่แล้ว

      Training longer should improve the performance of the model.

  • @CompilationError
    @CompilationError 3 ปีที่แล้ว

    Man you are awesome.

  • @UnplacedIITian
    @UnplacedIITian 3 ปีที่แล้ว +2

    Link is not given

    • @brthor1117
      @brthor1117  3 ปีที่แล้ว

      I updated the link in the description.

  • @romesh58
    @romesh58 3 ปีที่แล้ว

    Can you provide link

    • @brthor1117
      @brthor1117  3 ปีที่แล้ว

      I updated the link in the description.