Scaling AI Reasoning: MCTS in ICL for Small LM

แชร์
ฝัง
  • เผยแพร่เมื่อ 30 ม.ค. 2025

ความคิดเห็น • 21

  • @code4AI
    @code4AI  หลายเดือนก่อน

    Please note, with the automatic dubbing from TH-cam /Google you hear a synthetic voice in your regional language. To hear my original voice in English, switch to "Default" or "English" in the settings. Thank you.

  • @s4kupfers
    @s4kupfers หลายเดือนก่อน +3

    I love these optimisations that are based on relatively simple logical principles and ideas that can be implemented inexpensively. Also your research with GPT is very valuable, I was able to successfully apply your way of asking GPT to my own problems.

  • @CharlotteLopez-n3i
    @CharlotteLopez-n3i หลายเดือนก่อน +4

    I'm intrigued by the idea of feeding abstract reasoning patterns into smaller language models. This could be a game-changer for efficiency and generalization. Excited to see where this research goes!

    • @Nnm26
      @Nnm26 หลายเดือนก่อน

      AI ahh account

  • @davidhurtado9922
    @davidhurtado9922 หลายเดือนก่อน

    i love your videos man! Usually think of the fish in American Dad is teaching me ai research

  • @GodbornNoven
    @GodbornNoven หลายเดือนก่อน +1

    This is a nice small breakthrough but i think we need something for training.
    I mean we kinda implemented "thinking" into these models, allowing them to refine their answers and take their time while doing a problem. We also implemented a form of plasticity into them "test time training" what's left is making a good spiking neural network. But how would we go about improving training?

  • @toxy805
    @toxy805 หลายเดือนก่อน +2

    Hey can you structure a playlist from beginner to advanced to learn agents and multi agents so on?
    Now it’s all over the place

  • @Yannick-ei2tz
    @Yannick-ei2tz หลายเดือนก่อน

    Very good. It is somehow related to a paper about embodied rag I m trying to implement. This more abstract approach to pb solving for my pet project of drone navigation could help

  • @xtema1624
    @xtema1624 หลายเดือนก่อน

    Plz share the chatgpt's result that u used in this youtube.

  • @drdca8263
    @drdca8263 หลายเดือนก่อน

    4:21 : and, do you think that this list is likely to be accurate?
    4:37 : hm. Which of these 6 do you think abductive reasoning falls under?
    Also, how does “relational reasoning” differ from “systemic reasoning”?
    5:50 : these 5 seem rather different. (Not sure why you didn’t start with these?)
    6:14 : some of these correspondences seem like a stretch…

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w หลายเดือนก่อน

    On surface, it looks like a lot of feature engineering in the training process.

  • @davidak_de
    @davidak_de หลายเดือนก่อน

    12:30 Wenn der Voice Mode plötzlich in einer anderen Sprache spricht 😅

    • @code4AI
      @code4AI  หลายเดือนก่อน

      Sorry, all automated by Google. I accidentally listened to my own video in "robotic German" and was shocked .... Just switch to "original" in settings or "English".

  • @luke.perkin.online
    @luke.perkin.online หลายเดือนก่อน

    The glaring omission is the usual critique of logic, the robustness to approximation and estimation at every step, due to the complexity, indeterminacy, polysemanticity of fuzzy things in the real world. It's fine if you're working with a purely abstract "if A then B", but the world is a lot more complicated, and the mappings are messy. Every reasoning step might be fine but the conclusion still wrong. It's why medicine is hard an QM doesn't work for complex systems.

    • @user-wr4yl7tx3w
      @user-wr4yl7tx3w หลายเดือนก่อน

      What is QM?

    • @drdca8263
      @drdca8263 หลายเดือนก่อน

      Why do you say that quantum mechanics doesn’t work for complex systems?
      Do you just mean that it becomes prohibitively difficult to do computations about?

    • @luke.perkin.online
      @luke.perkin.online หลายเดือนก่อน

      @@drdca8263 In a way it's both. Experimentally it's difficult to keep things entangled, and computationally it becomes intractable to simulate with all the degrees of freedom, got to make lots of simplifications.

    • @drdca8263
      @drdca8263 หลายเดือนก่อน

      @@luke.perkin.online ok, I wouldn’t say that “QM doesn’t work” for those systems for that reason. That seems like to strong of a statement to me .

    • @luke.perkin.online
      @luke.perkin.online หลายเดือนก่อน

      @@drdca8263 Just did a quick Google and they did a 120 atom carbon lattice quantum simulation in 2022. Probably a bit higher now. I'm not saying QM isn't amazing, the maths isn't incredible, and the applications like making huge wafers with nanoscale structure aren't amazing. It doesn't mean the maths isn't basically intractable as of 2024 for large systems.