LLMs as General Pattern Machines: Use Arbitrary Tokens to Pattern Match?

  • Published on 17 Nov 2024

Comments • 3

  • @johntanchongmin  1 year ago +2

    1:14:50 For Cart Pole, I took a look at their Jupyter Notebook and realised that they predict the action at each timestep, so this part is similar to Decision Transformer.
    However, for the embeddings, they embed the entire text of the sequence at once. This is suboptimal, because the tokeniser may chunk some numbers together and not others. It would be better to compute the embeddings per number instead of leaving the chunking to the tiktoken tokeniser!
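
    To make the difference concrete, here is a minimal sketch of the two options. It is not the notebook's code: `toy_embed` is a hypothetical, hash-based stand-in for a real embedding model, and the function names and the example observation are assumptions made for illustration only.

```python
import hashlib
from typing import Callable, List

Vector = List[float]


def toy_embed(text: str, dim: int = 8) -> Vector:
    """Hypothetical stand-in for a real embedding model.

    Maps text to a deterministic vector via a hash, so the example
    runs without any external library or API.
    """
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return [byte / 255.0 for byte in digest[:dim]]


def embed_whole_sequence(
    values: List[int], embed: Callable[[str], Vector] = toy_embed
) -> Vector:
    """Embed the full sequence text in one call.

    With a real model, the tokeniser decides how the digits are
    chunked, so number boundaries are not guaranteed to be respected.
    """
    return embed(" ".join(str(v) for v in values))


def embed_per_number(
    values: List[int], embed: Callable[[str], Vector] = toy_embed
) -> List[Vector]:
    """Embed each number separately, so every value is its own unit."""
    return [embed(str(v)) for v in values]


# Made-up discretised observation, for illustration only.
observation = [52, 7, 103, 48]
whole = embed_whole_sequence(observation)    # one vector for the whole text
per_number = embed_per_number(observation)   # one vector per number
```

    With per-number embeddings, equal values always map to the same vector regardless of their neighbours, which is the property the whole-sequence approach cannot guarantee.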

  • @simonstrandgaard5503 1 year ago +2

    Great walkthrough.

  • @johntanchongmin  1 year ago +1

    Part 2 here: th-cam.com/video/rZ6hgFEe5nY/w-d-xo.html