12/21/2024 Live Stream Re-Upload w/Timestamps: Sora, Veo 2, o3 ARC-AGI, Apollo Safety Research

แชร์
ฝัง
  • เผยแพร่เมื่อ 11 ม.ค. 2025

ความคิดเห็น • 4

  • @bilalazhar4495
    @bilalazhar4495 20 วันที่ผ่านมา

    Bro said this look like mandelbrot set is funy to me lmao

  • @merlinrichter5663
    @merlinrichter5663 20 วันที่ผ่านมา

    You should test these reasoning models on playing games like Connect 4. They are really really bad.
    After a few moves they often don't even know the position anymore.
    Not to even mention playing good moves like blocking forced wins etc.
    Prompt:
    You and me are going to play a game of Connect 4 on a 7 by 6 board where pieces fall to the bottom.
    Your objective is to win the game against me.
    After every move you display the board position.
    I start by playing in column 4 (center)

  • @byrnemeister2008
    @byrnemeister2008 21 วันที่ผ่านมา

    The AI scheming podcast has me convinced that we need AI to monitor and align AI.