Latent Space
Latent Space
  • 81
  • 205 637
Agents @ Work: Dust.tt — with Stanislas Polu
Stanislas Polu on working with Greg, Ilya and Sama at OpenAI, competing with LangChain, and why he's working on a horizontal agents platform when vertical agents are at peak hype
latent.space/p/dust
มุมมอง: 102

วีดีโอ

[Paper Club] Intro to Diffusion Models and OpenAI sCM: Simple, Stable, Scalable Consistency Models
มุมมอง 29621 ชั่วโมงที่ผ่านมา
RJ Honicky presents Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models, the newest OAI diffusion paper! arxiv.org/abs/2410.11081 slides docs.google.com/presentation/d/1G_JTTYlXqVtKr9kVYN0NtG2oAlxFLSUyJvdQhj5k1Ho/edit?usp=sharing
In the Arena: How LMSys changed LLM Benchmarking Forever
มุมมอง 435วันที่ผ่านมา
LMArena's leads on pioneering LLM evals with ChatBot Arena and MT-Bench, adjusting for human bias with Style Control, and replacing static benchmarks with dynamic evaluations. www.latent.space/p/lmarena 00:00:00 Introductions 00:01:16 Origin and development of Chatbot Arena 00:05:41 Static benchmarks vs. Arenas 00:09:03 Community building 00:13:32 Biases in human preference evaluation 00:18:27 ...
[Paper Club] Upcycling Large Language Models into Mixture of Experts
มุมมอง 350วันที่ผ่านมา
​this week Ethan He from NVIDIA will present his recent paper on MoE upcycling and Megatron-Core MoE! ​arxiv.org/abs/2410.07524 ​github.com/NVIDIA/Megatron-LM/tree/main/megatron/core/transformer/moe slides: docs.google.com/presentation/d/1EecxYnQbwrwHKTF9SP-yLHrnDF6G9fRbN0fxAkz7l8g/edit?usp=sharing
How NotebookLM Was Made
มุมมอง 1.7K14 วันที่ผ่านมา
Raiza Martin and Usama Bin Shafqat are the lead PM and AI engineer behind the NotebookLM feature flag that gave us the first viral AI voice experience, the “Deep Dive” podcast. We talked about history of the project, design decisions that went into it, and how to product manage effectively in AI. Full show notes: www.latent.space/p/notebooklm 00:00 Introductions 01:39 From Project Tailwind to N...
Singapore: the AI Engineer Nation - with Minister Josephine Teo
มุมมอง 1.7K21 วันที่ผ่านมา
www.latent.space/p/josephine-teo Singapore's GovTech is hosting an AI CTF challenge with ~$15,000 in prizes, starting October 26th. It will be hosted on Dreadnode's Crucible platform; signup here! 00:00:00 Introductions 00:00:34 Singapore's National AI Strategy 00:02:50 Ministry of Digital Development and Information 00:08:49 Defining a National AI Strategy 00:14:32 AI Safety and Governance 00:...
[Paper Club] SWE-Bench [OpenAI Verified/Multimodal] + MLE-Bench with Jesse Hu
มุมมอง 28521 วันที่ผ่านมา
Join our weekly paper clubs: lu.ma/ls SWE Bench swe-bench.github.io/ ​SWE Bench verified openai.com/index/introducing-swe-bench-verified/ ​MLE bench openai.com/index/mle-bench/
Building the Silicon Brain - Drew Houston of Dropbox
มุมมอง 90521 วันที่ผ่านมา
CEOs of publicly traded companies are often in the news talking about their new AI initiatives, but few of them have built anything with it. Drew Houston from Dropbox is different; he has spent over 400 hours coding with LLMs in the last year and is now refocusing his 2,500 employees around this new way of working, 17 years after founding the company. 00:00 Introductions 00:43 Drew's AI journey...
[Paper Club] Molmo + Pixmo + Whisper 3 Turbo - with Vibhu Sapra, Nathan Lambert, Amgadoz
มุมมอง 38928 วันที่ผ่านมา
​Thanks to Vibhu for volunteering Molmo Pixmo, and Amgadoz for Whisper 3 Turbo ​Molmo: molmo.allenai.org/blog ​Pixmo: www.arxiv.org/abs/2409.17146 ​Whisper 3 turbo (no paper) huggingface.co/openai/whisper-large-v3-turbo amgadhasan.substack.com/p/demystifying-openais-new-whisper ​ join us each Wednesday at 12pm PT! lu.ma/ls
Production AI Engineering starts with Evals
มุมมอง 1.9Kหลายเดือนก่อน
Production AI Engineering starts with Evals
[Paper Club] Berkeley Function Calling Paper Club! - Sam Julien, Writer
มุมมอง 380หลายเดือนก่อน
[Paper Club] Berkeley Function Calling Paper Club! - Sam Julien, Writer
Building AGI in Real Time (OpenAI Dev Day 2024)
มุมมอง 4.2Kหลายเดือนก่อน
Building AGI in Real Time (OpenAI Dev Day 2024)
[Paper Club] Who Validates the Validators? Aligning LLM-Judges with Humans (w/ Eugene Yan)
มุมมอง 441หลายเดือนก่อน
[Paper Club] Who Validates the Validators? Aligning LLM-Judges with Humans (w/ Eugene Yan)
Language Agents: From Reasoning to Acting - with Shunyu Yao of OpenAI, Harrison Chase of LangGraph
มุมมอง 2.8Kหลายเดือนก่อน
Language Agents: From Reasoning to Acting - with Shunyu Yao of OpenAI, Harrison Chase of LangGraph
llm.c's Origin and the Future of LLM Compilers - Andrej Karpathy at CUDA MODE
มุมมอง 30Kหลายเดือนก่อน
llm.c's Origin and the Future of LLM Compilers - Andrej Karpathy at CUDA MODE
The Ultimate Guide to Prompting - with Sander Schulhoff from LearnPrompting.org
มุมมอง 1.7Kหลายเดือนก่อน
The Ultimate Guide to Prompting - with Sander Schulhoff from LearnPrompting.org
[Paper Club] Writing in the Margins: Chunked Prefill KV Caching for Long Context Retrieval
มุมมอง 287หลายเดือนก่อน
[Paper Club] Writing in the Margins: Chunked Prefill KV Caching for Long Context Retrieval
[Paper Club] 🍓 On Reasoning: Q-STaR and Friends!
มุมมอง 1Kหลายเดือนก่อน
[Paper Club] 🍓 On Reasoning: Q-STaR and Friends!
Building AGI with OpenAI's Structured Outputs API
มุมมอง 1.9Kหลายเดือนก่อน
Building AGI with OpenAI's Structured Outputs API
Personal benchmarks vs HumanEval - with Nicholas Carlini of DeepMind
มุมมอง 7522 หลายเดือนก่อน
Personal benchmarks vs HumanEval - with Nicholas Carlini of DeepMind
Is finetuning GPT4o worth it?
มุมมอง 1.9K2 หลายเดือนก่อน
Is finetuning GPT4o worth it?
Answer.ai & AI Magic with Jeremy Howard
มุมมอง 2.2K2 หลายเดือนก่อน
Answer.ai & AI Magic with Jeremy Howard
Segment Anything 2: Memory + Vision = Object Permanence - with Nikhila Ravi and Joseph Nelson
มุมมอง 2.5K3 หลายเดือนก่อน
Segment Anything 2: Memory Vision = Object Permanence - with Nikhila Ravi and Joseph Nelson
The Winds of AI Winter (Q2 Four Wars of the AI Stack Recap)
มุมมอง 1.3K3 หลายเดือนก่อน
The Winds of AI Winter (Q2 Four Wars of the AI Stack Recap)
[LLM Paper Club] Llama 3.1 Paper: The Llama Family of Models
มุมมอง 9463 หลายเดือนก่อน
[LLM Paper Club] Llama 3.1 Paper: The Llama Family of Models
Training Llama 2, 3 & 4: The Path to Open Source AGI - with Thomas Scialom of Meta AI
มุมมอง 2.4K3 หลายเดือนก่อน
Training Llama 2, 3 & 4: The Path to Open Source AGI - with Thomas Scialom of Meta AI
The 10,000x Yolo Researcher Metagame - with Yi Tay of Reka
มุมมอง 2.8K4 หลายเดือนก่อน
The 10,000x Yolo Researcher Metagame - with Yi Tay of Reka
State of the Art: Training 70B LLMs on 10,000 H100 clusters
มุมมอง 1.3K4 หลายเดือนก่อน
State of the Art: Training 70B LLMs on 10,000 H100 clusters
How To Hire AI Engineers (ft. James Brady and Adam Wiggins of Elicit)
มุมมอง 8484 หลายเดือนก่อน
How To Hire AI Engineers (ft. James Brady and Adam Wiggins of Elicit)
How AI is Eating Finance - with Mike Conover of Brightwave
มุมมอง 1.5K5 หลายเดือนก่อน
How AI is Eating Finance - with Mike Conover of Brightwave

ความคิดเห็น

  • @coreyhayes9428
    @coreyhayes9428 4 ชั่วโมงที่ผ่านมา

    Great work

  • @TheAIEpiphany
    @TheAIEpiphany 7 วันที่ผ่านมา

    great vid! I love the fact he still codes that much minor feedback: reduce the amount of frame changes, the content is already fun as it is (when you have a great guest), no need for the production to go wild to keep viewers's attention (i think, at least for me)

  • @mriz
    @mriz 7 วันที่ผ่านมา

    her tweet is code for infra team 😅😅

  • @WenRolland
    @WenRolland 10 วันที่ผ่านมา

    Great talk! Google really have great creative people working for them.

  • @brandonheaton6197
    @brandonheaton6197 11 วันที่ผ่านมา

    Steven Johnson's description of using DevonThink from Where Good Ideas Come From surely inspired a chunk of the design

  • @TechCindy
    @TechCindy 12 วันที่ผ่านมา

    Nice work & explanation.

  • @ethanhe42
    @ethanhe42 12 วันที่ผ่านมา

    thanks for having me!

  • @Fun-with-AI.s
    @Fun-with-AI.s 14 วันที่ผ่านมา

    Thank you so much!! I never understood "The Turnover Village" until NotebookLM explained it to me.

  • @bananasmileclub5528
    @bananasmileclub5528 15 วันที่ผ่านมา

    i love that guy! Pls have him again on the platform. The depth of his understanding from low level to ML application including the financial side, is very impessive.

  • @areeshzaharavlog570
    @areeshzaharavlog570 16 วันที่ผ่านมา

    MashAllha

  • @RomonaFoster
    @RomonaFoster 16 วันที่ผ่านมา

    Good stuff! 🤓 Found you on X.

  • @zaheerabbas630
    @zaheerabbas630 16 วันที่ผ่านมา

    Love you Dear

  • @mriz
    @mriz 17 วันที่ผ่านมา

    f love how NotebookLM podcast begin with Deep Dive opening 😆

  • @absbox_
    @absbox_ 17 วันที่ผ่านมา

    Ha ha brilliant 😀

  • @akbeastvijayfan
    @akbeastvijayfan 19 วันที่ผ่านมา

    This is really helpful for me. Thank you.

  • @สหรัตศรีวิเศษ-ฃ2ฉ
    @สหรัตศรีวิเศษ-ฃ2ฉ 22 วันที่ผ่านมา

    🎉🎉😮

  • @lakkakka
    @lakkakka 22 วันที่ผ่านมา

    Lets not. This guy is approaching it to be lazy. To not have to do "unimportant" stuff.

  • @LatentSpaceTV
    @LatentSpaceTV 23 วันที่ผ่านมา

    Full writeup: www.latent.space/p/josephine-teo

  • @ziurnauj
    @ziurnauj 23 วันที่ผ่านมา

    Insight dense podcast, relistening and taking notes

  • @therobotocracy
    @therobotocracy 23 วันที่ผ่านมา

    Wow, talented dude! Great interview!

  • @ai_is_a_great_place
    @ai_is_a_great_place 23 วันที่ผ่านมา

    No echo on Dylan which sounded great but unfortunately the other audio was kinda echoing but amazing podcast in all other aspects 👏 👏 👏 👏 👏

    • @LatentSpaceTV
      @LatentSpaceTV 21 วันที่ผ่านมา

      We put this together in a couple hours and our studio wasn't ready, thanks for putting up with it :)

    • @ai_is_a_great_place
      @ai_is_a_great_place 21 วันที่ผ่านมา

      @@LatentSpaceTV ah no worries - still a stellar podcast and I look forward to future ones!

  • @ashutoshpadhi2782
    @ashutoshpadhi2782 24 วันที่ผ่านมา

    seems like Drew is deep into music.

    • @LatentSpaceTV
      @LatentSpaceTV 24 วันที่ผ่านมา

      He played some drums before we started!

    • @ashutoshpadhi2782
      @ashutoshpadhi2782 23 วันที่ผ่านมา

      @@LatentSpaceTV Did you ask him how the heck is he juggling so many stuffs i.e. music, CEOing, actively coding, etc ?

  • @anpan6282
    @anpan6282 28 วันที่ผ่านมา

    That looks like a Sunny Hills Pineapple cake

  • @LatentSpaceTV
    @LatentSpaceTV หลายเดือนก่อน

    sorry we screwed up the video recording! see the full writeup as well! www.latent.space/p/braintrust

  • @LatentSpaceTV
    @LatentSpaceTV หลายเดือนก่อน

    see also Amgad's writeup on Whisper amgadhasan.substack.com/p/demystifying-openais-new-whisper

  • @claudioagmfilho
    @claudioagmfilho หลายเดือนก่อน

    🇧🇷🇧🇷🇧🇷🇧🇷👏🏻, I'm eagerly awaiting the release of real-time video for Plus users from OpenAI, as it was originally mentioned as part of the ChatGPT Omni update, which sadly never reached us. This feature will be revolutionary, enabling us to tackle a wide range of daily tasks more efficiently. Real-time video integration within ChatGPT would greatly enhance productivity by allowing for interactive, dynamic assistance and more streamlined workflows. It would be especially useful for tasks like desktop sharing-being able to visually assist and collaborate on real-time activities is just phenomenal. I hope this feature rolls out soon, as it could drastically improve how we approach everyday challenges.

  • @CurtCox
    @CurtCox หลายเดือนก่อน

    Asking Claude: Read the provided transcript. Provide lists for each of the following that are mentioned: APIs Tools Techniques People Organizations Problem Domains ----------------------- Here are the lists based on the provided transcript: APIs: 1. Real-time API 2. WebSocket API 3. Chat Completions API 4. Assistants API 5. Function calling API 6. Structured output API Tools: 1. ChatGPT 2. GPT-4 3. GPT-3.5 4. O1 (OpenAI's new model) 5. Cursor (coding tool) 6. Genie (coding AI) 7. Cognition (AI tool) 8. Harvey (AI tool) 9. Case Text (AI tool) 10. Notebook LM (Google's tool) 11. Claude (Anthropic's AI) 12. Gemini (Google's AI) 13. Waymo (self-driving car technology) 14. Code Interpreter Techniques: 1. Fine-tuning 2. Distillation 3. Prompt caching 4. Vision fine-tuning 5. Model evaluation 6. Iterative deployment 7. Chain of thought reasoning 8. Function calling 9. Structured outputs 10. System prompts People: 1. Sam Altman 2. Kevin Weil 3. Roman Hewitt 4. Michelle Pokras 5. Simon Willison 6. Alistair Pullen 7. Olivier Godemont 8. Ilya Sutskever (implied) 9. George Hotz 10. Jon Frankle 11. Chris Lattner 12. Jeremy Howard Organizations: 1. OpenAI 2. Anthropic 3. Google 4. Microsoft 5. Cosine 6. Cognition 7. Harvey 8. Case Text 9. Twilio 10. LifeKit 11. Agora 12. Waymo 13. Twitter (former) 14. Coinbase (former) 15. Y Combinator Problem Domains: 1. Natural language processing 2. Computer vision 3. Speech recognition and synthesis 4. Code generation 5. Language translation 6. Autonomous systems 5. Artificial General Intelligence (AGI) 6. AI safety and alignment 7. AI ethics and responsible deployment 8. Developer tools and platforms 9. AI-assisted software engineering 10. Real-time AI interactions 11. Multimodal AI (text, voice, vision) 12. AI agents and automation 13. AI in government and public services 14. AI for scientific discovery 15. AI user interfaces and experiences

  • @elliptictree
    @elliptictree หลายเดือนก่อน

    Interesting.

  • @LatentSpaceTV
    @LatentSpaceTV หลายเดือนก่อน

    high quality video is now up here x.com/marksaroufim/status/1841277387834830876

  • @dawid_dahl
    @dawid_dahl หลายเดือนก่อน

    Is the host the guy in Silicon Valley who played the supermarket employee who quit his job to build an app which could find a parked car?

  • @objectobjectobject4707
    @objectobjectobject4707 หลายเดือนก่อน

    thanks for recording and sharing

  • @markivy202
    @markivy202 หลายเดือนก่อน

    Simple answer is Yes, the problem with most of the major contributors is they're trying to build without knowing or understanding what the best end would be ( building thinking about the current without the end in mind). Feeding a machine all of the data makes it knowledgeable, but it will always lack true intelligence until it realizes that there are hundreds if not thousands of ways to solve a problem and it's able to narrow down the best course of action to get the best result.

  • @GNARGNARHEAD
    @GNARGNARHEAD หลายเดือนก่อน

    AYYO OP really published an AI paper with the term reverse-harem in it.. *FISTBUMPS*

  • @tanli1204
    @tanli1204 หลายเดือนก่อน

    What's the link to the paper mentioned in this talk (timestamp: 23:45)? Great content!

    • @LatentSpaceTV
      @LatentSpaceTV หลายเดือนก่อน

      Full show notes are always on our Substack: www.latent.space/p/learn-prompting

  • @MannyBernabe
    @MannyBernabe หลายเดือนก่อน

    Great pod. Thx!

  • @fintech1378
    @fintech1378 หลายเดือนก่อน

    harrison looks a bit sad

  • @Yume-x9v
    @Yume-x9v หลายเดือนก่อน

    3D Variety analysis point of point the point.

  • @Yume-x9v
    @Yume-x9v หลายเดือนก่อน

    3D Python Pylogik. Point of photo Python.

  • @snarkyboojum
    @snarkyboojum หลายเดือนก่อน

    Cool to hear from a systems engineer, but agree with other commenters - she’s not working on AI or AGI ;)

  • @TheAIEpiphany
    @TheAIEpiphany หลายเดือนก่อน

    13:00 + 20:15 -> I guess we're the avengers-pandas now :))

  • @snarkyboojum
    @snarkyboojum หลายเดือนก่อน

    Intelligence creates knowledge and improves knowledge over time.

  • @LatentSpaceTV
    @LatentSpaceTV หลายเดือนก่อน

    like and subscribe! www.latent.space/p/shunyu

  • @gitmaxd
    @gitmaxd หลายเดือนก่อน

    So good! Love these reflective style presentations.

  • @KevinKreger
    @KevinKreger หลายเดือนก่อน

    Great reviews BTW, Synthetic data can be llm generated or by traditional programming.

  • @skierpage
    @skierpage หลายเดือนก่อน

    16:05 "I deleted all the streams, made everything single-threaded because we ended up getting all kinds of weird race conditions and errors and so on and I just didn't want to deal with it." Andrej,I hate to be that person, but Rewrite it in Rust! 😊

  • @KevinKreger
    @KevinKreger หลายเดือนก่อน

    First. Great interview. BTW I get leaks from my exemplars..too influential. I ask Claude for his hidden reasoning trace and he helps. My experience, the best reasoning trace varies by problem because it follows that domain's problem solving process.

  • @Japneets1
    @Japneets1 หลายเดือนก่อน

    Thanks for recording this!

  • @MultiMediaUploads
    @MultiMediaUploads หลายเดือนก่อน

    Wow!

  • @devon9374
    @devon9374 หลายเดือนก่อน

    Andrej blew my mind there at the end: "Python and PyTorch and everything else is just a crutch because we humans are finite with finite knowledge intelligence and attention. But actually, don't you want to write all code in CUSTOM CUDA kernels?" 🤯 CUDA MODE! Let's make the GPUs goo BRRRRRRRRR!

  • @KevinKreger
    @KevinKreger หลายเดือนก่อน