Pydantic is all you need: Jason Liu

แชร์
ฝัง
  • เผยแพร่เมื่อ 31 ต.ค. 2023
  • Please return only json, do not add any other comments ONLY RETURN JSON OR I'LL TAKE A LIFE
    If this was you, then you've probably been pretty happy to see OpenAI function_call get released, I'm here to show you how you can get the most out of such powerful feature. Instead of writing prompts that turn strings into strings, we can write Pydantic objects and get Pydantic objects out of OpenAI.
    In this talk we explore some model driven development. Where we go step by step with some examples on how to represent your problem as simple code so we can model, generate diagrams, and write prompts as code to save time and model complex data correctly, allow us to use the same best practices rather than having to invent new ones for how we manage prompts.
    Recorded live in San Francisco at the AI Engineer Summit 2023. See the full schedule of talks at ai.engineer/summit/schedule & join us at the AI Engineer World's Fair in 2024! Get your tickets today at ai.engineer/worlds-fair
    About Jason Liu
    Previously stitch fix and Facebook. Currently consulting startups on production using llm systems.
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 102

  • @jxnlco
    @jxnlco 8 หลายเดือนก่อน +253

    Thanks for having me on! this was my first publish speaking thing in like.. .5 years

    • @aiDotEngineer
      @aiDotEngineer  8 หลายเดือนก่อน +9

      Our pleasure! You did excellent!

    • @philsheard832
      @philsheard832 7 หลายเดือนก่อน +8

      Really great talk. Credible on the code but effortlessly natural in delivery. Hope to hear more from you, do you have a blog or other outputs?

    • @shanegleeson5823
      @shanegleeson5823 7 หลายเดือนก่อน +1

      I'm having so much fun playing with some of your insights. I had already built some interesting tools when functions first came out and twitter was showing how pydantic was the key to making it really usable. But this is such a great progression. Thanks!

    • @jxnlco
      @jxnlco 7 หลายเดือนก่อน

      @@philsheard832yeah I have a personal blog and int instructor docs.

    • @samdroid37
      @samdroid37 7 หลายเดือนก่อน

      Really enjoyed this. Thanks Jason!

  • @parttimelarry
    @parttimelarry 8 หลายเดือนก่อน +36

    Wow, outstanding talk. I came in thinking a 15 minute talk on typing wouldn't be that interesting, but by the end I felt like this was the most well explained and immediately useful presentation on here.

    • @ccerrato147
      @ccerrato147 8 หลายเดือนก่อน

      Same here!

    • @jxnlco
      @jxnlco 8 หลายเดือนก่อน

      Thanks for the kind words would love any feedback on the docs or on the talk if you have any

    • @alexemilcar6525
      @alexemilcar6525 8 หลายเดือนก่อน

      @@jxnlcoawesome presentation. Where can I dive deeper into this for other models? I heavily use Pydantic today, but other models for work. You mention ask Marvin. What are the differences between that and Instructor?

  • @shanky1897
    @shanky1897 8 หลายเดือนก่อน +1

    Awesome talk jasonliu ......really learned something new today.Thank you for sharing this

  • @JosephRedwoodMartinez
    @JosephRedwoodMartinez 8 หลายเดือนก่อน +21

    I was at the conference in person -- this talk was a major highlight. Glad to see it again!

    • @jxnlco
      @jxnlco 8 หลายเดือนก่อน +1

      thanks for sharing!

  • @DKLHensen
    @DKLHensen 7 หลายเดือนก่อน +26

    Jason Lio is seriously next level, he brings so much in this video, can watch this 10 times.

    • @ultmusic
      @ultmusic หลายเดือนก่อน

      hopefully watching it the 11th time you can spell his name properly

  • @rstar899
    @rstar899 4 หลายเดือนก่อน

    Thank you Jason. Phenomenal work and effort + you & your team.

  • @DeruwynArchmage
    @DeruwynArchmage 7 หลายเดือนก่อน

    This awesome! Thank you for sharing this!

  • @friendlydroid42
    @friendlydroid42 7 หลายเดือนก่อน +2

    I used decorators over my calls to allow for feedback loops ! Pydantic is a must 😊

  • @kevon217
    @kevon217 7 หลายเดือนก่อน

    Great talk. Such a useful workflow, thanks!

  • @JonathanPlasse
    @JonathanPlasse 7 หลายเดือนก่อน

    Thank you for your presentation

  • @anthonyd2988
    @anthonyd2988 7 หลายเดือนก่อน +2

    Well this really helped me learn Pydantic🎉

  • @swannschilling474
    @swannschilling474 7 หลายเดือนก่อน +5

    This is something very different from all the other stuff out there!

  • @eugenio.cabral
    @eugenio.cabral 8 หลายเดือนก่อน

    Great talk and content!

  • @agarbanzo360
    @agarbanzo360 7 หลายเดือนก่อน

    Such a smooth talk!!

  • @RajabNatshah
    @RajabNatshah 7 หลายเดือนก่อน

    Thank you :)

  • @jakobkristensen2390
    @jakobkristensen2390 7 หลายเดือนก่อน

    Great talk!

  • @anujkhandelwal7523
    @anujkhandelwal7523 4 หลายเดือนก่อน

    Incredible talk!

  • @jairajsahgal7101
    @jairajsahgal7101 7 หลายเดือนก่อน

    Thank you

  • @sanyamjain4378
    @sanyamjain4378 7 หลายเดือนก่อน +4

    Validation with citation is very interesting. I did this using normal prompt engineering and hoped I’d get good results every time 😂

  • @messi_maeun_stew
    @messi_maeun_stew 7 หลายเดือนก่อน

    amazing job!

  • @tomlonghaymingway9396
    @tomlonghaymingway9396 8 หลายเดือนก่อน

    Nice one!

  • @ilianos
    @ilianos 5 หลายเดือนก่อน +5

    🎯 Key Takeaways for quick navigation:
    00:01 🎵 *Introduction and Scope*
    - Jason Liu introduces himself as a keynote speaker, providing an overview of his talk on type hints, Pydantic, and structured prompting.
    - Discusses the challenges of using language models in production, particularly when outputting JSON or structured data.
    01:19 🌐 *Introduction to Pydantic*
    - Introduces Pydantic as a library for data model validation, emphasizing its similarity to data classes and its reliance on type hints.
    - Highlights the benefits of using Pydantic, including better validation, cleaner code, and automatic generation of JSON schema.
    04:21 🧩 *Introduction to Structured Prompting with Pydantic*
    - Discusses the concept of structured prompting, where prompts are actual code represented by Pydantic objects.
    - Demonstrates how Pydantic enables defining objects with nested references, methods, and cleaner code for language model prompts.
    05:28 🔧 *Introduction to Instructor Library*
    - Introduces the "instructor" library, designed to simplify the usage of Pydantic for prompting language models, especially for OpenAI function calls.
    - Explains how "instructor" patches the completion API and facilitates type-safe and auto-complete features.
    06:24 🔄 *Advantages of Structured Prompting*
    - Explores the advantages of structured prompting, emphasizing the ability to define nested references, methods, and reusable components.
    - Discusses how this approach leads to more modular, maintainable, and bug-free code.
    07:47 🛡️ *Using Validators with Pydantic*
    - Demonstrates the use of validators with Pydantic, showcasing the ability to add custom validation functions.
    - Illustrates how language model validators can be integrated to catch and handle errors effectively.
    08:57 🌐 *Structured Prompting for Knowledge Workflows*
    - Explores how structured prompting can go beyond structured outputs, enabling the modeling of knowledge workflows and plans.
    - Discusses the potential for representing knowledge graphs and leveraging language models for more productive development.
    12:14 🔄 *Advanced Applications: Search Query Planning*
    - Demonstrates advanced applications, such as search query planning using structured prompting.
    - Shows how defining a data structure for search types and execution methods simplifies the process of querying multiple backends.
    14:34 📊 *Advanced Applications: Knowledge Graph Extraction*
    - Illustrates an advanced application focused on extracting knowledge graphs by closely modeling the data structure to the graph visualization API.
    - Emphasizes the simplicity achieved in code with the structured prompting approach.
    16:11 🔮 *Future Possibilities and Conclusion*
    - Discusses the future possibilities of structured outputs, including multimodal applications and generative UI over images, audio, and more.
    - Concludes with excitement about the evolving space of structured prompting and its potential in various domains.
    Made with HARPA AI

  • @pythagoran
    @pythagoran 8 หลายเดือนก่อน +13

    Engineers used to be expensive because they produced potentially infinite automation.
    Today, they're expensive because they consume potentially infinite automation.

    • @softdevstuff1008
      @softdevstuff1008 5 หลายเดือนก่อน

      engineers are expensive because not everyone wants to deal with things that abstract and complex. I mean, engineers are close to create a machine that is smarter than them, which can make them obsolete, but still did it. which other profession is so true to it?

  • @ristopaasivirta9770
    @ristopaasivirta9770 7 หลายเดือนก่อน +3

    Respond only with a valid json and nothing else. MY AND MY FAMILY'S LIFE DEPEND UPON THIS!
    Let us hope Roko's basilisk will look kindly upon us emotionally manipulating our poor LLMs

  • @iumair_mehmood
    @iumair_mehmood 3 หลายเดือนก่อน

    This is soo good, would love to have a link for repo for the reference

  • @fkxfkx
    @fkxfkx 8 หลายเดือนก่อน

    Hi: thanks for the talk. Where can we find out more about instructor? (16:33) Oops, there it is one second later.

  • @pz8087
    @pz8087 7 หลายเดือนก่อน

    TIL that you can copy text from youtube videos. very slick... and great talk Jason :)

  • @Adrian_Galilea
    @Adrian_Galilea 7 หลายเดือนก่อน

    This has been growing on me.

  • @ATH42069
    @ATH42069 8 หลายเดือนก่อน +2

    this is so good it hurts my feelings a little

  • @jcourson8
    @jcourson8 7 หลายเดือนก่อน +5

    Makes me interested in using Rust for LLM interactions. Whether the LLM return is validated or not you’re forced to handle every case and thus will make more robust systems by design.

  • @ReflectionOcean
    @ReflectionOcean 5 หลายเดือนก่อน

    - Consider using structured prompting for better LL model outputs (00:31)
    - Ensure LL models output JSON or structured data compatible with existing software (00:50)
    - Utilize OpenAI function calls for improved JSON schema validation (2:49)
    - Employ the Pantic library for data model validation and to generate JSON schema (3:53)
    - Implement instructor library to simplify OpenAI function calling with Pantic (5:16)
    - Use doc strings in Pantic models to improve prompt and data quality (6:47)
    - Create validators in Pantic models for data integrity and error handling (7:20)
    - Use LL models to output structured data for complex data processing (11:08)
    - Explore advanced applications of structured prompts for knowledge extraction (12:07)
    - Check out additional examples and documentation on structured prompting (16:29)

  • @shaunbowman8036
    @shaunbowman8036 2 หลายเดือนก่อน

    lol i JUST finished our pydantic based datamodels - it’s great, nice to see this, feels like `validation` lol

  • @jason13gaming
    @jason13gaming 7 หลายเดือนก่อน

    jason here

  • @nartrab1
    @nartrab1 8 หลายเดือนก่อน +1

    Now this is good. This is going to be the way to actually integrate llm models with apps that goes beyond chatbots more reliable

    • @jxnlco
      @jxnlco 8 หลายเดือนก่อน

      thank you! let me know if you have any thoughts, available in the github issues etc or on twitter

  • @nikilragav
    @nikilragav 8 หลายเดือนก่อน +2

    How come instructor requests don't need async await for calling the APIs?

    • @jxnlco
      @jxnlco 8 หลายเดือนก่อน

      cause it just uses `create` vs `acreate`

  • @udaynj
    @udaynj 7 หลายเดือนก่อน

    This just seems to be wrapping a prompt into a python class or function. How is this any more reliable/reliant that just straight prompts/json, since what goes to OpenAI API's is still a prompt. Not sure what I am missing here, but I don't see this being that much more useful.

  • @samuelschmidt6150
    @samuelschmidt6150 7 หลายเดือนก่อน

    How is this different from langchain?

  • @zmeta8
    @zmeta8 6 หลายเดือนก่อน

    I don't want to install the whole rust toolchain to install pydantic on riscv.

  • @alivecoding4995
    @alivecoding4995 7 หลายเดือนก่อน

    Who is the speaker, where is he on Twitter?

    • @jxnlco
      @jxnlco 7 หลายเดือนก่อน

      jxnlco !

  • @jesuslovesyoujohn314-21
    @jesuslovesyoujohn314-21 3 หลายเดือนก่อน

    John 3:16 For God so loved the world, that he gave his only begotten Son, that whosoever believeth in him should not perish, but have everlasting life.
    Isaiah 53:6 All we like sheep have gone astray; we have turned every one to his own way; and the LORD hath laid on him the iniquity of us all.
    Romans 4:5 But to him that worketh not, but believeth on him that justifieth the ungodly, his faith is counted for righteousness.
    1 Corinthians 15:3 For I delivered unto you first of all that which I also received, how that Christ died for our sins according to the scriptures;
    1 Corinthians 15:4 And that he was buried, and that he rose again the third day according to the scriptures:
    Ephesians 1:12 That we should be to the praise of his glory, who first trusted in Christ.
    Ephesians 1:13 In whom ye also trusted, after that ye heard the word of truth, the gospel of your salvation: in whom also after that ye believed, ye were sealed with that holy Spirit of promise,
    Ephesians 1:14 Which is the earnest of our inheritance until the redemption of the purchased possession, unto the praise of his glory.
    Ephesians 4:30 And grieve not the holy Spirit of God, whereby ye are sealed unto the day of redemption.

  • @Nurof3n_
    @Nurof3n_ 8 หลายเดือนก่อน

    I still don't understand how instructor works behind the scenes, for example if I want chapters from a youtube transcript how do I prompt it?

    • @jxnlco
      @jxnlco 8 หลายเดือนก่อน

      you might ask for a chapter with a title and body, and then ask for a list
      class Chapter:
      timestamp: int
      title: str
      summary: str
      class VideoChapters:
      chapters: List[Chapter]
      response_model=VideoChapters
      something. like this! i actually build youtubechapters dot app so there ya go

  • @aitools24
    @aitools24 7 หลายเดือนก่อน +9

    00:13 Pydantic is all you need: Jason Liu
    02:08 Pydantic is a library for data model validation.
    04:07 Pydantic is a trusted library for handling JSON schema and object definition in Python.
    06:06 Pydantic allows for cleaner code and easier maintenance by defining nested references and object behavior.
    07:54 Validation error handling in Instructor helps fix errors in language models.
    09:45 Pydantic allows for structured prompting and object-oriented programming.
    11:38 Language models can output data structures to traverse and process data more effectively.
    13:27 Pydantic enables easy creation and visualization of graph structures.
    15:27 The paraphrasing detection algorithms help identify quotes and provide more accurate answers.
    17:20 Pydantic enables extraction of bounding boxes and structured outputs.
    Crafted by Merlin AI.

    • @richcaputo2929
      @richcaputo2929 7 หลายเดือนก่อน +2

      this "all you need" trend in the AI community is out of hand lol

    • @aitools24
      @aitools24 7 หลายเดือนก่อน

      True haha @@richcaputo2929

  • @egonkirchof
    @egonkirchof 3 หลายเดือนก่อน

    70M downloads a month...how many developers exist in the world ?

  • @bartech101
    @bartech101 7 หลายเดือนก่อน +2

    -- "AI will take software engineer job"
    -- Software engineer: "Hold my beer."

  • @alivecoding4995
    @alivecoding4995 7 หลายเดือนก่อน

    I don't get it. What's new about this? Has been part of langchain for a while now.

    • @jxnlco
      @jxnlco 7 หลายเดือนก่อน +5

      well who do you think introduced this to langchain ;)

    • @alivecoding4995
      @alivecoding4995 7 หลายเดือนก่อน

      @@jxnlco I see & apologize 😀 This has been fantastic. Loved the idea straightaway.
      Are you on Twitter?

    • @jxnlco
      @jxnlco 7 หลายเดือนก่อน

      @@alivecoding4995 @jxnlco
      haha yeah i was the reviewer on the langchain function calling PRS when it came out. but we do validation differently.

  • @jamiekenber
    @jamiekenber 7 หลายเดือนก่อน +1

    This is great. I've been using openai to process survey data in begging mode "please respond with valid json"... try/except/loop
    Anyone know:
    1. If using pydantic, will we hit any issues with async calls?
    2. Can we achieve structured, error-handled responses using langchain instead? Is that the way to go?

    • @jxnlco
      @jxnlco 7 หลายเดือนก่อน +1

      we'll run into issues of using async validators, but async don't be an issue otherwise

  • @_guru
    @_guru 7 หลายเดือนก่อน

    so basically typescript....... ?

    • @jxnlco
      @jxnlco 7 หลายเดือนก่อน +1

      Models have behavior. Type hints do not.
      Base model has runtime type information. Typescript does not. You would need zod. Which lacks coloration of behavior.

  • @spokesperson_usa
    @spokesperson_usa 7 หลายเดือนก่อน

    And people say rustc is too complicated.

  • @atursams5501
    @atursams5501 7 หลายเดือนก่อน

    I wish that was more like a tutorial. It is really hard to follow.

    • @jxnlco
      @jxnlco 7 หลายเดือนก่อน +3

      I'll be making one coming out in a month or so!

  • @burningredphoenix3
    @burningredphoenix3 7 หลายเดือนก่อน

    if you want to integrate with legacy software, named pipes are your friends. thank me later

  • @sajidurrahman7903
    @sajidurrahman7903 6 หลายเดือนก่อน

    If I want so much of gibberish boilerplate with types i will use Java.

  • @bullpup1337
    @bullpup1337 7 หลายเดือนก่อน +2

    Or…. you could just use a statically typed language to begin with….

    • @jxnlco
      @jxnlco 7 หลายเดือนก่อน +5

      an llm is not statically typed.

    • @bullpup1337
      @bullpup1337 7 หลายเดือนก่อน

      @@jxnlco what I meant is have the LLM generate code in a typed language, and write the code that interprets the results in a typed language.

  • @andgate6204
    @andgate6204 7 หลายเดือนก่อน +1

    great talk, but please stop using pydantic!

    • @jxnlco
      @jxnlco 7 หลายเดือนก่อน +2

      what do you suggest?

    • @andgate6204
      @andgate6204 7 หลายเดือนก่อน

      ​@@jxnlco haskell \s
      To be honest, pydantic is so far ahead of other solutions like dataclasses and attrs that I have trouble arguing against them. But they come with their own typesystem and the authors have been known for causing drama in the python community.

    • @RatafakRatafak
      @RatafakRatafak 6 หลายเดือนก่อน

      @@jxnlco attrs

  • @mlcat
    @mlcat 7 หลายเดือนก่อน

    Whoever decided to make a fresh version of pydantic without backwards compatibility (breaking fastapi and everything else), you wasted so many human-hours, and made people remember what are virtual environments. 😡

  • @annaczgli2983
    @annaczgli2983 7 หลายเดือนก่อน +2

    Oh, this is on Python. Sorry, not interested.

    • @ShortSnoph
      @ShortSnoph 7 หลายเดือนก่อน +1

      🐍

  • @ymous7051
    @ymous7051 7 หลายเดือนก่อน

    LMAO, I made something better than this tbh! I don't even want to go on to showcase it to you all, but it's way better than this. dude, I never thought this was a problem to all, lolz. well, I'll be releasing it too then ig. thanks for making this video!

    • @xlagunaa
      @xlagunaa หลายเดือนก่อน

      How its called and where I can try it

    • @abhayshandle
      @abhayshandle หลายเดือนก่อน

      @@xlagunaa okay, so basically i made it a private repo, well this is the idea ig i didn't explain it well back then lol, well, this is how it works, we get a json response, and i convert the json response into a class like object so now instead of accessing data like: response["user"]["name"], you can do it like response.user.name and this just makes ig the typing easier and the code more readable, i mean that's my preference tbh, but then it didn't really work that well, if u liked the concept then ig i'd give u the link to it.

  • @gackerman99
    @gackerman99 7 หลายเดือนก่อน +2

    i hate everything about what llms are doing to us, and the fact that a bunch of hangers-on blockchain type bros are just making as much bullshit as possible on top of them