Training Your Own AI Model Is Not As Hard As You (Probably) Think

แชร์
ฝัง
  • เผยแพร่เมื่อ 21 พ.ย. 2023
  • Read about this in more detail in my latest blog post: www.builder.io/blog/train-ai
    #ai #developer #javascript #react

ความคิดเห็น • 191

  • @Steve8708
    @Steve8708  5 หลายเดือนก่อน +29

    Read more in my latest blog post: www.builder.io/blog/train-ai

    • @balachandar3360
      @balachandar3360 4 หลายเดือนก่อน

      Hi Steve I am building a project on digitizing catalog for a hackathon in your recent video you told you trained a model on detecting images from webpages. if you can open source that it will mean a lot because we are having very less time to complete the project.

    • @saadshajy3849
      @saadshajy3849 หลายเดือนก่อน

      We created a number of GPT powered personal AI assistants for businesess like law, media industry and pretty much for corporate companies where they can integrate internal corporate resources and data securely to retrieve useful information and save 70% cost

    • @Osirisdigitalagency
      @Osirisdigitalagency หลายเดือนก่อน

      ​@@saadshajy3849 thats nice, could you explain the steps that was taken to train these personal models

    • @tribulusfox3816
      @tribulusfox3816 28 วันที่ผ่านมา

      Hi Steve,
      How can I reach you?

  • @JakeLuden
    @JakeLuden 5 หลายเดือนก่อน +118

    This channel just feels like it’s on another level than the rest dude, in every way. The way you turn intimidating concepts into very easy to follow videos is invaluable. Quickly becoming my absolute favorite dev channel.

  • @matiss.
    @matiss. 5 หลายเดือนก่อน +1

    Thank you for putting out these type of videos. Each video has covered advanced topic in a simple manner and made me curious about building something simular!

  • @PotatoMagnet
    @PotatoMagnet 5 หลายเดือนก่อน +16

    Dude, you’re soothing to listen to and it’s nice to hear your chain of thought for approaching a problem. Subbed!

    • @PlsDontSuuue
      @PlsDontSuuue 25 วันที่ผ่านมา +1

      for real, the moment i open other videos and hear an indian accent its an instant nope for me

  • @user-ix8kx8lq3e
    @user-ix8kx8lq3e 5 หลายเดือนก่อน +5

    I was getting stuck on training a chatbot agent using off the shelves LLMs. This video came out at the right time and saved me at least a few days of monkering around. I really like the thought process breakdown instead of just showing jumbo codes like a tutorial channel. You got yourself a sub!

  • @amoledon
    @amoledon 5 หลายเดือนก่อน +17

    This channel is combination of business, technology and sales training anyone needs to build a start up. Kudos to Steve.

  • @pedroaquino5368
    @pedroaquino5368 5 หลายเดือนก่อน +4

    Simple video, complex concept explained beautifully. Thanks Steve!

  • @elliottchong
    @elliottchong 5 หลายเดือนก่อน +5

    This is awesome dude! I watched your last video about developing proper AI apps. Your content on AI is top notch, you have opened my eyes to a whole new world of AI development. Much appreciation, keep up with the good job! Thank you :)

  • @dynastyduck3799
    @dynastyduck3799 14 วันที่ผ่านมา

    So quick and clean, thank you for taking the time to do this; I was trying to understand "Training", and now I know it is a button

  • @Sindoku
    @Sindoku 5 หลายเดือนก่อน +35

    Man Steve, you are QUICKLY becoming my new favorite person to learn advanced coding techniques from. Please keep up the amazing work!

    • @saadshajy3849
      @saadshajy3849 หลายเดือนก่อน

      We created a number of GPT powered personal AI assistants for businesess like law, media industry and pretty much for corporate companies where they can integrate internal corporate resources and data securely to retrieve useful information and save 70% cost

  • @yarapolana
    @yarapolana 5 หลายเดือนก่อน +2

    First time seeing a video by you! Instant follow, really well broken down.

  • @lauri2806
    @lauri2806 4 หลายเดือนก่อน +1

    I appreciate this a ton. Since seeing a nice, unique project a guy had with regarding AI, I only thought about applying multiple AI's together trying to get what I need. Now I have some new insight on what I just might be able to do by myself, with my own skills.

  • @ericschrdr
    @ericschrdr 13 วันที่ผ่านมา

    Thank you for simplifying and proposing a best solution - great stuff! Subscribed 🤘🤘🤘

  • @weiguo9887
    @weiguo9887 5 หลายเดือนก่อน +1

    Exactly what I’d like to learn! Great content! Subbed

  • @pollywops9242
    @pollywops9242 2 หลายเดือนก่อน +2

    Amazing I was looking for exactly this info I was super overwhelmed

  • @franpalokaj
    @franpalokaj 3 หลายเดือนก่อน

    Great stuff Steve, thanks a lot!

  • @divine-favouranigbogu413
    @divine-favouranigbogu413 5 หลายเดือนก่อน

    Thank you for this Steve, you helped me rethink my initial solutiion to a problem I planned to use AI to solve.

  • @noduslabs
    @noduslabs 4 หลายเดือนก่อน

    Your tutorials are great. I also make trainings on a text analysis tool I created so I know when I see good stuff! Thank you! 💪🏼🙏🏼

  • @TshDev
    @TshDev 5 หลายเดือนก่อน

    Amazing video, can’t express enough the amount of value it has. Thank you🙇‍♂️😊

  • @samuelparry5736
    @samuelparry5736 5 หลายเดือนก่อน +19

    I'm interested in the layout hierarchy approach too. What type of model did you use for the layout hierarchy? Do you give it an image and get it to return some HTML / JSON Structure? Or do you transform the existing Figma nodes into some text representation? How did you generate the training data for this step?
    My initial thoughts would be to provide the model an image (the same screenshot used for the object identification step), and get it to return HTML. Then for the training data, when you take the screenshots of the webpage, you also save off the HTML and provide that as training data for the hierarchy model (maybe after some processing).
    If you can share, I'd love to learn more!

  • @pixelperfectpravin
    @pixelperfectpravin 5 หลายเดือนก่อน +1

    I appreciate your content and your efforts

  • @stevenanderson7046
    @stevenanderson7046 5 หลายเดือนก่อน

    Sick dude, agree 100% thanks for the video!

  • @shakapaker
    @shakapaker 5 หลายเดือนก่อน

    I'm so grateful for that type of content

  • @impossibletechie8059
    @impossibletechie8059 5 หลายเดือนก่อน +17

    Would be happy to pay for a complete course on this explain each step in detail

  • @dawid_dahl
    @dawid_dahl 5 หลายเดือนก่อน

    Thanks a lot for posting this. Super interesting and valuable.

  • @Thosch_io
    @Thosch_io 5 หลายเดือนก่อน +13

    Very interesting! The area of opportunity within the AI sector now is niche AI's that does one specific task very well as it is impossible to compete with general models as OpenAI, Anthropic, Google and other large companies has almost infinite money and resources. This means that quality of the data is as you say essential and one of the best ways to build a moat against competitors. I will definitely keep in mind that I should just write normal code to solve as much as possible before creating my own model. Do you use any open source models or have you made one from scratch? Been fiddling with fine-tuning LLMs such as llama-2 7B myself and I am now considering trying to find a model I can train instead with my own data.

  • @youngmediageeks
    @youngmediageeks 5 หลายเดือนก่อน

    Well done, thank you!

  • @adicandra9940
    @adicandra9940 4 หลายเดือนก่อน

    Everything you build is very inspirational, Steve.

  • @samuelparry5736
    @samuelparry5736 5 หลายเดือนก่อน +22

    I see Steve post.... I click...

  • @LukeOtwell
    @LukeOtwell หลายเดือนก่อน

    Love the bullet points, and break down of what each task implemented. I would also add that to get things up and running quicker, see if there is an API for certain steps in your workflow i.e. dynamic scraping or object detection. Eden AI is expensive and slow, but if you haven't nailed down the workflow, it can help you try tools and compare services. Great vid!

  • @RakeshMakhija
    @RakeshMakhija 5 หลายเดือนก่อน

    Super Helpful. Thank you very much

  • @oszi7058
    @oszi7058 5 หลายเดือนก่อน

    Amazing work thank you very much

  • @samsunforev1846
    @samsunforev1846 หลายเดือนก่อน

    I was copycatting others project using Openai wrapper, and it didn't feel right to me, but I didn't know what to do instead. Until I came across your channel and now I have a whole new perspective on creating an SaaS AI product. Thanks Steve!

  • @MARTIN-101
    @MARTIN-101 หลายเดือนก่อน

    world class. top tier video. spot on.

  • @mercedg
    @mercedg 3 หลายเดือนก่อน +1

    I really liked this video, I’m commenting to boost algo and hopefully get more vids like this

  • @gabrielcebria1887
    @gabrielcebria1887 5 หลายเดือนก่อน +1

    Thanks, really cool video!! If you don't mind me asking, what sw are you using for the blackboard presentation? sleek!!

  • @MrThryler444
    @MrThryler444 5 หลายเดือนก่อน +3

    Between turning to basic code and utilizing an LLM (a large "super" model) for generating customized code, I'd suggest storing both the input and output. Over time, you can train your own smaller LLM with fewer parameters, perhaps 7 or 13 billion, specifically tailored for the language/framework. By collecting user feedback through edits to the code and storing this feedback, you can achieve better results than relying on an expensive "super" model like GPT-4.

  • @user-vr1hd8of5s
    @user-vr1hd8of5s หลายเดือนก่อน

    Thanks a lot for this great advice! Cool to see, that the good old "divide and conquer" still is true in the brave new world of AI 😊

  • @jonan.gueorguiev
    @jonan.gueorguiev 5 หลายเดือนก่อน

    Thanks for sharing that! Actually even more interesting is the other specialized model - layout generating one. Can you share something about it as well? 🙂

  • @thalesa4715
    @thalesa4715 3 หลายเดือนก่อน

    golben boy, golden content!

  • @Emmanmanis
    @Emmanmanis 2 หลายเดือนก่อน

    My friend this was a very clear and easy to comprehend video! I know nothing about coding and would like to learn more. Hypothetically if I wanted to develop a AI app. What route would you lay out based on your experience?

  • @alganpokemon905
    @alganpokemon905 5 หลายเดือนก่อน

    this is SICK!!

  • @Sylarleft
    @Sylarleft หลายเดือนก่อน

    We're building an AI solution at my company. And this is the same conclusion we arrived at!
    Glad to see another company illustrating the same approach cu

  • @felipeblin8616
    @felipeblin8616 3 หลายเดือนก่อน

    Great! Thank for the video. One question. How do you train a help desk with this?

  • @keithchen5467
    @keithchen5467 2 หลายเดือนก่อน

    Amazing insights. Can you share more on steps 3 and 4?

  • @JustCode39
    @JustCode39 5 หลายเดือนก่อน

    this was awesome

  • @geepytee
    @geepytee 5 หลายเดือนก่อน

    Very cool Steve!

  • @JetSoftProHQ
    @JetSoftProHQ 5 หลายเดือนก่อน

    Thanks that showed that training your own AI model is simpler than we might think. We, at JetSoftPro, a software development service, also think that custom AI development can achieve faster, cheaper, and better results than using off-the-shelf large models like OpenAI's GPT-3 or GPT-4.

  • @Focus11493
    @Focus11493 หลายเดือนก่อน

    not sure its possible cause i dont know much about Figma, but thinking about Figma mirror with the ability to mirror the design in the browser, id imagine you can take the generated html via inspection tools and feed that to the model instead which would kill the image detection model training step and really what you end up with is a HTML to React Component translator. its still along the same lines of what you end goal was which was taking the initial Figma design and translating it into code. anyways just food for thought from one dev to another! great vid!

  • @PotatoMan1491
    @PotatoMan1491 5 หลายเดือนก่อน

    Perfect explanation depth and usage guidance for noobs like me!

  • @Idan_Peritz
    @Idan_Peritz 5 หลายเดือนก่อน +1

    realy like your videos keep going

  • @abduwahabhasan
    @abduwahabhasan 17 วันที่ผ่านมา

    YT please recommend to me these kinds of videos.

  • @aghasaad2962
    @aghasaad2962 5 หลายเดือนก่อน

    Hello Steve I am a big fan of your content and it really helps me improve my AI knowledge. I am working on a solution that uses Knowledge base of OPEN AI and data (RAG) from a company , I am using OPEN AI API as brain for the system but eventually its pretty costly and doesnt work perfect even if i use good instructions, answers are pretty random and not consistent, can you guide me with this. Thankyou

  • @peter102
    @peter102 2 หลายเดือนก่อน

    very good video. learnt alot

  • @cagdasucar3932
    @cagdasucar3932 หลายเดือนก่อน +3

    I don't understand. Why would you need to run image detection for importing Figma designs? They already have node/element structure. Image block detection is not needed for an importer plugin like this at all. It would be useful if you wanted to build a webpage from mock-ups, not Figma.

    •  22 วันที่ผ่านมา +1

      en mantıklı yorum

  • @shubhamgarg5007
    @shubhamgarg5007 5 หลายเดือนก่อน

    Tried the Vertex AI a few times for different projects. The issue is that the endpoint always returns the same prediction irrespective of the input. This happened to me for 3 different projects over the course of last 1.5 years after which I entirely gave up on GCP.

  • @DioArsya
    @DioArsya 5 หลายเดือนก่อน

    all of this journey is just really a *wow🤯💥*

  • @wei48221
    @wei48221 5 วันที่ผ่านมา

    Awesome!!

  • @b3owu1f
    @b3owu1f 5 หลายเดือนก่อน

    So if you had a couple of computers with say some 3080s sitting around, is it terribly difficult to use those to train on even if it takes a few days or more? I'd be interested in playing around with my one GPU just to see what it can do since it's just sitting around doing nothing. Is python the only way/language that can be used to train models with data? As a go developer, I am rather surprised a much faster runtime language like Go isn't used for something this cpu (and gpu) intensive. With that in mind can you use CPU only if your GPU is low end or does training require GPUs? Subbed btw.. this video was very easy to follow so look forward to more.

    • @N7Tonik
      @N7Tonik 4 หลายเดือนก่อน

      I would say training will be always very much faster with a GPU as GPUs are made for parallel computing

  •  5 หลายเดือนก่อน

    Hey there! It seems like you've put a lot of effort into learning and applying AI models to solve real-world problems. Your dedication and determination are truly commendable. Keep up the great work and continue exploring new possibilities in the field of AI and machine learning. Your insights and experiences are valuable and can inspire others in the community. Great job!

  • @karlbarker2912
    @karlbarker2912 5 หลายเดือนก่อน

    Steve is one of the great brains in next stage technology ❤️

  • @n3x404
    @n3x404 5 หลายเดือนก่อน

    Very interesting !

  • @blender_wiki
    @blender_wiki 4 หลายเดือนก่อน

    In the broader context, investing in large-scale language models (LLMs) for natural language understanding may be deemed inefficient and resource-intensive if such capabilities are not a primary requirement. A conventional coding strategy that involves breaking down workflows into modular functions with standardized algorithms, coupled with the integration of smaller machine learning models, is not only more maintainable but also facilitates better evolution. This approach is not only more realistic but also cost-effective in comparison to the utilization of extensive LLMs.

  • @bennguyen1313
    @bennguyen1313 2 หลายเดือนก่อน

    Any thoughts on tinygrad?
    Also, do you think that eventually the commercial models will be so much better than an open source model that training your data on your own will be for very niche cases?

  • @STREETSURFER98
    @STREETSURFER98 3 หลายเดือนก่อน

    I have some questions.
    After Training the Model in Google vertex you have a working object Detector. Nice!
    But How Do you connect the output of the Model (the coordinates of the bounding boxes) to the Code of the Website?

  • @collectitbydiyversity2442
    @collectitbydiyversity2442 5 วันที่ผ่านมา

    Thanks for shearing. Question , how much was the total cost to build your models?

  • @costathoughts
    @costathoughts หลายเดือนก่อน

    Steve you rock it dude !!! Please create one tool to Java legacy projects lol

  • @drkeithnewton
    @drkeithnewton หลายเดือนก่อน

    Hi, Steve. This is really helpful. once you use Vertex AI for your specialized LMM, is it still stored at Google or are you able to download and run locally? Are you constantly making API calls for this one use case, for example?

    • @n111254789
      @n111254789 หลายเดือนก่อน

      Did you ever find out? I can't get a straight answer either.

    • @drkeithnewton
      @drkeithnewton หลายเดือนก่อน

      @@n111254789 I did not. I guess I'm going to have to build one and find out myself. Wanna partner up?

  • @Jjjabes
    @Jjjabes 5 หลายเดือนก่อน

    cool video... subscribed.

  • @deviladvoc
    @deviladvoc 21 วันที่ผ่านมา

    is this ideal for studying especially in an engineering field (specifically electrical)? i noticed even gemini advance have difficulty understanding and solving intermediate problems.

  • @olegmikulskiy
    @olegmikulskiy 4 หลายเดือนก่อน

    Steve, you mentioned that it's easy to use some python library to run free LLM image model on PC. Can you show it?

  • @danielahern3560
    @danielahern3560 5 หลายเดือนก่อน

    what headphones are you wearing? do they have a bluetooth microphone that you're using in the video?

  • @gemini22581
    @gemini22581 2 หลายเดือนก่อน

    Wat model should I use to train a model using my own data for conversational purposes?

  • @pavi013
    @pavi013 3 หลายเดือนก่อน

    I been wondering this, there's so many general models that are just too big and would love to have smaller model that do specific task.

  • @comicsans8278
    @comicsans8278 5 หลายเดือนก่อน

    what do you use to create the flowcharts?

  • @hidrate3189
    @hidrate3189 5 หลายเดือนก่อน

    Thats so tureeee , Video i was looking for , WOuld you mind giving breakdowns , lol i know it need huge time and vakue consumption but i thing we need an community talking about this.

  • @debasishtripathy530
    @debasishtripathy530 3 หลายเดือนก่อน

    Sure, here's a table summarizing the key differences between using off-the-shelf large language models (LLMs) and training your own specialized AI model:
    | **Characteristic** | **Off-the-Shelf LLMs** | **Specialized AI Model** |
    |---|---|---|
    | **Cost** | Generally more expensive due to high training and usage costs | Can be more cost-effective, especially for specific use cases |
    | **Speed** | Can be slower due to model complexity and inference time | Can be faster due to smaller size and focused training |
    | **Accuracy & Predictability** | May be less accurate and predictable for specialized tasks | Can be more accurate and predictable for specific use cases |
    | **Customization** | Limited customization options | Highly customizable to specific requirements |
    | **Data Requirements** | Require large amounts of data for training | May require less data, especially if using transfer learning |
    | **Training Time** | Can take days or weeks | Can be faster, especially for smaller models |
    | **Expertise Required** | Requires specialized expertise in AI and ML | Can be more accessible with basic development skills |
    | **Use Cases** | Suitable for general-purpose tasks | Ideal for specific, well-defined problems |
    In general, off-the-shelf LLMs are a good starting point for exploring AI solutions and for tasks that require general knowledge and understanding. However, if you have a specific problem that requires high accuracy, predictability, and customization, training your own specialized AI model may be a better option.
    It's important to carefully consider the trade-offs and requirements of your project before deciding which approach to take.

  • @burnt1ce85
    @burnt1ce85 5 หลายเดือนก่อน

    Thanks for this video. I'm sure you're saving us devs time, headaches and expensive bills from using LLMs

  • @user-zr4no9sz8u
    @user-zr4no9sz8u 5 หลายเดือนก่อน

    Is there a similar option of Vertex AI in Microsoft azure.

  • @aketo8082
    @aketo8082 2 หลายเดือนก่อน

    Thank you. How can I do that with GPT4ALL? Looking forward for an instruction, how I can work with GPT4ALL and with my own documents and information. And how to train and fine tune my data, if this is possible. Thanks.

  • @lost4468yt
    @lost4468yt 2 หลายเดือนก่อน

    If you were to add an extra network that goes between the smaller ones, it suddenly starts looking (from a high level of course) like the larger scale networks in the brain. The most obvious example would be the brains Default Mode Network.

  • @kasvith
    @kasvith 5 หลายเดือนก่อน

    hmm how accurate/good are the results?

  • @okopyl
    @okopyl 5 หลายเดือนก่อน

    I was able to train YOLOv8s much cheaper than $60. Less than 5. And it was very good.

  • @user-qp2pb3yy5w
    @user-qp2pb3yy5w 2 หลายเดือนก่อน

    what scraper did you use

  • @muzafferkadir
    @muzafferkadir 2 หลายเดือนก่อน

    great!!!

  • @guitarhero69
    @guitarhero69 5 หลายเดือนก่อน +1

    im new to this kinda thing but what does LLM mean??
    im sorry for asking the dumbest question but can someone recommend more videos like this so i can get a better grasp or whats going on in the video.

    • @user-db9bw5cl1e
      @user-db9bw5cl1e 5 หลายเดือนก่อน +1

      LLM stands for large language models. I presume its self explanatory what that means but a popular example is ChatGPT 3.5 / 4. It runs off of a LLM that takes text input and returns output based on predicting the 'next text' / 'what text should follow'

    • @hidayat1999
      @hidayat1999 หลายเดือนก่อน

      ​@user-db9bw5cl1e it also means that it can only chat right? It is not good for logical discussions like creating code?

  • @wayneswildworld
    @wayneswildworld 5 หลายเดือนก่อน

    Can we use the final product?

  • @meisherenow
    @meisherenow 5 หลายเดือนก่อน

    Something like...using a dumptruck to deliver a small package is okay if you're doing it once or twice, but if you're starting a delivery service, a little effort to reduce cost is a good idea.

  • @user-dh9ys6dg1k
    @user-dh9ys6dg1k 4 หลายเดือนก่อน

    is there alternative for Google's Vertex AI ?

  • @Ma-pz5kl
    @Ma-pz5kl 5 หลายเดือนก่อน

    YEAP data set being clean is the raw quality ingredient for a 3 star michelin receipe. architecture comes second.

  • @kayakMike1000
    @kayakMike1000 5 หลายเดือนก่อน

    I want a model that can take video frames from several cameras and identify pathing information.

  • @jeffdavidson5601
    @jeffdavidson5601 4 หลายเดือนก่อน

    Wait, did I miss the part where you show how you got the text and other Figma objects?

  • @atifulislamasif7893
    @atifulislamasif7893 5 หลายเดือนก่อน

    can anyone tell what is the tool used to draw the flow and diagrams in the vdo?

    • @excalidraw
      @excalidraw 4 หลายเดือนก่อน

      us! ❤️

  • @jakemeyer8188
    @jakemeyer8188 4 หลายเดือนก่อน +1

    What company do you work for? My employer has been looking for a way to generate basic sites from Figma's. A vendor solution is just fine with them.

    • @Steve8708
      @Steve8708  4 หลายเดือนก่อน +1

      Builder.io

    • @jakemeyer8188
      @jakemeyer8188 4 หลายเดือนก่อน

      @@Steve8708 Excellent. Hugely appreciated!!

  • @lutfiikbalmajid
    @lutfiikbalmajid 5 หลายเดือนก่อน

    That's cleanest AI generated code from design (figma)

  • @ReflectionOcean
    @ReflectionOcean 4 หลายเดือนก่อน +3

    - Explore pre-existing models for solutions at 0:53.
    - Break down complex problems into smaller parts at 0:55.
    - Test general models to understand their limitations at 1:19.
    - Consider the costs and feasibility of training large models at 2:17.
    - Solve as much of the problem as possible without AI at 3:13.
    - Identify the right model type for your needs at 4:00.
    - Generate high-quality example data for training at 4:44.
    - Use tools like Google's Vertex AI for model training at 5:54.
    - Deploy and test your model with real-world data at 6:46.
    - Combine specialized AI models and plain code for best results at 8:35.
    - Utilize LLMs for final customization steps when necessary at 8:47.

  • @yaizudamashii
    @yaizudamashii 5 หลายเดือนก่อน

    This video is encouraging but at the same time sounds a bit too good to be true. How many images, which you have supervised each one of them, did you have in the training data?

  • @smallhat1
    @smallhat1 3 หลายเดือนก่อน

    Hey just wondering how much dose it cost to teach and can it do illegal stuff cuz that’s scary if it can

  • @fonsecajjr
    @fonsecajjr 14 วันที่ผ่านมา

    good ad 👍

  • @ragtop63
    @ragtop63 หลายเดือนก่อน +1

    Here's the kicker:
    None of this will work for someone who either doesn't know how to code or is a beginner/intermediate coder.
    If you know just enough programming to know that what a LLM is giving you is either wrong or isn't optimal, but you don't know the solution, there's no way you'll be able to validate the data you want to provide to the custom language model without involving someone (or others) who can help validate that data. At that point, which came first? The chicken or the egg?