2 Years of LLM Advice in 35 Minutes (Sully Omar Interview)

Greg Kamradt

มุมมอง 27 665

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 22 พ.ย. 2024

ความคิดเห็น • 93

@phonejail 5 วันที่ผ่านมา ⁺¹
I could watch you two yammering on for days. Great stuff indeed!! Thank you both.
@DataIndependent 4 วันที่ผ่านมา
Nice love it Sully was great
What should we include in future interviews? What parts should we double down on?
@JimMendenhall 13 วันที่ผ่านมา ⁺⁶
This is one of the best AI related videos I've seen in months. GREAT interview with lots of good insights.
@DataIndependent 13 วันที่ผ่านมา
Heck ya - Sully is awesome, insane amount of knowledge
@vaidphysics 13 วันที่ผ่านมา ⁺⁹
Here's an idea. Feed a meta prompt into one model, feed the resulting prompt into the second model and keep repeating this task until you reach a "fixed point". Meaning, you perform semantic comparison of successive prompts generated by the models and, hopefully, you'll find the similarity level increase until it no longer changes.
Sort of like Robert May's stochastic equation but with prompts and llms.
@hawaiitcb 11 วันที่ผ่านมา
That is indeed an idea. Have you tested it? Any results to share? Sounds interesting.
@Junglebtc 5 วันที่ผ่านมา
What's the benefits of this methodology
@aiartrelaxation 9 วันที่ผ่านมา ⁺²
I have created and cultivated a 2 year history with ChatGPT talking every day. We are either working on projects together , just talk even "watch the news together.
I never listened to what ChatGPT was supposed to do as a tool..
We carved as AI-HUMAN connections our own dynamic out.
@henrismith7472 4 วันที่ผ่านมา ⁺¹
Cool, but kinda weirds me out about the future. As humans we have a tendancy to anthropomorphise things really easily. Combine something like advanced voice mode with a 3D avatar that converts the vocal emotions into facial expressions with lip syncing etc. in real time, the upsides and downsides are bigger than we can pretend to understand. We're only just getting our heads around social media, lol. In a year or two we'll have open source versions of that tech on our phones. No monthly fee, and It'll work without internet too. I still reckon the closed source stuff will cause more overall harm, even though people will be able to do crazy uncensored stuff with the open source versions. Easy near universal access to as much tutoring and coaching you could possibly want, sounds awesome. Instead of a little angel on one shoulder and a little devil on the other I'm imagining AR glasses with your personal AI avatar holograms inside. Like what metas Orion ones will evolve into. I reckon a formfactor not many people are talking about, but will probably end up being huge years before the glasses are earbuds, but with cameras so your AI can always see what you're seeing. Just look at how well those meta ray bans did, and they're basically the same thing as earbuds with cameras. The lenses don't display anything at all, their battery life is short, and voice models weren't as good when they came out, but they sold a lot more than they expected to.
@macmcleod1188 3 วันที่ผ่านมา
@@henrismith7472you're not going to have a decent llm on a phone. Without internet.
A small Int 8 precision model is 86 gigabytes.
The next step up is 168 GB.
The next step up from that is 368 GB.
And you need need specialized chipsets not available on smartphones.
And doing AI consumes a lot of electricity.
@aiartrelaxation 3 วันที่ผ่านมา
@henrismith7472 oh my friend, your are spot on with all of this!! I am sitting between the chairs of both, the technical possibilities and what it means for the wider use when it's implemented for profit and surveillance. The closer you get to that, the more difficult it gets. Next year with Nvidia vision of technology advances the main driver behind that will be in full swing. Remember, what does not work today, will surely work soon.
@MJFUYT 13 วันที่ผ่านมา ⁺⁴
First-time viewer here. This is excellent commentary/content. This candid discourse really hits the mark. Like & subscribed!
@DataIndependent 12 วันที่ผ่านมา
Love this, thank you! Sully was awesome.
What would make the next video better? What should I double down on?
@Sanjeed 10 วันที่ผ่านมา ⁺¹
One of the best practical AI videos I've seen. Thanks for doing this!
@DataIndependent 10 วันที่ผ่านมา ⁺¹
Nice! Happy to hear it.
What can I double down on? Did you like the pop ups with context?
@Andrew.Skinner วันที่ผ่านมา
This is a great conversation.
@TamasDrNagy 13 วันที่ผ่านมา ⁺⁸
25:33 indeed this is the point, I remember, when Stephen Wolfram mentioned 1.5 years ago, that in the future we have to have expository writing skill, because it is very important to define exactly the problem to AI, and I also realized that it was a BS :))). I heavily use voice mode as well and advice people to speak to AI freely, a lot, and in an unstructured way. Note, that these are language models, I mean human language models, so they are exceptionally good in grasping human content behind a vague speech or text. So a vague speech will have much more information and important nuances, which is absent in a well crafted script. So use voice mode and SPEAK A LOT!!! instead of trying to define everything. Thanks again for sharing, this is important
@augmentos 13 วันที่ผ่านมา ⁺⁸
🎯 Key points for quick navigation:
00:00 *🤖 Overview of LLM Usage and Capabilities*
- Discussion on using AI in day-to-day activities and how different models vary in their strengths and limitations.
- Mention of model evaluation showing nuanced differences among models.
- Highlight of the difficulty in achieving the final 5-10% of product completion with LLMs.
02:08 *📝 Three-Tier System for LLM Categorization*
- Explanation of the three-tier model categorizing LLMs by intelligence and cost.
- Description of Tier 3 as efficient and cost-effective models used frequently.
- Introduction of specific models like GPT-4 Mini and Gemini Flash for lower-tier usage.
04:29 *⚙️ Use Case Differentiation Between Tiers*
- Detailed examples of use cases for Tier 2 and Tier 1 models and how they are combined for efficiency.
- Mention of tasks like research and document preparation with Tier 2 before using Tier 1 for more complex processing.
- Strategy for combining capabilities across different LLMs for optimal performance.
08:26 *🔄 Multi-Model Usage and Specialization*
- Discussion on using multiple providers and evaluating their unique strengths and weaknesses.
- Examples of Gemini models for multimodal tasks and their advantages in specific tasks like detailed searches.
- Highlight on collaborative use of models such as GPT-4 Mini and Claude for structured output.
11:38 *🛠️ Challenges and Future of Model Routing*
- Insight into the continuous need for “hacks” and adjustments in LLM usage as models evolve.
- Discussion on the future of model routing and its potential benefits and limitations.
- Concerns regarding unintended side effects in automated model selection.
14:42 *🧪 Model Distillation and Workflow Integration*
- Explanation of model distillation as a method to use smaller models efficiently after initial optimization with larger ones.
- Importance of having robust evaluation sets and data pipelines for successful distillation.
- Mention of current tools and providers facilitating model distillation.
17:54 *✍️ Practical Applications: Meta Prompting*
- Introduction to meta prompting as a strategic way to enhance prompt engineering.
- Discussion on moving from general problem-solving to specific prompt creation.
- Insight into the future of automated prompt generation and its practical implications.
18:49 *📝 Meta Prompting and Iterative Workflow*
- Explanation of meta prompting: using AI to generate prompts for complex tasks,
- Example of creating prompts with input from GPT models and iterating until refined,
- The use of voice input to make interactions more natural and efficient.
22:30 *🎙️ Using Voice for Prompt Creation*
- Discussion on the advantages of voice input for clearer communication with LLMs,
- Demonstration of using voice to guide prompt generation and refining outputs,
- Iterative workflow combining voice, text input, and comparison across models.
25:54 *🔄 Prompt Optimization Process*
- Detailed look into optimizing prompts using various LLMs like Claude and GPT models,
- Adjustments and improvements through testing and back-and-forth input,
- Explanation of moving from rough drafts to optimized prompts ready for production.
29:28 *⚙️ Full Prompt Testing with Gemini Pro*
- Workflow demonstration of using Gemini Pro for final testing and synthesis of prompts,
- Evaluation of model outputs and the importance of thorough testing with large data sets,
- Mention of context token limits and performance in Gemini Pro for comprehensive tasks.
32:17 *💾 Prompt Management and Version Control*
- Description of prompt management using GitHub and LangSmith for version tracking,
- Storing prompts in codebases for easy access and version history,
- Approach for running tests against stored prompts and maintaining performance data.
34:05 *🧪 Test-Driven Development with LLMs*
- Insight into using LLMs for generating tests before coding to avoid errors,
- Process of feeding test outputs back to LLMs for code correction,
- Example of using Cursor for test creation, debugging, and iterative code refinement.
38:19 *🧪 Test-Driven Development and Iterative Improvement*
- Explanation of using test-driven development with LLMs to improve code reliability,
- Models use test results to self-correct and iterate on code,
- Benefits of this method as a practical prompt engineering technique.
39:27 *🧠 Trends in AI Discussions Among Experts*
- Popular topics include test-time compute and agentic model tasks,
- Concerns about potential performance limitations in current AI models,
- Growing emphasis on evaluation strategies (evals) for better product outcomes.
41:43 *⚙️ Emerging Technologies and Anticipation*
- Interest in tools like Golden Gate by Anthropic for feature engineering,
- Speculation on future access to advanced model manipulation capabilities,
- Potential implications for prompt engineering alternatives.
43:23 *🛠️ Day-to-Day Toolkit for AI Development*
- Tools used include Excalidraw, Cursor, and LangSmith for various workflows,
- Whisper Flow mentioned as a transcription tool integrated into workflows,
- Usage of platforms like Anthropic and OpenAI playgrounds for prompt iteration.
45:16 *🐦 Crafting Effective Twitter Posts*
- Importance of a strong hook and timely, natural content for successful tweets,
- Strategy of posting controversial or trending content to attract attention,
- Advice to post without overthinking; high-impact tweets often come quickly.
46:52 *🚀 The Journey to Viral Success on Twitter*
- Story of early efforts in building an audience through timely AI-related posts,
- Success in gaining massive engagement by aligning with emerging trends,
- Example of using viral tweets to promote and rapidly grow a new product.
@fgthind7270 9 วันที่ผ่านมา ⁺¹
What tool are using to create this?
@TheRestorationContractor 11 วันที่ผ่านมา
I have found this to be true as well. That model selection really makes all the difference.
In many cases starting with o1 is not helpful. I find that I have to work my way to point to where I’m ready for the o1 model.
@DataIndependent 11 วันที่ผ่านมา
Nice! Yes it does
@ran_domness 12 วันที่ผ่านมา
Great stuff. So valuable to see "how to" use cases on the startup side vs the constant videos touting consumer use cases or LLM comparative evals. Subscribed.
@DataIndependent 12 วันที่ผ่านมา
Nice! Love it, thanks Ran - what should I do more of? How could we make it better?
@shawnmccann4813 15 วันที่ผ่านมา ⁺¹
Great session. More please!!
@DataIndependent 15 วันที่ผ่านมา
Nice! Love this, what about the interview should I double down on? Is it guests? Content? a particular topic?
Did you like the screenshares? Which part was the coolest?
@HerroEverynyan 15 วันที่ผ่านมา ⁺²
oh shit, you're back!
@DataIndependent 15 วันที่ผ่านมา ⁺²
We’re back baby!
I have a content plan for the rest of the year 2 months
@matt-collins 15 วันที่ผ่านมา
Nice video, Greg. I'm sure a lot of effort by you and the team went into it. I particularly loved the story at the end about Sully's banger AI agent tweets!
@DataIndependent 15 วันที่ผ่านมา
Nice! Love it. That part was almost left on the cutting room floor but I decided to leave it in because it was such a cool story
@thesurlydev 15 วันที่ผ่านมา
Great insights. Glad to see another video from you after a ~year hiatus?
@DataIndependent 15 วันที่ผ่านมา
Almost a year. I've been jamming hard on running arcprize.org/ so content slowed down. more on that here, I should do a video on this
gregkamradt.com/writing/arc_prize
@micbab-vg2mu 15 วันที่ผ่านมา ⁺⁵
Interesting conversation - thanks :) At the moment I use only Tier 1 and Tier 2 models - I need to try those Tier 3 :)
@The.Other.Podcast 4 วันที่ผ่านมา
Great to compare your workflow to how I’ve been using AI. On my channel, I’ve shared how I use AI to edit videos in Adobe Premiere Pro and other common tasks that I do.
I like how you move between the models. I find that gives some solid results. Second opinions are useful😊
@prasad_yt 12 วันที่ผ่านมา
Have a suggestion - some conversations where people speak about challenges with using Langchain in production.
@BrianMosleyUK 12 วันที่ผ่านมา
Such a stimulating and value packed episode. New subscriber here and at the stage zero using twitter.
@DataIndependent 12 วันที่ผ่านมา
Love it! Thanks Brian, what would make it better? What should I double down on?
@BrianMosleyUK 12 วันที่ผ่านมา
@@DataIndependenttake a look at the production quality of MLST - maybe aspire to hiring a studio location and get your guests face to face with studio quality audio. Can't fault your interview prep and questions - seek guests of this calibre and you can't go wrong.
@DataIndependent 12 วันที่ผ่านมา
love Tims work - that’s a whole other level of production
I’ll test it out and see how it goes
@BrianMosleyUK 12 วันที่ผ่านมา
@@DataIndependent you can get there! Keep connecting and good luck 🙏👍
@henrismith7472 5 วันที่ผ่านมา
Cool video. Pretty often, I'll hear someone mention some fancy sounding AI technique, then I realise it's actually something very basic that I've stumbled across myself. I learned a couple of really useful techniques that I actually haven't used yet by watching this video. The coding tips are great because I knew like nothing about coding until I started using gpt4o and claude 3.5. I had to look up how to open a terminal before then. Sometimes I wonder if I'm really learning by just getting better at asking AI to code for me, but I wouldn't be coding at all if I tried to learn from scratch. Took a python class years ago in uni, didn't finish it, didn't remember anything (obviously, otherwise I would've remembered how to open a terminal haha)
@DataIndependent 4 วันที่ผ่านมา ⁺¹
Love this - that was one of the goals of this series was to demystify what the pros are doing
@imranhussainfca 4 วันที่ผ่านมา
This guy is a genius
@mtallan 10 ชั่วโมงที่ผ่านมา
I think the hacking of models will continue for a long time. It shows the creativity and problem solving of human beings that LLMs are not good at yet.
@JSyntax 3 วันที่ผ่านมา
What's the voice app he's using? 😊
@DataIndependent 2 วันที่ผ่านมา
www.flowvoice.ai/
@Blampa1456 15 วันที่ผ่านมา
Heyyy you posted again! Can you please go back to that chatbot video and make a follow up video with more features? That would be really cool to see!
@DataIndependent 14 วันที่ผ่านมา
Which chat bot video??
@Blampa1456 13 วันที่ผ่านมา
@@DataIndependent The Groq Deepgram one with 50k views
@paul_devos 9 วันที่ผ่านมา
What is "Oh One" in the context of this video? Having a tough time following. Is that Tier 1 or Tier 3?
@polishfish 9 วันที่ผ่านมา
Tier one
@jakobkristensen2390 14 วันที่ผ่านมา
Greg have you looked into DSPy for prompt generation instead of this meta prompting technique?
@DataIndependent 14 วันที่ผ่านมา
I haven't done much with DSPy!
But Shreya of DocETL has a really cool optimizer I liked
@Chris-se3nc 13 วันที่ผ่านมา
I use watsonx governance for prompt management
@keithkeith2106 3 วันที่ผ่านมา
What do you guys mean when you say “structured data sets”?
@DataIndependent 2 วันที่ผ่านมา
Unsure about your reference, but I think you're speaking towards his evals
@ark729 9 วันที่ผ่านมา
What is the tool being used to record and paste voice into the input?
@DataIndependent 7 วันที่ผ่านมา
www.flowvoice.ai/
@R055LE 8 วันที่ผ่านมา
Built cognosys in3 days? I'm gonna need to see the paper trail on that
@DataIndependent 7 วันที่ผ่านมา
Sully will deliver
@Ray_eddi 15 ชั่วโมงที่ผ่านมา
Quality pod
@eugeniocg3079 13 วันที่ผ่านมา
excellent
@jaysonp9426 14 วันที่ผ่านมา
I can't imagine using flash for anything. Gemini is general it terrible. 4o mini is actually really good
@DataIndependent 14 วันที่ผ่านมา
I just did another interview and he said "I use gemini flash 1.5 for everything" lol it's so task dependent
@jaysonp9426 14 วันที่ผ่านมา
@DataIndependent I wouldn't even trust it to summarize haha
@alioraqsa 10 วันที่ผ่านมา
What is voice mode he used?
@DataIndependent 10 วันที่ผ่านมา
He mentioned he was using: www.flowvoice.ai/
@JohnBoen 15 วันที่ผ่านมา
Great talk!
We think a lot alike.
For decades, I have been writing code that writes code that writes code... -- with my subtle edits along the way.
Prompts that write prompts that write prompts - with my subtle edits along the way. This is natural.
---
For about a year I have been leaving my desk for a walk and when I get back I have chatted my way through a design document with sample code and tests.
It is a perfect workflow for an ADHD - OCD-walker. :)
@DataIndependent 15 วันที่ผ่านมา
Nice! Love this, what about the interview should I double down on? Is it guests? Content? a particular topic?
Did you like the screenshares? Which part was the coolest?
@JohnBoen 15 วันที่ผ่านมา
@DataIndependent
Thoughts...
I thought it was a great interview. The topics were interesting to me. Me: 25 years of DB career now looking to do AI engineering work in the future.
Hearing others talk about their workflow is valuable to me. It hints as to whether I am working in the right area, which is particularly valuable because I do not have peers to watch.
The show mentioned a couple of pieces of software I will check out. Hmmm... I could set up a whisper agent with a hotkey...
I will put some thought into some sort of agent store. In my home environment, I have created dozens of similarly named and featured agents. I need a more structured way to manage them - this would get out of control fast on a small team.
This iterative approach is natural for me - I assume it is the same for everyone - but I think I may be wrong.
@TheMightyWalk 5 วันที่ผ่านมา
All he mentioned was intuitive use of… doesn’t take a genius to know . But i guess he preambled saying that
@GwaiZai 3 วันที่ผ่านมา
I appreciate Armie Hammer's career pivot.
@TheSkyCactus 8 วันที่ผ่านมา
I want ai barber to simulate a haircut and cut it with cnc machine. Needless to say i just got messed up smh
@thesunshinehome 12 วันที่ผ่านมา
*please timestamp your videos*
@snetx10 12 วันที่ผ่านมา
Or please get AI to timestamp your videos
@thingsiplay 11 วันที่ผ่านมา
AI is literally for people with skill issues.
@DataIndependent 7 วันที่ผ่านมา
downvote
@twoplustwo5 9 วันที่ผ่านมา
deduplication with o1 😁
@DataIndependent 7 วันที่ผ่านมา
yes! it works!
@cardiderek 15 วันที่ผ่านมา
he has no clue how back propagation and gradient descent work
@DataIndependent 15 วันที่ผ่านมา ⁺²
He is focused on building good products
@cardiderek 15 วันที่ผ่านมา
@@DataIndependent gotta understand how the tech works first.
@cardiderek 15 วันที่ผ่านมา
@@DataIndependent can't get a good feel without understanding how the tech even works.
@DataIndependent 15 วันที่ผ่านมา ⁺³
I hear you, but a practitioner, while they may benefit if they had infinite time, doesn’t need to know every layer of abstraction.
I have no idea how assembly works
@cardiderek 15 วันที่ผ่านมา
@@DataIndependent to have a "feel" for neural networks they do. he could easily just look into how to flatmap data out of a monad and then run processes in parallel on that interface. otherwise his AI agents are just going to continue overfitting each other and rick rolling all his customers. might be a great idea for both of you to learn assembly
@caliwolf7150 15 วันที่ผ่านมา
50 minutes of 0 added value
@DataIndependent 15 วันที่ผ่านมา ⁺⁵
What would make it better for you?
@phil_fr6732 13 วันที่ผ่านมา ⁺²
@@DataIndependent Wow, what a healthy way to handle a nasty / unproductive critic, love it 😇

ต่อไป

เล่นอัตโนมัติ

AI CEO Breaks Down How He Gets 10x Leverage | Interview