Real time AI Conversation Co-pilot on your phone, Crazy or Creepy?

แชร์
ฝัง
  • เผยแพร่เมื่อ 20 ม.ค. 2025

ความคิดเห็น • 96

  • @m4tthias
    @m4tthias 10 หลายเดือนก่อน +44

    Would be funny when both parties of an interview who uses the co-pilot finds out about each other.

    • @jaredlalal
      @jaredlalal 10 หลายเดือนก่อน +1

      Oh it'll happen, it already does to an extent, we use ai to help us find jobs, they use them to scim threw applicants

    • @figs3284
      @figs3284 10 หลายเดือนก่อน +1

      You just gave me an idea 👍

    • @adolphgracius9996
      @adolphgracius9996 10 หลายเดือนก่อน +1

      Lmao😂😂😂

    • @jaredlalal
      @jaredlalal 10 หลายเดือนก่อน

      @@figs3284 I love random ideas :0, can I have a spoiler? 🙂

    • @TyFactorCGI
      @TyFactorCGI 8 หลายเดือนก่อน

      this is so great!!! what fun! I picture the underqualified candidate who's code crashes mid interview and all of a sudden can't answer 1+1. (:
      But seriously, I have some cognitive disabilities, even something to keep me on track and succinct in answering my questions is absolutely invaluable. I am seeing this as a reminder tool for me, not an answer tool per se. so powerful and enabling for somebody like me. I imagine it helping my confidence which will allow me to be more of myself and let my natural attitude come out instead of being clouded with mumbling, with racing thoughts.. (as many of my other high-frequency colleagues have)
      thank you very very much for taking the time to share with us!

  • @enlaichu
    @enlaichu 10 หลายเดือนก่อน +3

    Thanks!

  • @HarpaAI
    @HarpaAI 10 หลายเดือนก่อน +1

    🎯 Key Takeaways for quick navigation:
    00:00 *🎬 Introduction to Real-Time Conversation Co-pilots*
    - Introduction to the concept of real-time conversation co-pilots.
    - Overview of the challenges and potential benefits of using AI in conversations.
    - Discussion of past attempts and the need for low-latency solutions.
    01:39 *🚀 Real-Time Co-pilots in Professional Settings*
    - Examples of real-time conversation co-pilots in professional contexts, such as aerospace engineering and job interviews.
    - Consideration of the value and ethical implications of using AI in interviews and professional settings.
    - Potential for improving interview processes and enhancing communication skills.
    03:29 *🤝 Real-Time Co-pilots for Social Interactions*
    - Exploration of the benefits of real-time conversation co-pilots in social interactions.
    - Personal anecdotes about the challenges of social interactions and the potential support AI could provide.
    - Discussion of the broader applications beyond professional contexts.
    04:24 *📱 Building Real-Time Conversation Co-pilots: Web and Mobile Apps*
    - Overview of building real-time conversation co-pilots, including web and mobile applications.
    - Introduction to the technical components required for such applications.
    - Step-by-step guide on using platforms like Replicate for deploying AI models.
    04:53 *⚙️ Technical Challenges: Real-Time Transcript and Fast Inference*
    - Discussion of the technical challenges involved in achieving real-time transcription and fast inference.
    - Solutions for real-time transcription, including recurrent loops and optimizations for accuracy.
    - Strategies for achieving fast inference with large language models, such as model selection and optimization techniques.
    10:19 *🛠️ Implementing Real-Time Conversation Co-pilots: Demo and Iterations*
    - Overview of the iterative process of building real-time conversation co-pilots.
    - Demonstration of a web application prototype and its functionality.
    - Integration of AWS services and Replicate for deploying and running AI models.
    17:30 *🛠️ Backend service setup and frontend basic structure*
    - Setting up the backend service involves defining routes and handling requests.
    - The frontend structure includes defining HTML elements and basic CSS styling.
    - Functionality such as recording audio and fetching suggestions is outlined.
    23:18 *📱 Exploring Whisper Kit for mobile deployment*
    - Whisper Kit, an open-source fine-tuned model, enables deploying speech-to-text models on mobile devices.
    - Real-time transcription with minimal latency is demonstrated on devices like the iPhone.
    - Whisper Kit's optimization allows for efficient use of resources on mobile hardware.
    24:01 *🎁 Unboxing and setup of Apple Vision Pro*
    - The unboxing and setup process of Apple's Vision Pro headset is showcased.
    - Details about accessories, boot-up procedure, and initial impressions are provided.
    - The demonstration highlights optimizations for improved accuracy in AI-generated transcripts.
    25:14 *💻 Development using Whisper Kit for iOS apps*
    - Utilizing Whisper Kit Swift package to develop iOS apps for real-time transcription.
    - Setting up the project in Xcode and configuring model selection and streaming options.
    - Overview of the app's structure and explanation of key functionalities for transcription and model loading.
    31:03 *🚀 Integration and testing of conversation co-pilot features*
    - Defining variables and functions for user input, API interaction, and response handling.
    - Adding UI components for user prompts, transcription display, and interaction buttons.
    - Demonstrating the setup process and testing the app on an iPhone for real-time transcription and suggestion generation.
    Made with HARPA AI

  • @haraldlasshofer214
    @haraldlasshofer214 10 หลายเดือนก่อน +3

    this is crazy. Love your content. I will definitely use the end product if you publish one. Keep the great work up man!

  • @Romathefirst
    @Romathefirst 10 หลายเดือนก่อน +19

    Best AI channel hands down

  • @mmarrotte101
    @mmarrotte101 10 หลายเดือนก่อน +2

    I've been thinking about this idea for the last 8 months - so pumped to give it a shot, thanks a ton for sharing!

    • @bassamel-ashkar4005
      @bassamel-ashkar4005 10 หลายเดือนก่อน

      Are you trying to build a phone agent?

    • @mmarrotte101
      @mmarrotte101 10 หลายเดือนก่อน

      @@bassamel-ashkar4005 phone is cool but just generally a conversational agent to utilize when having any conversation anywhere. It seems to be an extremely useful concept in so many ways.

  • @Jim-ey3ry
    @Jim-ey3ry 10 หลายเดือนก่อน +16

    Holy, whisper kit is insane, running the model on mobile device directly gonna be the future;
    Also thanks for sharing Replicate, didn't know it is free to use!

    • @nftawes2787
      @nftawes2787 10 หลายเดือนก่อน +4

      "If you’re new to Replicate, you can try us out for free, but eventually you’ll need to enter a credit card."

  • @Soniboy84
    @Soniboy84 10 หลายเดือนก่อน +3

    Hey Jason, interested in the iPhone project (tho android would be better).
    How about adding an extra layer to this flow that translates the text to a different language and speaks it out loud?
    We could then have real-time conversations with someone who isn't the same language as us.
    Here's the use case. You hand over the left earbud to your Polish friend. You use the right earbud. Ideally both earbud would have a mic.
    The application on the phone would:
    listen to the speech in English => do a transcribe on the chunk => do a translate to Polish on the chunk => speak out the chunk in Polish.
    Then when the Polish person is talking, it'd work the opposite way.

  • @asbjborg
    @asbjborg 10 หลายเดือนก่อน

    I need this for screening consultants. They are very smooth talkers, the minute you scratch their gloss they fall through, unless they actually know what they are talking about. Can't wait to see your app. Thanks for sharing!

  • @brianhe2690
    @brianhe2690 10 หลายเดือนก่อน

    There can be lots of good applications for this. Happy to explore and share. Keep the good work 👍

  • @matten_zero
    @matten_zero 10 หลายเดือนก่อน

    Every video you drop is a gem.
    Deepgram STT and Groq's LPUs make this possible.

  • @RatherBeCancelledThanHandled
    @RatherBeCancelledThanHandled 8 หลายเดือนก่อน +1

    Awesome Job. You really need to go commercial with your ideas .

  • @Scheevel67
    @Scheevel67 10 หลายเดือนก่อน +1

    All your videos are amazing!
    One gotcha, if AWS give you an "access denied" error you may need to update S3 policy to add a "/*" onto the back of the resource ARN - the wildcard permits access to all bucket objects

    • @haowu8448
      @haowu8448 10 หลายเดือนก่อน

      thanks bro it worked!

  • @NatGreenOnline
    @NatGreenOnline 10 หลายเดือนก่อน

    This is amazing Jason. Just subscribed to your channel and am very interested to see the iPhone app you build and other AI projects that you're working on.
    I see a lot of amazing use cases for this like overcoming objections on sales calls, asking better questions on podcasts / interviews, etc.

  • @Royaltea_Citizen
    @Royaltea_Citizen 10 หลายเดือนก่อน

    I look forward to you rolling out you app Jason! It would be amazing to run that locally on my phone!

  • @carterjames199
    @carterjames199 10 หลายเดือนก่อน

    Awesome video again Jason good stuff

  • @lakergreat1
    @lakergreat1 10 หลายเดือนก่อน

    Yes definitely interested in the app, please notify when done!

  • @the3rdworlder293
    @the3rdworlder293 10 หลายเดือนก่อน

    tremendous work dude

  • @jpgallegoar
    @jpgallegoar 10 หลายเดือนก่อน +1

    This running on the new Groq arquitecture would be awesome

  • @jasonfinance
    @jasonfinance 10 หลายเดือนก่อน +5

    I tried to build a similar interview co-pilot before too, but the latency made it not usable;
    Can't believe how far we went with those model performance past few month!

    • @SahilP2648
      @SahilP2648 10 หลายเดือนก่อน +1

      You do know that you can use OpenAI API to get sub 2-3 sec outputs right? The only thing not possible on GPT is a system prompt (I think, I have never needed to use OpenAI's API). But on my Mac with a capable 7b parameter like Mistral, or Mixtral model, the output is also within 5 secs (especially when loaded in GPU memory). I prefer local generation vs online since in local you can modify the system prompt and you can customize the output a lot more.

  • @sandrofelder
    @sandrofelder 10 หลายเดือนก่อน

    Yes would be highly interessted to see this app in the store!

  • @webinnovationspartners9293
    @webinnovationspartners9293 10 หลายเดือนก่อน

    Love your work. Great content. Yes, please let me know about the end product once you polish it please.

  • @automatalearninglab
    @automatalearninglab 10 หลายเดือนก่อน +1

    Love your videos, you do an amazing job of packing high quality information into a 30 minutes ish video. 🎉 thanks a lot!

    • @AIJasonZ
      @AIJasonZ  10 หลายเดือนก่อน +2

      thank you so much for your feedback!

  • @kenchang3456
    @kenchang3456 10 หลายเดือนก่อน

    Excellent video and very timely for my interests. Thank you very much.

  • @kate-pt2ny
    @kate-pt2ny 10 หลายเดือนก่อน

    Great production, thanks for sharing

  • @surfkid1111
    @surfkid1111 10 หลายเดือนก่อน

    Don’t have enough thumbs for that, great content.

  • @akellasoumya3432
    @akellasoumya3432 10 หลายเดือนก่อน

    Excellent content

  • @VaibhavShewale
    @VaibhavShewale 10 หลายเดือนก่อน +1

    so in real life f2f talk we have to hold mobile to have a convo with other?

  • @Paktalkuncovered
    @Paktalkuncovered 9 หลายเดือนก่อน

    Could you make a detailed video on how to make this?

  • @luishiluy
    @luishiluy 10 หลายเดือนก่อน

    I would be super interested. Thanks for your magic!

  • @marcus_AI_Advisor
    @marcus_AI_Advisor 10 หลายเดือนก่อน

    Definitely interested

  • @Nitralans
    @Nitralans 9 หลายเดือนก่อน

    quick question, If I were to run this offline what would the token speed look like?

  • @augmentos
    @augmentos 10 หลายเดือนก่อน +1

    Why use small model when you can use large model and Qroq?

  • @Silberschweifer
    @Silberschweifer 10 หลายเดือนก่อน +1

    why no search or/and RAG func call?
    with thsi even the small fats model can become more knowledge

  • @ashishmaru3883
    @ashishmaru3883 2 หลายเดือนก่อน

    Impressive, I was thinking about to implement this in Android

  • @Qwerty-ff1cr
    @Qwerty-ff1cr 10 หลายเดือนก่อน +1

    Why can't I see this video from your channel on my laptop? Lol. Im on my phone now but is anyone able to see this video from the computer?

  • @csepartha
    @csepartha 10 หลายเดือนก่อน

    Kindly make a tutorial to fine tune an open source LLM model on many pdfs data. The fine tuned LLM must be able to answer the questions from the pdfs accurately.

  • @blackhat856
    @blackhat856 8 หลายเดือนก่อน

    Is it possible to have an AI copilot real time in game ,steamvr rec room ?

  • @mikew2883
    @mikew2883 10 หลายเดือนก่อน

    Very cool! 👍

  • @build.aiagents
    @build.aiagents 10 หลายเดือนก่อน

    Phenomenal

  • @senzz97
    @senzz97 10 หลายเดือนก่อน

    This is amazing, thank you for great content. I wonder, I tried this (i'm a beginner with a newly found passion for learning python). I don't have the same amount response on the web app like you have, I get a 8-10 second delay both with the transcript and suggestion. How can I fix this?

  • @danielmacbride525
    @danielmacbride525 10 หลายเดือนก่อน

    hell yeah im interested in the app

  • @magic-4-ai
    @magic-4-ai 10 หลายเดือนก่อน

    When your app will be published in istore? Or maybe it is already?

  • @mr.mikaeel6264
    @mr.mikaeel6264 10 หลายเดือนก่อน

    Ok now i want to build an agent that can listen to videos, copy and build the apps. There is way too much cool AI stuff to try and i have other hobbies and a life too xD

  • @nexuslux
    @nexuslux 10 หลายเดือนก่อน +2

    Thanks for sharing. Don’t sit on the translation potential for this as well ;)

  • @TaktAkira
    @TaktAkira 10 หลายเดือนก่อน

    Is there something like this for the android?

  • @mackroscopik
    @mackroscopik 10 หลายเดือนก่อน

    In the future, Neuralink will be wired directly to the brain activating the vocal chords so that the interviewer is mind blown on how you're answering the questions even though it appears you fell asleep during the interview.

  • @NatGreenOnline
    @NatGreenOnline 10 หลายเดือนก่อน

    Me: "I think I can do this. I'm going to give it a shot!"
    Tries executing this following Jason's steps. Gets error message at first step when installing replicate into VSC. "command not found" .
    Watches 3 videos to see if I can figure out why VSC is giving me this error. Still not working.
    Feels defeated and quits :(

  • @marcc0183
    @marcc0183 10 หลายเดือนก่อน +1

    Can we do this but in Google meet or similar?

    • @elskipvers
      @elskipvers 10 หลายเดือนก่อน

      Yes! I need this for zoom, teams and meet

  • @YipMilk
    @YipMilk 10 หลายเดือนก่อน +2

    It's not going to work if the interviewer is able to track your eye movements through AI which can tell you are reading from a script.

    • @free_thinker4958
      @free_thinker4958 10 หลายเดือนก่อน

      Connect it then to a suitable glasses

    • @peterparker7146
      @peterparker7146 6 หลายเดือนก่อน

      Have you heard about eye tracking by nvidia

  • @nexuslux
    @nexuslux 10 หลายเดือนก่อน +4

    Imagine using this with Groq api inference speeds

    • @messostuff6829
      @messostuff6829 10 หลายเดือนก่อน

      exactly my thoughts.

    • @brandonheaton6197
      @brandonheaton6197 10 หลายเดือนก่อน

      For sure- two orders of magnitude faster inference is bringing us a whole new world and fast - by the end of march it will be evident

  • @saiaditya4397
    @saiaditya4397 10 หลายเดือนก่อน

    Can we use this model on ESP 32?

  • @Silberschweifer
    @Silberschweifer 10 หลายเดือนก่อน

    oh, another step to local speaking AI Assistant like Cortana or Jarvis

  • @aldousd666
    @aldousd666 10 หลายเดือนก่อน

    This is a great tutorial and illustration of how to use services, but your bucket policy on Amazon needs to be locked to just your user so nobody can mess with your bucket and hijack it. It's one of the most common ways people get their data leaked.

  • @dawn_of_Artificial_Intellect
    @dawn_of_Artificial_Intellect 10 หลายเดือนก่อน

    Hi i am very interested in this development

  • @tiberiumihairezus417
    @tiberiumihairezus417 10 หลายเดือนก่อน

    What's the point of passing the interview when there is a probation period in which real tasks need to be accomplished. And if those tasks are still doable by LLMs, it is just a matter of time until that position will be completely automated.

  • @chivesltd
    @chivesltd 10 หลายเดือนก่อน

    lol cheesing interview

  • @abhijeetkumar1552
    @abhijeetkumar1552 10 หลายเดือนก่อน

    seeing this and thinking google audio recorder transcript and gemma

  • @jessedbrown1980
    @jessedbrown1980 10 หลายเดือนก่อน

    interested!

  • @harisonfekadu
    @harisonfekadu 10 หลายเดือนก่อน

    👏👏

  • @alvintohw
    @alvintohw 10 หลายเดือนก่อน

    Please publish as an Android app too!

  • @JohnSteiger-ey9bi
    @JohnSteiger-ey9bi 10 หลายเดือนก่อน

    I don’t do the ‘TH-cam’ other than to watch. I liked. Subscribed. And now I am kindly asking you how can I give you money?
    What you have here can help so many people. I eagerly await for this blessing to come to fruition.

    • @AIJasonZ
      @AIJasonZ  10 หลายเดือนก่อน

      hah thanks bro!

  • @RealLexable
    @RealLexable 10 หลายเดือนก่อน

    Horrorfying😮 Terminator has arrived i guess. Better to late than never.

  • @arixerchan3807
    @arixerchan3807 10 หลายเดือนก่อน

    this is what the deaf waiting for long time👍🏻

    • @AIJasonZ
      @AIJasonZ  10 หลายเดือนก่อน

      true!

  • @teensounds
    @teensounds 10 หลายเดือนก่อน

    what if interviewer ask to share the screen😅

    • @NatGreenOnline
      @NatGreenOnline 10 หลายเดือนก่อน

      If you get a teleprompter like the Elgato Prompter it acts as a 2nd monitor so the other person will never see it, plus you can be looking directly into the camera (while reading the info) at the same time so it looks super natural!

  • @Ho-Lee-Chit_Fu-Kin-Fast
    @Ho-Lee-Chit_Fu-Kin-Fast 10 หลายเดือนก่อน

    I will only do AI on my mobile if I can use it in Airplane mode.

    • @AIJasonZ
      @AIJasonZ  10 หลายเดือนก่อน

      this model load locally so yes it works in airplane mode!

  • @jaredlalal
    @jaredlalal 10 หลายเดือนก่อน

    Ok so what if i want to use this instead to argue with ppl and win every debate always forever. I gotta grind them TH-cam comment wins or something

  • @Generouslife153
    @Generouslife153 10 หลายเดือนก่อน

    I’ll find anyone who is serious about building a ai call software

  • @Mr.JOG-
    @Mr.JOG- 10 หลายเดือนก่อน

    just make sure you throw in a "right" every 6 to 9 words and your interviewer will never know your full of shit and not reading.

  • @bloomflora1105
    @bloomflora1105 10 หลายเดือนก่อน

    hahaha so funny

  • @cutthecheck
    @cutthecheck 8 หลายเดือนก่อน

    I'm high

  • @laif9857
    @laif9857 10 หลายเดือนก่อน +1

    30 sec. and i find so pathetic the use that some people give to the tools , faking an interview , if you suck at work pls , dont do an interview imgonna fired you after a month , why you are gonna lie for a month of pay , if you suck at one job maybe you can spend the improving your skills , but young people of this days really Suck so badly

  • @ComicMasta
    @ComicMasta 10 หลายเดือนก่อน

    Thanks!

    • @AIJasonZ
      @AIJasonZ  10 หลายเดือนก่อน +1

      thank you sir!