I turned Linux into Jarvis (welcome to cyberpunk)

แชร์
ฝัง
  • เผยแพร่เมื่อ 4 ธ.ค. 2024

ความคิดเห็น • 215

  • @bugswriter_
    @bugswriter_  10 หลายเดือนก่อน +24

    Thanks Tobias Frisch / TheJackiMonster
    for letting me know about this tool.
    - thejackimonster.de/blog

    • @vaisakh_km
      @vaisakh_km 10 หลายเดือนก่อน

      for nixos, it's packaged as `piper-tts` and voices as `services.wyoming.piper.servers..voice`
      i am surprised that it's not in aur.. or is it?

  • @tedgoddamncruz5277
    @tedgoddamncruz5277 10 หลายเดือนก่อน +93

    I am not prepared for "keyboard is bloat"

  • @Hacker097
    @Hacker097 10 หลายเดือนก่อน +141

    Honey, its 4:30 am, time for your daily bugswriter content

    • @rd_626
      @rd_626 10 หลายเดือนก่อน +5

      He posting is american schedule

  • @FassihFayyaz
    @FassihFayyaz 10 หลายเดือนก่อน +108

    Now imagine using this with a local LLM and embeddings for unlimited memory its going to be huge.

    • @ckpioo
      @ckpioo 10 หลายเดือนก่อน +4

      thats what im doing, im currently in process of making a package similar to memgpt but with more agents + out of the box support for agents + some custom agents of my own and alot of customization made easy for developers

    • @aniksamiurrahman6365
      @aniksamiurrahman6365 10 หลายเดือนก่อน +3

      But how do you train a local LLM? How do you update?

    • @ckpioo
      @ckpioo 10 หลายเดือนก่อน +2

      @@aniksamiurrahman6365 you can't train a local llm, and why would u need to train it locally anyways?, updating is easy it's a drop and replace most the time

    • @FassihFayyaz
      @FassihFayyaz 10 หลายเดือนก่อน +2

      @@ckpioo good luck ❤

    • @aniksamiurrahman6365
      @aniksamiurrahman6365 10 หลายเดือนก่อน +1

      @@ckpioo So its just chatGPT/Bard/Llama under the hood.

  • @bruh-zy1dp
    @bruh-zy1dp 10 หลายเดือนก่อน +70

    10:12 Shots fired at mental outlaw

    • @anupamyedida5484
      @anupamyedida5484 10 หลายเดือนก่อน +16

      He just reads news lol🤣🤣🤣

    • @ARCISX
      @ARCISX 10 หลายเดือนก่อน +1

      lmaoooo

  • @LeoLegit
    @LeoLegit 10 หลายเดือนก่อน +16

    Jarvis, delete my system

    • @tanyawheng7943
      @tanyawheng7943 9 หลายเดือนก่อน +2

      J: thank you for ending my suffering

  • @sahilsharma2867
    @sahilsharma2867 10 หลายเดือนก่อน +30

    my heartbeat raises when seeing such great creativity possible in linux

    • @sporus3700
      @sporus3700 10 หลายเดือนก่อน +1

      That's huge bro 😮

  • @hackerman.1337
    @hackerman.1337 10 หลายเดือนก่อน +67

    Now use mistral or mixtral for a local LLM

    • @Tyrone-Ward
      @Tyrone-Ward 10 หลายเดือนก่อน +1

      Llama 2

    • @SairajKhope2809
      @SairajKhope2809 9 หลายเดือนก่อน +1

      Dolphin-2.5 is the best

  • @danlscan
    @danlscan 10 หลายเดือนก่อน +5

    This is among the coolest of things I've seen. Good job and your enthusiasm is a joy to witness.

  • @rogsiel
    @rogsiel 10 หลายเดือนก่อน +6

    Holy, I have thought about this for a long time, it's amazing to see someone actually do it. GJ mate

  • @DavidCastillaGil
    @DavidCastillaGil 10 หลายเดือนก่อน +13

    Man you are awesome! I have learned a lot of linux scripting from you, but mostly, you make me want to learn more every time, you make my child curiosity kick back in. I love you, nohomo

  • @n0kodoko143
    @n0kodoko143 10 หลายเดือนก่อน +18

    This is a really great showing of putting she creativity and several programs together! 🔥🔥🔥

  • @gaz7702
    @gaz7702 10 หลายเดือนก่อน +8

    just use "whisper tiny or base models" , they are much much accurate even with noise in surroundings.
    also use "porcupine" for wake word detection (very fast and accurate) , instead of typing command every time

  • @AkkarisFox
    @AkkarisFox 10 หลายเดือนก่อน +25

    The only reason why I will ever use Linux: an AI can jury rig all of the software I need for me because Linux is a hellscape for ADHD

  • @famaterial
    @famaterial 10 หลายเดือนก่อน +5

    Looking forward to what you come up with. This is a game changer

  • @linusgoblin
    @linusgoblin 9 หลายเดือนก่อน +2

    Thanks! I used your explanation to make myself a script binded to a hotkey, i implemented the visualizer cava to add a bit of charm to the voice. Pretty cool!

  • @theNullBlocks
    @theNullBlocks 10 หลายเดือนก่อน +10

    Last pieces will be having local LLM agent , and passing/choose files using voice .

  • @egoworks5611
    @egoworks5611 10 หลายเดือนก่อน +3

    ahhahahah automate mental outlaw

  • @rusty39939
    @rusty39939 10 หลายเดือนก่อน +23

    mental outlaw just read news 💀💀

  • @SupaShang
    @SupaShang 8 หลายเดือนก่อน +1

    Aw! I'm so going to play with this. Nice video.

  • @boltez6507
    @boltez6507 9 หลายเดือนก่อน +3

    Nice vid like everytime but i have a suggestion,please clearly mention the all the tools you used to get the thing working in the starting,it took me quite some time to figure out that you were using vosk for voice to text.

    • @3gsahil
      @3gsahil 9 หลายเดือนก่อน

      I'm stuck there too ! Could you elaborate me on the configuration??

  • @sortextheguy
    @sortextheguy 10 หลายเดือนก่อน +5

    You can integrate this with optional voice to text, so you can make a deamon and a cli to communicate with it, so you setup one keybind to voice to text, and another for a rofi prompt or something like that

    • @sortextheguy
      @sortextheguy 10 หลายเดือนก่อน +2

      The daemon maybe is too much but there can be 2 scripts calling these programs with different configurations for the same effect

  • @somedudeonyoutubefrfr
    @somedudeonyoutubefrfr 10 หลายเดือนก่อน +1

    This is great content! Definitely helpful for my further projects :D

  • @SamuTheFrog
    @SamuTheFrog 10 หลายเดือนก่อน

    @4:15 "If you are smart enough", sir, im not smart, but i definitely see where you're going

  • @mdforbes500
    @mdforbes500 9 หลายเดือนก่อน +2

    If you include a voice auth system that authorizes a given user for commands and generates a token, then you can have user privileges. its basically a user-detection system, and then your command libraries are more safe. i'd also, for now, use the "windows/macosx/penguin" command key be the "push-to-talk" key, and then you can move that to a peripheral. star trek comms badge, anyone?
    Brilliant work.

  • @U2VidWVz
    @U2VidWVz 10 หลายเดือนก่อน +1

    I like your attitude and enthusiasm, keep up the good work. :)

  • @GospodinStanoje
    @GospodinStanoje 10 หลายเดือนก่อน +1

    WOW DUDE, THAT IS INSANE! When I've heard your enthusiasm I was expecting to be disappointed, but daaaamn. Thank you for your videos, it's content dense and very helpful!
    Greetings from Serbia!

  • @crimiusXIII
    @crimiusXIII 10 หลายเดือนก่อน

    Absolutely fucking wonderful. I have so many use cases for this kind of library and script. There's so many little things you could abstract away into commands, tons of building blocks that could be made, not to mention the glory of a truly personalized AI companion. So cool.

  • @RazoBeckett.
    @RazoBeckett. 10 หลายเดือนก่อน +1

    bro this was awesome, i also have few ideas to work on. Thanks

  • @shiverello6109
    @shiverello6109 9 หลายเดือนก่อน +1

    Great idea to combine these tools and good tool recommendations in the first place. As others here suggested I might attempt this with Mixtral as well^^ I might need a better PC though

  • @BettersonMcgee
    @BettersonMcgee 9 หลายเดือนก่อน +1

    this is something I've been wanting to do for a while! I'm nowhere near as experienced but what I wanted to accomplish was to have a similar outcome, but I wanted the AI to be a locally hosted one. Bonus points for if it automatically saves data from my queries and can trains itself off those so it has a sort of memory. Not even sure where to start on that last part but your video has revitalized my spark to continue thinking about it!

  • @tkenben
    @tkenben 10 หลายเดือนก่อน +1

    Great tool I didn't know about. I'm thinking of building a sort of Jarvis for an elderly family member. She's getting to the point where any kind of automation would be a quality of life improvement, and to be honest, she's scared of computers - will, not *of* them, but of the interface. She knows how to use a smart phone but not a keyboard and mouse :) Thanks for pointing this out.

  • @Timely-ud4rm
    @Timely-ud4rm 10 หลายเดือนก่อน +1

    Me and you are the same. I love futuristic things and currently I am trying to customize hyprland to just be like that. this is amazing keep up the good work. also you move so fast, like you type and click so fast that I was surpised one second your typing something and 2 milli seconds later you're in the terminal and have going through 15 directory's.

  • @pekotofo2522
    @pekotofo2522 10 หลายเดือนก่อน +2

    OMG that's perfect! This is what I want my computers to do and it's so obvious, I don't understand why LLM's hasn't been utilized like this from the get go, but now I know how, thanks BugsWriter!

  • @laniusdev
    @laniusdev 10 หลายเดือนก่อน +1

    Ok, that is really a banger. It gives so many ideas for great quality of life things you can make with it. :D

    • @bugswriter_
      @bugswriter_  10 หลายเดือนก่อน +2

      Glad you like it!

    • @bugswriter_
      @bugswriter_  10 หลายเดือนก่อน +2

      You look good 👍

  • @voi__wood5508
    @voi__wood5508 10 หลายเดือนก่อน

    appreciate all the work you did, very cook. bro I really NEED this wallpaper, ricing my setup now and this wallpaper would fit nice

  • @KrazyKaiser
    @KrazyKaiser 9 หลายเดือนก่อน +1

    This is the kind of thing I want to mix with the home automation stuff I thinker in.

  • @khaledalshammari857
    @khaledalshammari857 10 หลายเดือนก่อน +1

    This is exactly what i was thinking about but never figured out how to do it because im a noob, thanks!

  • @VictoriaMan69
    @VictoriaMan69 10 หลายเดือนก่อน +1

    This is one of the coolest things I have ever seen. Please make some videos based on this with some ideas. I have made some wacky shit so far. :)

  • @Purplemid
    @Purplemid 10 หลายเดือนก่อน +1

    Thanks for the video man.

  • @gand0rfTRZ
    @gand0rfTRZ 10 หลายเดือนก่อน +1

    I love this type of stuff!! My wife is going to hate me this weekend. Also..
    At least Debian systems can just work after the install, and don't take half a day to finish setting up.
    Love the videos!! Can't wait to see what else you do with piper.

  • @realtimestatic
    @realtimestatic 10 หลายเดือนก่อน +3

    Linux user asks ChatGPT why people still use Windows, gets upset when AI tells him why people still use windows. Tbh I thought about this the moment LLMs started becoming really good. Getting a local LLM and integrating it deeply into the system to assist with all sorts of things would be amazing. I'm surprised that I haven't seen anyone try to package these tools together to make them easy to set up and use on Linux but I think the limitless potential is amazing. Also it'd be really cool to sync it to a phone and have basically your own jarvis like assistant always with you.

  • @Axenide
    @Axenide 10 หลายเดือนก่อน +2

    This is awesome. I made something like this using ollama and whisper, but I used a really old voice synthetizer. I will give Piper a try. :)

  • @HydraRosario
    @HydraRosario 8 หลายเดือนก่อน

    BRO you are a MENACE, keep it up, you're fire

    • @HydraRosario
      @HydraRosario 8 หลายเดือนก่อน

      Do you have any progress regarding this or a github repository?

  • @VidalIC
    @VidalIC 10 หลายเดือนก่อน

    The "so not good, she was just bullshiting" got me hahaha you are great.

  • @NotoriousArnav
    @NotoriousArnav 10 หลายเดือนก่อน +1

    Let's make "A.D.I.T.Y.A" a reality then

  • @v1d300
    @v1d300 10 หลายเดือนก่อน +2

    Bugs, I am here for the ❤. Thank you and have a great day!

  • @Kwadster
    @Kwadster 10 หลายเดือนก่อน

    the keyboard sounds are perfect

  • @PSELMASTER
    @PSELMASTER 10 หลายเดือนก่อน +2

    Cool video, as always
    The possibilities are endless! The only drawbacks I'm seeing is having to go through, and be dependent on OpenAi, for all the data has to pass through it (delay and privacy) and the fact that we're still limited by the model's barriers and character limit.
    It would be great if you we really could create are own ML/AGI models tailored to us/our needs, that would evolve, learn, but locally, maintaining our privacy.

    • @trevorroddy3773
      @trevorroddy3773 10 หลายเดือนก่อน +2

      it's very possible! FastAI is the easiest to pickup python library for deploying local models that I know about. It's not gonna be as generally useful as GPT though. It's a pipe dream for me too, but seems to rapidly be getting more approachable

  • @kobeneilson6717
    @kobeneilson6717 10 หลายเดือนก่อน +1

    That’s super cool, bro send commands so quick

  • @paradase1
    @paradase1 10 หลายเดือนก่อน +2

    Quite interesting. The only problem, I believe, is related to the delay that occurs. There is a method that uses the graphics card to change the voice, like w-okada/voice-changer. I don't know if you've heard of it, but one thing that could be added is something like a real-time translator.

  • @bearnaff9387
    @bearnaff9387 10 หลายเดือนก่อน

    Very nicely done. Marrying on-the-fly content generation with speech recognition and synthesis could lead to some very fascinating user interface designs. I feel like we're going to quietly see bes t practices for "intelligent audio" slowly emerge and no one is going to realize that there's a new "normal" in the world until it's done.

  • @sunberry9039
    @sunberry9039 10 หลายเดือนก่อน +2

    Disable capslock function and macro your inevitable creation to the same key.

  • @rudrapednekar1644
    @rudrapednekar1644 10 หลายเดือนก่อน +1

    Amazing Project!!

  • @rinzle3r
    @rinzle3r 10 หลายเดือนก่อน +1

    Great video and we are very closer to an megaman network future

  • @lyszt
    @lyszt 9 หลายเดือนก่อน +1

    The fact you can run it locally makes it better than ElevenLabs

  • @zoyW3301
    @zoyW3301 9 หลายเดือนก่อน +1

    Great video!

  • @ALulzyApprentice
    @ALulzyApprentice 10 หลายเดือนก่อน +1

    This is amazing! Thanks!

  • @stevenlaczko8688
    @stevenlaczko8688 10 หลายเดือนก่อน +1

    Use SGPT (cli tool) and use the -s flag to get a shell command output. So you can ask it to do something and actually get a command to run.

    • @bugswriter_
      @bugswriter_  10 หลายเดือนก่อน +1

      I know. I had this idea for very long.

  • @PriyanshuAman-dn5jx
    @PriyanshuAman-dn5jx 8 หลายเดือนก่อน +1

    Hey i think the problem is llm’s you need an llm that can break down what you really wanna do

  • @sauraabh
    @sauraabh 10 หลายเดือนก่อน +1

    Dude, you are my inspiration for becoming tinkerer..❤
    And you deserve much more subs and likes!

  • @Sandrodeveloper
    @Sandrodeveloper 4 หลายเดือนก่อน +2

    What vosk model do you use?

  • @Tanvir1337
    @Tanvir1337 10 หลายเดือนก่อน +1

    i was expecting either linus's or stallman's voice

  • @sebastianrussian8415
    @sebastianrussian8415 7 หลายเดือนก่อน +1

    i liked your environment. Are you using zsh or something else? Can you explain how to make it look like that?

    • @bugswriter_
      @bugswriter_  7 หลายเดือนก่อน +1

      I have a video on zsh

  • @logicalkarma3314
    @logicalkarma3314 10 หลายเดือนก่อน +2

    .NET already has a speech synthesizer class, although Piper has better audio.

  • @csori1075
    @csori1075 10 หลายเดือนก่อน +1

    BRO THIS IS SICK

  • @johannes7856
    @johannes7856 10 หลายเดือนก่อน +1

    Bro ist faster than light when typing xD

  • @kjcao_
    @kjcao_ 10 หลายเดือนก่อน +1

    This is very cool!

  • @rajmajumdar5253
    @rajmajumdar5253 10 หลายเดือนก่อน +2

    Pretty cool, but where can I find the `voice_to_text` ?

    • @Oszku
      @Oszku 10 หลายเดือนก่อน +1

      same question

  • @thiagolopes4978
    @thiagolopes4978 10 หลายเดือนก่อน +1

    Loved it!

  • @crusader_
    @crusader_ 10 หลายเดือนก่อน +1

    I wish there were more voice models, precisely more popular voices like friday or jarvis

  • @soumyadrip
    @soumyadrip 10 หลายเดือนก่อน +3

    what keyboard are you using?

  • @bonquaviusdingle5720
    @bonquaviusdingle5720 10 หลายเดือนก่อน +3

    I really want a Japanese voice assistant who tries to speak English.

  • @chillout7984
    @chillout7984 10 หลายเดือนก่อน +1

    Silero tts also pretty good and you can run it locally

  • @badalyadav3822
    @badalyadav3822 10 หลายเดือนก่อน +1

    Great content as usual.

  • @varunrmallya5369
    @varunrmallya5369 10 หลายเดือนก่อน +1

    THATS FUCKING IMPRESSIVE

  • @vilijanac
    @vilijanac 10 หลายเดือนก่อน +1

    Wow, this is super cool.

  • @bowiemtl
    @bowiemtl 10 หลายเดือนก่อน +2

    Pretty cool! I like building tools like this too but the chances I'd actually use this are slim.

    • @xDeFc0nx
      @xDeFc0nx 10 หลายเดือนก่อน +1

      This is a game changer for people who can't see, are color blind, or the rest. Many uses.

    • @aaronspeedy7780
      @aaronspeedy7780 10 หลายเดือนก่อน

      One use case that I could think of would be in learning a new language.

    • @bowiemtl
      @bowiemtl 10 หลายเดือนก่อน

      @@xDeFc0nx I'm not discrediting the use of this. There have been attempts at accessibility in the form of screen readers, different controllers, etc. and I definitely can see this being integrated into more operating systems in the near future. Heard some buzz about samsung integrating an ai feature into android for example, it's just that personally I'm well versed with a keyboard and see myself being more efficient with it in most day to day applications (for now that is)

    • @xDeFc0nx
      @xDeFc0nx 10 หลายเดือนก่อน

      @@bowiemtl I'm the same, I just use this for cool stuff

  • @blackedmirror5073
    @blackedmirror5073 10 หลายเดือนก่อน +1

    Is the thumbnail art sourced from somewhere? I like it a lot

  • @agent_artifical
    @agent_artifical 9 หลายเดือนก่อน +1

    very nice one!

  • @committedcoder3352
    @committedcoder3352 10 หลายเดือนก่อน +1

    I wonder if there’s a way to give it access to what you’re seeing so if you’re programming in vim you could ask project questions or if there’s a complication or runtime error ask what it thinks of it. That would be cool

  • @fgjsgffdjsdjgfsd3112
    @fgjsgffdjsdjgfsd3112 10 หลายเดือนก่อน +1

    I think espeak could work well if you could install a better voice. I just don't know where to find them.

  • @JavierHarford
    @JavierHarford 10 หลายเดือนก่อน +1

    I can see this being a really powerful plugin to Nushell, which can handle data output in a much more tabular format, also, it would be powerful with agents embedded with certain tools. Do you have any open source intentions for this kind of work flow?

  • @AnshumanSingh-cp8fv
    @AnshumanSingh-cp8fv 10 หลายเดือนก่อน

    waiting for more cool stuff you do

  • @ItsKingMyles
    @ItsKingMyles 10 หลายเดือนก่อน +2

    ive been looking for ways to train custom voice models like this.

  • @QuestionTheTruth
    @QuestionTheTruth 10 หลายเดือนก่อน +1

    Do you have a proper installation guide? You should make a bash script and have that be opened and started with a macro or key command.

  • @devops-sushi5534
    @devops-sushi5534 10 หลายเดือนก่อน

    Nice Vid! May i ask your background source?

  • @aquadap219
    @aquadap219 9 หลายเดือนก่อน +1

    i love how happy you are lol

  • @thenarrowgate3063
    @thenarrowgate3063 9 หลายเดือนก่อน +1

    Which VTT are you calling with $(voice_to_text)?

    • @thenarrowgate3063
      @thenarrowgate3063 9 หลายเดือนก่อน

      Dude i appreciate the love, but how about a response 👍

  • @pontifex_max
    @pontifex_max 10 หลายเดือนก่อน +1

    What is your terminal font it's super pretty

    • @bugswriter_
      @bugswriter_  10 หลายเดือนก่อน +1

      jetbrains mono

  • @ckpioo
    @ckpioo 10 หลายเดือนก่อน

    what im planning on doing is having multiple spearkers (with mic) throughout my house which i can connect to with bluetooth, i can have a small rasberrypi connected to them via bluetooth, and rest you already guessed it probably 💀

  • @filipepinho3319
    @filipepinho3319 10 หลายเดือนก่อน +1

    What I was always looking for :P turn my computer in my personal voice assistance...

  • @nekoeko500
    @nekoeko500 9 หลายเดือนก่อน +1

    So tgpt runs in someone's else computer and piper can run on a pi, so... could this setup be replicated on a smartphone, using termux?

  • @ernststravoblofeld
    @ernststravoblofeld 10 หลายเดือนก่อน +1

    The biggest problem i see is, ChatGPT costs over 30 cents per query. OpenAI is burning through enormous amounts of money, and they can't do it forever. They could go pay-as-you-go at any time, which takes it out of hobby territory. Maybe you could switch to something like Llama on a local server?

  • @monkeybum1984
    @monkeybum1984 10 หลายเดือนก่อน

    It's so impressive how fast you can navigate around your desktop and termial. What DE is that?

  • @nnisarggada
    @nnisarggada 10 หลายเดือนก่อน +1

    current wm and configs please?

  • @gaurav_jk
    @gaurav_jk 10 หลายเดือนก่อน +1

    now i realize that , i was running linux as windows😅

  • @wijiler5834
    @wijiler5834 10 หลายเดือนก่อน +1

    Jarvis, find this guys house

  • @factswithtruth6950
    @factswithtruth6950 10 หลายเดือนก่อน +1

    Hello sir, I am your big fan... Can you tell me where did you learn this linux knowledge and other stuff... In college or self... Please answer

    • @bugswriter_
      @bugswriter_  10 หลายเดือนก่อน +1

      laptop + internet

  • @bimmaku732
    @bimmaku732 10 หลายเดือนก่อน +1

    I actually made the same thing and made it as executable and just that i used gemini instead of chat gpt, Name it chat cli, and offer used to use any LLM they want, using local language models will cost performance so your wish there