Best Way to Transcribe Audio and Video with Python and Whisper-MLX ASR

  • Published on 29 Nov 2024

Comments • 9

  • @flyingzeppo
    @flyingzeppo 10 months ago +3

    I don't use Apple products, but I've used Whisper along with Python in the past. It's a great API. Also, I see that you've killed and beheaded the Abominable Snowman. Congrats. The villagers are safe now. 😊

    • @python-programming
      @python-programming 10 months ago +3

      HAHAHAHA! I was fortunate to meet a trader of hand-made Romanian masks in the Carpathian mountains and purchased it a few months back. It's now one of my favorite things from my travels. I'm thinking about making a non-Apple version of this video in the near future.

  • @alir8zana635
    @alir8zana635 9 months ago

    Thank you for your videos.
    You are really a great teacher.
    Your channel is a gem.

  • @critical-chris
    @critical-chris 5 months ago

    Interesting! I have been working with whisperX and whisper-timestamped on my MacBook so far and wasn't aware of MLX. Thanks for sharing! But since you emphasize the word-level timestamps: with standard Whisper those are known to be very inaccurate (i.e. pretty much unusable, since Whisper is simply not trained to predict timestamps). So, are you suggesting that timestamps in whisper-mlx are better?

  • @ersineser7610
    @ersineser7610 10 months ago +1

    Thanks for sharing.

  • @mrtn5882
    @mrtn5882 9 months ago

    Do you know which languages it speaks?

  • @andreamaral3537
    @andreamaral3537 9 months ago +1

    What about speaker diarization with mlx?

    • @python-programming
      @python-programming 9 months ago

      Not yet, but I suspect that will be added soon. MLX is covering a wide ground right now, but over the next few months I expect that they will go deeper with that wide net, both through their own work and community pull requests.