Splitting up long chunks of automatically transcribed text

แชร์
ฝัง
  • เผยแพร่เมื่อ 1 ต.ค. 2024

ความคิดเห็น • 12

  • @rygy409
    @rygy409 4 หลายเดือนก่อน +11

    A QoL feature might be highlighting the words on the right panel if they are included in the current clips on the timeline.

  • @distortions
    @distortions 4 หลายเดือนก่อน +4

    whats up whats up whats up

  • @CrossbowBeta
    @CrossbowBeta 4 หลายเดือนก่อน +1

    Next step: Editing the text and the video gets updated.

  • @gastonmaffei
    @gastonmaffei 4 หลายเดือนก่อน +1

    one thing I'd add is an LLM powered search bar to search for nuggets in natural language. So something like "show me where I talk about X", and it jumps to that timestamp

  • @RoryDavidWatts
    @RoryDavidWatts 4 หลายเดือนก่อน

    This is looking great

  • @flobuilds
    @flobuilds 4 หลายเดือนก่อน

    I miss the time where we played trackmania on the school computers. It was fun

  • @CrossbowBeta
    @CrossbowBeta 4 หลายเดือนก่อน

    So much fun statistics can be collected with this too. How many words spoken during a stream. Word occurrence histogram. Average word complexity. Then group the words together to identify a set of tags for the video. This is so cool.

  • @Dje4321
    @Dje4321 4 หลายเดือนก่อน

    Should highlight the current position in the text window to help with debugging to you know exactly what words are at which specific parts in the timelines

  • @jomojomo123
    @jomojomo123 4 หลายเดือนก่อน

    "is it a bug or is it just a little bit of a surprise" 😂😂

  • @bart2019
    @bart2019 4 หลายเดือนก่อน

    In your place, i'd try to use a text editing widget for the script text and use it in read-only mode. That may detecting which word a user clicked on easier because it's likely built in.
    Also, you need a way to search in the text, again, just like in a text editor.

  • @gvteja4908
    @gvteja4908 4 หลายเดือนก่อน

    Along with the text highlighting which others have mentioned, maybe change the audio based on text edits in the transcript, although this might be a very intense thing to do

  • @deprecated_channel_
    @deprecated_channel_ 4 หลายเดือนก่อน +1

    Hi,
    I really like your project and seeing you go through the dev process.
    I think it would be easier to track if the current spoken word would be in bold, or a different color. A bit like karaoke.
    And from what I understand, Whisper already gives you the ranges of each word.
    While playing the video, iterating through the list would be fast.
    And on seek, since the word & time range list is in a sorted order, doing a binary search should be pretty fast.