Audio and Video Transcription

Hi all,

I store many audio and video files with useful information in my Nextcloud. I occasionally manually create transcriptions of them using OpenAIā€™s Whisper model running on my laptop. It would be awesome if I could create a workflow with Nextcloud where audio and video files uploaded to a particular folder could trigger a transcription service to run (ex. OpenAIā€™s Whisper with parameters set by the user). It would also be awesome if the text could be accessed in the search function. While this is a bit of a niche use case, itā€™d help me out quite a bit :slight_smile:

What do you think of this idea?

Thanks,
Martin