Hi everyone!
A few days ago I released Whishper, a new version of a project I’ve been working on for about a year now.
It’s a self-hosted audio transcription suite: you can transcribe audio to text, generate subtitles, translate subtitles, and edit them, all from one UI and 100% locally (it even works offline).
I hope you like it, check out the website for self-hosting instructions: https://whishper.net
Would love to deploy this, but unfortunately I’m running server equipment that apparently doesn’t support MongoDB 5 (error message: "MongoDB 5.0+ requires a CPU with AVX support, and your current system does not appear to have that!"). I tried deploying with both 4.4.18 and 4.4.6 and can’t get it to work. If anybody has recommendations, I’d appreciate hearing them!
Edit: Changed my Proxmox VM’s processor type to "host", which fixed my issue.
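For anyone hitting the same error: the fix above works because the "host" CPU type passes the physical CPU’s instruction-set flags (including AVX) through to the VM. A quick way to check whether your VM or NAS actually exposes AVX is to look at the flags in /proc/cpuinfo. This is a minimal Linux-only sketch, not part of Whishper itself:

```shell
# Check whether the CPU exposes the AVX flag (required by MongoDB 5.0+).
# On Linux, /proc/cpuinfo lists the instruction-set flags the kernel sees.
if grep -q -w avx /proc/cpuinfo; then
    echo "AVX supported: MongoDB 5.0+ should run"
else
    echo "No AVX: pin MongoDB to a 4.4.x image, or set the Proxmox CPU type to 'host'"
fi
```

If this prints "No AVX" inside the VM but your physical CPU does have AVX, the hypervisor is masking the flag and changing the virtual CPU type is the right fix.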
I’m glad you were able to solve the problem. Here’s the comment I left for another user with the same issue:
I didn’t know about this problem. I’ll try to add a MariaDB alternative as a database option soon.
I’ve been looking for a tool to do this for YEARS, my god! Years!!! ❤️❤️
Oh, awesome! Does it do speaker detection? That’s been one of my main gripes with Whisper.
Unfortunately, not yet. Whisper itself isn’t able to do that. There are currently few viable solutions for integration; I’m looking at this one, but all the current solutions I know about need a GPU.
This is excellent timing for me. I was just taking a break from working on setting up whisper.cpp with a web front end to transcribe interviews. This is a much nicer package than I ever had a chance of pulling together. Nice work!
Just tried this out but couldn’t get it to work until downgrading mongo to 4.4.6 because my NAS doesn’t have AVX support. But then, mongo stays unhealthy. No idea why.
I didn’t know about this problem. I’ll try to add a MariaDB alternative as a database option soon to solve this.
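In the meantime, the usual workaround is pinning the MongoDB service to a 4.4.x image in the compose file. A sketch of what that could look like (service name and healthcheck are assumptions, not Whishper’s actual compose file; note that older 4.4 images ship the legacy `mongo` shell rather than `mongosh`, so a healthcheck that calls `mongosh` can report unhealthy even when the database is fine):

```yaml
services:
  mongo:
    image: mongo:4.4  # last major version that runs without AVX
    healthcheck:
      # Use the legacy `mongo` shell, which the 4.4 images include.
      test: ["CMD", "mongo", "--eval", "db.adminCommand('ping')"]
      interval: 10s
      retries: 5
```

If mongo still shows as unhealthy after pinning, checking `docker inspect --format '{{json .State.Health}}' <container>` will show which healthcheck command is failing.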
This is a good speech-to-text converter and a good AI transcription service, too.