Automatic voice recognition / text OCR / attachment-to-text preview with sist2 #4291
Comments
Hey @finch71, nice feature to work on. Let me know if you need some help.
I can set up sist2. It works nicely through its own GUI (it can also be started from the CLI), and I can see the file paths in its SQLite database, but I don't know how to program in Go. Ideally, one could get all this information from the external SQLite database, or by querying the sist2 GUI via HTTP requests (which may require authentication).
So sist2 is considered an attachment content indexer, isn't it? And Whisper is in the loop only because sist2 can't handle audio transcription.
Yes, but there is no standardized API for "any attachment to text" the way there is an OpenAI API. However, many programs rely on such libraries and have to bundle them, along with huge model files, into the program itself; for example, sist2 and memos.
But memos positions itself as a self-hosted solution, so I don’t think it should embed an LLM. That’s why I’m suggesting a plugin.
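To make the plugin idea above concrete, here is a minimal sketch in Python of how a host application could hand attachments to external extractors by MIME type. Everything in it is an assumption for illustration: `ExtractorRegistry`, the `Extractor` signature, and the stand-in lambdas are hypothetical and are not part of memos, sist2, or Whisper.

```python
from typing import Callable, Dict, Optional

# Hypothetical plugin signature: takes a file path, returns extracted text.
Extractor = Callable[[str], str]


class ExtractorRegistry:
    """Maps MIME type prefixes to external text extractors (all names hypothetical)."""

    def __init__(self) -> None:
        self._extractors: Dict[str, Extractor] = {}

    def register(self, mime_prefix: str, extractor: Extractor) -> None:
        self._extractors[mime_prefix] = extractor

    def extract(self, path: str, mime_type: str) -> Optional[str]:
        # Prefer the longest matching prefix so "audio/wav" beats "audio/".
        for prefix in sorted(self._extractors, key=len, reverse=True):
            if mime_type.startswith(prefix):
                return self._extractors[prefix](path)
        return None  # No plugin handles this type; the host shows no preview.


# Stand-in extractors; a real plugin might call Whisper or query sist2 here.
registry = ExtractorRegistry()
registry.register("audio/", lambda path: f"[transcript of {path}]")
registry.register("image/", lambda path: f"[OCR text of {path}]")
```

With this shape the host ships no model files itself; whoever self-hosts decides which extractors to register, if any.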
Describe the solution you'd like
Automatic voice recognition, text OCR, and attachment-to-text preview.
When a user uploads an audio file, automatically transcribe it with Whisper.
Type of feature
User Experience (UX)
Additional context
Use https://github.com/sist2app/sist2 as an attachment analyzer, and read its SQLite index file.
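As a rough illustration of reading an external index file, the Python sketch below looks up extracted text by file path using only the standard `sqlite3` module. The table `document` and its columns `path` and `content` are hypothetical placeholders: the actual sist2 schema is not shown in this thread and would need to be inspected first.

```python
import sqlite3
from typing import Optional


def open_index(db_file: str) -> sqlite3.Connection:
    # Open read-only so the host never modifies the indexer's database.
    return sqlite3.connect(f"file:{db_file}?mode=ro", uri=True)


def lookup_text(conn: sqlite3.Connection, path: str) -> Optional[str]:
    """Return the indexed text for a file path, or None if it is not indexed.

    ASSUMPTION: a table `document(path TEXT, content TEXT)`; the real
    sist2 schema may differ.
    """
    row = conn.execute(
        "SELECT content FROM document WHERE path = ?", (path,)
    ).fetchone()
    return row[0] if row else None
```

Opening the file read-only keeps the indexer as the single writer; the host only needs to map each attachment to the path the indexer saw.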