-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
What model does this repo use? #1
Comments
We are using |
Okay, I'm quite curious about this. Then how do you use the generated output to allign back to the original video timestamps and then synthesize the result video? |
Hi @wong251440 To extract transcription from video we use Whisper AI model and its one of the most advanced open source model for the task(SpeechToText).Its also provides us the exact timestamps of the utterance's from video. We use python libraries moviepy and opencv for video editing using the processed transcript by gpt-4-turbo(llm).
|
What model does this repo use to condense the original video transcript, and then create a short script??
The text was updated successfully, but these errors were encountered: