WhisperScript, an Electron desktop app GUI for Whisper #1028
Replies: 24 comments 42 replies
-
@jonathgh Are you going to make it cross-platform for Linux + Windows? |
Beta Was this translation helpful? Give feedback.
-
👉 UPDATE: WhisperScript now added support for MKV, MP4 and MOV Video Import! |
Beta Was this translation helpful? Give feedback.
-
Wow this is great! Subbed the topic, and waiting for a windows client :) |
Beta Was this translation helpful? Give feedback.
-
It would be great if you could add speaker recognition! i would get it right away! |
Beta Was this translation helpful? Give feedback.
-
Why is the paid feature for better whisper models? It should be paid for features you've written yourself. |
Beta Was this translation helpful? Give feedback.
-
We're excited to announce WhisperScript v1.2.1, an update to our Electron desktop Whisper implementation that introduces a lot of new features to speed up your transcription workflow. This update adds a bunch of improvements to the visualization, playback, editing, and exporting of your transcripts. Here's what's new in v1.2.1:
We hope that these new features will speed up your workflow and make it easier to edit and navigate your transcripts. We’re actively developing more, and we're excited to see how you'll utilize these features in your projects! Download the latest version here. The features above are only in the Pro version: https://getwavery.com We'd love to hear your thoughts and suggestions for future updates. You can reach us at [email protected]. Happy transcribing! |
Beta Was this translation helpful? Give feedback.
-
Might have to try it. BTW, I started playing around with Whisper in Docker on an Intel Mac, M1 Mac and maybe eventually a Dell R710 server (24 cores, but no GPU). Not sure you can help, but wondering about mutli-CPU and/or GPU support in Whisper with that hardware. It sounds like it might be partially possible, but NVIDIA GPU's are the only ones that are supported much. I want to integrate the thing into a medical IT application stack that I have, just using the Whisper API in the local build. I have an OpenAI API Key for testing also. |
Beta Was this translation helpful? Give feedback.
-
Totally agree. Find ways to speed up the larger models. That’s worth paying
for. Hey people hooked on the larger models, even though slow, and also
offer the smaller and faster models. Your service and application are your
selling points, not the language models.
On Sun, Mar 26, 2023 at 1:13 PM Oindril Dutta ***@***.***> wrote:
That's great, and I really appreciate those features.
But the larger models are still behind payment gates, it'd be better to
strip down the free experience but offer all models in the free version.
A free trial of a few days or # of transcriptions with the pro features
would also help you make sales. Along with an auto updater with constant
feature drip for pro users.
Pro feature ideas:
- try to find ways to improve the performance of the models, and sell
that in the free version - don't artificially slow down performance
- when dragging in a video, also show the video as you scrub through
the transcript and audio waveform
- after adding video add speaker diarization to try to automatically
label voices and faces if any in the UI
- after adding diarization make it easy to export all transcriptions
as time and person labelled subtitles to add them back to sites like
YouTube or movies or anything.
Point is, there're a bunch of features you can work toward putting
together to make this an increasingly valuable and enticing paid product -
I just don't think you should payment gate the larger models since it's
open source and not your work.
—
Reply to this email directly, view it on GitHub
<#1028 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAGW5A736VDBX25WQ4VFS43W6CILZANCNFSM6AAAAAAVPS7CDI>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
--
Jeffrey Duncan
|
Beta Was this translation helpful? Give feedback.
-
Hi! Just update to PRO version because I think there's value added for people like me without coding experience.. Just one feature I'm missing and that may be easy to implement (speaking with zero idea so enlighten me if otherwise) is to be able to decide where the models are downloaded. By default a new folder is created in Documents folder (I'm on Mac) so it's annoying to have a high level folder in there just for that. Thanks for your efforts with the app!! |
Beta Was this translation helpful? Give feedback.
-
Can this be updated to use https://github.com/guillaumekln/faster-whisper |
Beta Was this translation helpful? Give feedback.
-
I saw WAY faster speeds but also more hallucinations. I'm ok with slower,
if it means more accurate.
Jeffrey Duncan
…On Mon, Apr 24, 2023 at 12:07 PM becausereasons ***@***.***> wrote:
Up to 70x faster.
—
Reply to this email directly, view it on GitHub
<#1028 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAGW5A5I2MGI4AV5RHMZPC3XC253BANCNFSM6AAAAAAVPS7CDI>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
Is model large-v2 implemented or going to be implemented? Here's what that page says about it: "The large-v2 model on average shows about 5% relative error reduction in English and about 10% in other languages, but please note that it may behave differently depending on the individual audio and in some cases perform worse than large-v1." Here it is on GitHub it's also on Hugging Face |
Beta Was this translation helpful? Give feedback.
-
A few questions:
|
Beta Was this translation helpful? Give feedback.
-
@Veneration1 If you can get Subtitle Edit to work on a Mac, with Wine or something, it has a file where you can add automatic corrections: |
Beta Was this translation helpful? Give feedback.
-
Looks interesting . Glad to see a project got updated continually 👍🏻 May I ask what's the main differences between this and MacWhisper if you don't mind? 😁 BTW when it's working on Windows, I do hope it could supports GPU for speed-up processing. Thank you. |
Beta Was this translation helpful? Give feedback.
-
@shruru As mentioned previously in this thread, for a Windows implementation that utilizes the GPU, you might want to look into this: https://github.com/Const-me/Whisper/releases/ |
Beta Was this translation helpful? Give feedback.
-
Another Windows app is whispercppGUI https://github.com/Topping1/whispercppGUI |
Beta Was this translation helpful? Give feedback.
-
EDIT: We have now released a new version of WhisperScript, with several improvements, including a video player, batch processing and improved performance overall. We'd love to hear your thoughts on the new design! Join the discord to hear about our latest developments. Exciting news! We're back with an impactful update to improve your transcription experience, making it more efficient for language learners, subtitle creators, interview analysts, or those combing through their media libraries. We're introducing WhisperScript v1.3.4, and here's what's new: New in WhisperScript v1.3.4:
These features are available to Pro Users, but Lite Users will also get several UI improvements and bug fixes:
We’re still actively developing additional features, and we’d love to invite you to join the Discord to be a part of the process, suggest features and report bugs: https://discord.gg/b9TYCgC6 Download the latest version here: https://getwavery.com We'd love to hear your thoughts and suggestions for future updates. You can reach us at [email protected]. Happy transcribing! |
Beta Was this translation helpful? Give feedback.
-
Greate job! |
Beta Was this translation helpful? Give feedback.
-
GJ,can it be used for ytb or twitch live streaming? |
Beta Was this translation helpful? Give feedback.
-
I made a open-source alternative with basic features here. Feel free to fork, contribute, or do whatever you need! |
Beta Was this translation helpful? Give feedback.
-
Hi there, does it only transcribe, or it can do "translate" as well? |
Beta Was this translation helpful? Give feedback.
-
When windows? T.T |
Beta Was this translation helpful? Give feedback.
-
whisperscript-05-search-replace.movExciting update! WhisperScript v2.0 is here, re-written from the ground up with React and Typescript, with a whole new architecture and improved features. Check out what’s new in this release! Wavery Accounts:
What This Means for You:
Thank you for being part of the WhisperScript community! We’re thrilled to continue evolving the app with these improvements and more to come. For any questions or assistance, reach out at [email protected]. We’re here to help! WhisperScript v2.0 is designed to optimize your transcription workflow, whether you're working on interviews, media analysis, or multilingual transcription. As always, we appreciate your feedback and suggestions to help us keep improving! Download the latest version here: https://getwavery.com Have ideas or feedback? Reach out to us at [email protected] or join our community on Discord to connect with other WhisperScript users: https://discord.gg/b9TYCgC6. Happy transcribing! |
Beta Was this translation helpful? Give feedback.
-
Thanks to the work of @ggerganov and with inspiration from @jordibruin, @kai-shimada and I were able to implement Whisper in a desktop app built with the Electron framework. The app runs on Mac at the moment, but we hope that Electron will also allow for cross-platform compatibility in the future. You can download it here: Whisperscript
Currently, our features include:
We hope you enjoy it and let us know if there are any other features you would like to see. You can reach us at [email protected]
Beta Was this translation helpful? Give feedback.
All reactions