Whisper iOS transcription app #443
Replies: 25 comments 58 replies
-
Awesome!
-
@bjnortier, is it possible to implement the iOS app based on whisper.tflite (the 40 MB model weights are in int8)? Refer to this to run inference in Python, and to the README to run inference in C++. To run tflite files on iOS, refer to the link below.
-
Hi, could you also add options to tweak the transcription settings, for example temperature, initial_prompt, etc.?
-
This is pretty awesome. Thanks for making this! I assume you're using the English-only tiny.en model? Just wondering, for us multilinguals: the models perform quite well in some other languages too.
-
Awesome project! I was looking for something exactly like this. I think multi-language support, as well as at least the medium model, would be necessary to make this really useful. Also, if you added in-app audio recording and live transcription, it would easily be the best transcription app in the App Store.
-
Congrats on this amazing project, @bjnortier! This is so much better than current mobile transcription apps! Are you planning to implement multi-language support (e.g. German) and in-app features? I would love to support you with that, but unfortunately it's not my area.
-
@bjnortier this is really cool! Which language did you use, Objective-C or Swift? Also, did you do any preprocessing on the raw audio data collected by the app before feeding it into the model?
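For context on the preprocessing question: the author's answer is not part of this thread as captured, but Whisper models in general work on 16 kHz mono audio, and whisper.cpp takes the samples as floating-point values. A minimal sketch of the downmix and normalization step, in Python for brevity; the `to_mono_float` helper is a hypothetical name, resampling to 16 kHz is not shown, and none of this is claimed to be what the app does:

```python
from array import array


def to_mono_float(samples, channels):
    """Convert interleaved 16-bit PCM to mono floats in [-1.0, 1.0)."""
    mono = []
    for i in range(0, len(samples) - channels + 1, channels):
        frame = samples[i:i + channels]
        # Average the channels of this frame, then scale by the int16 range.
        mono.append(sum(frame) / channels / 32768.0)
    return mono


# Two stereo frames of interleaved 16-bit samples: (L, R), (L, R)
pcm = array("h", [16384, 16384, -32768, 0])
mono = to_mono_float(pcm, channels=2)
```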
-
Love this, but mine seems to be crashing whenever I tap on the Transcribe button. I must be doing something wrong..?
-
This needs Apple Watch integration! It would be awesome if I could just talk into my watch from time to time and get a transcription of it on my phone later.
-
Hey, I love what you're doing here. I'm kind of new to coding, so I'm not too sure how this works, but I'm curious whether this app takes advantage of Apple silicon. I know this must be a lot of work, but here is a really cool suggestion: attach the audio file to its transcription, so you can scrub through the audio using the transcription. Also, add a search bar so you can search your personal library of transcriptions. Example: say I'm looking for the word "Jeremy". I search for it, and it highlights all the moments in the audio file where "Jeremy" was said; you can click one, and it takes you to exactly where it was said in the audio file (you could even have a broad search across all files!). These are only suggestions :)
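A rough sketch of the search idea above, in Python for brevity (this is not the app's code). It assumes each transcription is stored as a list of timestamped segments; the `Segment` type and the `find_word` helper are hypothetical names introduced only for illustration:

```python
import re
from dataclasses import dataclass


@dataclass
class Segment:
    """One timestamped piece of a transcription (hypothetical structure)."""
    start: float  # seconds from the beginning of the audio
    end: float
    text: str


def find_word(library, word):
    """Return (file name, start time) for every segment containing `word`.

    `library` maps a file name to its list of segments. Matching is
    case-insensitive and on whole words only.
    """
    pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
    hits = []
    for name, segments in library.items():
        for seg in segments:
            if pattern.search(seg.text):
                hits.append((name, seg.start))
    return hits


library = {
    "meeting.m4a": [
        Segment(0.0, 4.2, "Hi everyone, thanks for joining."),
        Segment(4.2, 9.8, "Jeremy will present the results."),
    ],
    "notes.m4a": [
        Segment(0.0, 3.5, "Remember to email jeremy tomorrow."),
    ],
}

hits = find_word(library, "Jeremy")
```

Each hit carries the start time of the matching segment, which is the position a player would seek to when the result is tapped.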
-
Any chance you could update the platforms so this could be run on an Apple silicon Mac? I run the original Python version via pipx, but I know a few people who'd be delighted to fiddle with it.
-
Hi, I have a question. We need to recognize a specific phrase and make the recognized words bold. Can we use Whisper for this?
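One way this could work is as a post-processing step: Whisper produces plain text, and the phrase is then marked up in that text. A minimal sketch in Python, assuming the transcript is already available as a string; `bold_phrase` is a hypothetical helper, and the `**` markers stand in for whatever bold markup the target UI uses:

```python
import re


def bold_phrase(transcript, phrase):
    """Wrap each case-insensitive occurrence of `phrase` in ** markers."""
    pattern = re.compile(re.escape(phrase), re.IGNORECASE)
    # Keep the original casing of each match inside the markers.
    return pattern.sub(lambda m: "**" + m.group(0) + "**", transcript)


text = "Please say open the door, then Open The Door again."
marked = bold_phrase(text, "open the door")
```

This only matches the phrase as Whisper spelled it, so a misrecognized word in the transcript would not be highlighted.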
-
This is a great app. Just curious: is there a way to make it differentiate between speakers?
-
Differentiation in this case would refer to speaker diarization, i.e. telling which speaker is saying what and splitting the transcription accordingly.
On 26 Jan 2023, 12:19 PM +0400, Ben Nortier wrote:
Thanks! The answer is no currently but could be added, just not sure what you mean by differentiate?
-
Thank you for this very practical application! Several suggestions:
-
The application crashed after upgrading to Pro ("Buy all Pro features") and turning on CoreML.
After reinstalling the app, I tried "Restore Purchases", but nothing happened; the download buttons appeared only after uploading and restarting the app. Steps to reproduce:
My proposal is to block turning on "Use CoreML" on iPhone models older than the iPhone 13.
-
Loving the quality of on-device transcription, thanks for making this! I have now tried to automate a few things. What I was hoping for was to seamlessly integrate Whisper into a shortcut and then use its output text in the next step, but the shortcuts don't seem to work on iOS 16.5: “Invalid action metadata”
-
By the way, kudos for getting CoreML to work. I run this on an iPhone 13 Pro and it's orders of magnitude faster than on the CPU alone. How is this even possible?
-
Can you add the option to translate to languages other than English?
-
Just installed Hello Transcribe on an iPhone X. It doesn't seem to work and I can't get any results: I only see a blue bar repeatedly going from left to right. Perhaps this app isn't compatible with the iPhone X? If so, it might be possible to disable downloading the app on unsupported devices in Apple's App Store.
-
Feature request for the Shortcuts integration: Would it be possible for the multilinguals among us to make a language selection when calling the Transcribe action from Shortcuts? The reason is the higher CoreML speed and efficiency with a language pre-selected.
-
What is http://hello-transcribe.b-cdn.net/ ? The app beaconed to that URL.
-
I'm looking for an app or shortcut for multilingual transcription with OpenAI Whisper v3 support on the iPhone 14 Pro Max. Does anyone know of one?
-
Will iOS 15 support ever be added?
-
This is based on the whisper.cpp implementation by @ggerganov. It uses the tiny model and all processing is done on-device.
It's in the App Store:
https://apps.apple.com/za/app/hello-transcribe/id6443919768