Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transcribe audio using huggingface #1718

Conversation

neuralsignal
Copy link
Contributor

Background
Addition of a audio transcriber command that uses the huggingface inference API to call an audio to text model.

Changes
Added a new python file called audio_text.py that contains the read_audio function, which call the huggingface API.
Made changes to prompt.py, the env template, config.py, and app.py to include the read_audio function and make it functional.

Documentation
The changes are implemented as just in code comments.

Test Plan
I tested the changes by added various audio samples to the working directory. Then I asked the agent to transcribe the audio samples and compare the transcribed stories to each other.

PR Quality Checklist

  • My pull request is atomic and focuses on a single change.
  • I have thoroughly tested my changes with multiple different prompts.
  • I have considered potential risks and mitigations for my changes.
  • I have documented my changes clearly and comprehensively.
  • I have not snuck in any "extra" small tweaks changes

autogpt/prompt.py Outdated Show resolved Hide resolved
@BillSchumacher BillSchumacher merged commit 4870356 into Significant-Gravitas:master Apr 16, 2023
sindlinger pushed a commit to Orgsindlinger/Auto-GPT-WebUI that referenced this pull request Sep 25, 2024
…audio_huggingface

Transcribe audio using huggingface
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants