Transcribe audio using huggingface #1718

neuralsignal · 2023-04-15T21:29:14Z

Background
Adds an audio transcription command that uses the Hugging Face Inference API to call an audio-to-text model.

Changes
Added a new Python file, audio_text.py, containing the read_audio function, which calls the Hugging Face API.
Updated prompt.py, the env template, config.py, and app.py to include the read_audio function and make it functional.
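
For readers unfamiliar with the Inference API, here is a rough sketch of what a function like read_audio might look like. It is not the code from this PR: the signature, the helper names build_request and parse_transcription, the opener parameter, and the endpoint URL template are all assumptions made for illustration, and the model name and token are passed in by the caller because the PR text does not name them. The sketch assumes the API accepts raw audio bytes in a POST body and answers with a JSON object holding a "text" field.

```python
import json
import urllib.request

# Assumed endpoint layout for the hosted Inference API; not taken from the PR.
API_URL_TEMPLATE = "https://api-inference.huggingface.co/models/{model}"


def build_request(audio_bytes, model, api_token):
    """Build the POST request carrying the raw audio bytes (hypothetical helper)."""
    return urllib.request.Request(
        API_URL_TEMPLATE.format(model=model),
        data=audio_bytes,
        headers={"Authorization": f"Bearer {api_token}"},
        method="POST",
    )


def parse_transcription(raw_body):
    """Extract the transcript from the JSON response body (hypothetical helper)."""
    payload = json.loads(raw_body)
    # Error responses are assumed to lack a "text" field.
    if not isinstance(payload, dict) or "text" not in payload:
        raise ValueError(f"Unexpected response from the API: {payload!r}")
    return payload["text"]


def read_audio(audio_path, model, api_token, opener=urllib.request.urlopen):
    """Read an audio file from disk and return its transcription.

    The opener argument is an assumption added here so the network call
    can be replaced in tests; the real function may not have it.
    """
    with open(audio_path, "rb") as audio_file:
        audio_bytes = audio_file.read()
    request = build_request(audio_bytes, model, api_token)
    with opener(request) as response:
        return parse_transcription(response.read())
```

In a setup like the one described above, the token and model name would presumably come from config.py and the env template rather than being passed by hand, and the command entry in prompt.py would route the agent's request to read_audio.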

Documentation
The changes are documented only in code comments.

Test Plan
I tested the changes by adding various audio samples to the working directory. I then asked the agent to transcribe the audio samples and compare the transcribed stories to each other.

PR Quality Checklist

My pull request is atomic and focuses on a single change.
I have thoroughly tested my changes with multiple different prompts.
I have considered potential risks and mitigations for my changes.
I have documented my changes clearly and comprehensively.
I have not snuck in any "extra" small tweaks or changes.

neuralsignal added 3 commits April 15, 2023 23:19

Transcribing audio

9696fc6

Merge branch 'master' of https://github.com/gucky92/Auto-GPT

3239d68

Merge branch 'Significant-Gravitas:master' into transcribe_audio_hugg…

18168cc

…ingface

hdkiller reviewed Apr 15, 2023

autogpt/prompt.py (outdated, resolved)

neuralsignal and others added 4 commits April 15, 2023 23:53

change 'image' to 'file'

973e3c5

Merge branch 'transcribe_audio_huggingface' of https://github.com/guc…

572aedf

…ky92/Auto-GPT into transcribe_audio_huggingface

Merge branch 'Significant-Gravitas:master' into transcribe_audio_hugg…

fd82414

…ingface

Merge branch 'master' of https://github.com/Significant-Gravitas/Auto…

017371b

…-GPT into pr/1718

BillSchumacher approved these changes Apr 16, 2023

BillSchumacher merged commit 4870356 into Significant-Gravitas:master Apr 16, 2023

sindlinger pushed a commit to Orgsindlinger/Auto-GPT-WebUI that referenced this pull request Sep 25, 2024

Merge pull request Significant-Gravitas#1718 from gucky92/transcribe_…

756cfbb

…audio_huggingface Transcribe audio using huggingface
