ollama-voice

Plug whisper audio transcription to a local ollama server and ouput tts audio responses

This is just a simple combination of three tools in offline mode:

Prerequisites

whisper dependencies are setup to run on GPU so Install Cuda before running pip install.

Install the packages python3-pyaudio, portaudio19-dev and espeak on your distribution

Install ollama and ensure server is started locally first (in WLS under windows) (e.g. curl https://ollama.ai/install.sh | sh)

Configure assistant.yaml settings. (It is setup to work in french with ollama mistral model by default...)

Run assistant.py

Leave space key pressed to talk, the AI will interpret the query when you release the key.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
assistant-ui.py		assistant-ui.py
assistant.png		assistant.png
assistant.py		assistant.py
assistant.yaml		assistant.yaml
build_ollama_docker.sh		build_ollama_docker.sh
requirements.txt		requirements.txt
run.sh		run.sh