
OpenAI Realtime API support #869

Open
bachittle opened this issue Oct 7, 2024 · 8 comments

Labels
enhancement New feature or request

Comments

@bachittle

https://platform.openai.com/docs/guides/realtime

It uses WebSockets to stream audio to and from gpt-4o-realtime.
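
For anyone who wants to experiment before library support exists, here is a minimal connection sketch in Go. It is not part of go-openai; it assumes the wss://api.openai.com/v1/realtime endpoint and beta header from the guide linked above, uses gorilla/websocket for the transport, and the model query parameter is a placeholder for whatever realtime snapshot is current:

```go
package main

import (
	"fmt"
	"log"
	"net/http"
	"os"

	"github.com/gorilla/websocket"
)

func main() {
	// Endpoint and headers as described in the Realtime guide; the model
	// name here is a placeholder for the current preview snapshot.
	url := "wss://api.openai.com/v1/realtime?model=gpt-4o-realtime-preview"
	header := http.Header{}
	header.Set("Authorization", "Bearer "+os.Getenv("OPENAI_API_KEY"))
	header.Set("OpenAI-Beta", "realtime=v1")

	conn, _, err := websocket.DefaultDialer.Dial(url, header)
	if err != nil {
		log.Fatalf("dial: %v", err)
	}
	defer conn.Close()

	// The server pushes JSON events over the socket; read the first one
	// to confirm the connection works.
	var event map[string]any
	if err := conn.ReadJSON(&event); err != nil {
		log.Fatalf("read: %v", err)
	}
	fmt.Println("received event:", event["type"])
}
```

Audio is then exchanged as base64-encoded chunks inside these JSON events, which is why a WebSocket client (and hence a new dependency) is needed rather than plain net/http.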

@bachittle bachittle added the enhancement New feature or request label Oct 7, 2024
@anhao

anhao commented Oct 8, 2024

+1

@WqyJh
Contributor

WqyJh commented Oct 10, 2024

+1

@WqyJh
Contributor

WqyJh commented Oct 17, 2024

I want to add support for the GPT-4o-realtime model, which relies on WebSocket technology. This necessitates the use of a WebSocket library, but introducing such a dependency conflicts with the zero-dependency philosophy of the existing library, as I previously discussed with @sashabaranov.


As a result, I've decided to create a new library dedicated exclusively to GPT-4o-realtime. This library will serve as a complement to go-openai, focusing solely on supporting GPT-4o-realtime functionality.

The new library is called go-openai-realtime. Feel free to check it out!

https://github.com/WqyJh/go-openai-realtime

@sashabaranov
Owner

@WqyJh, thank you for your effort on this! I think websockets are a fair case to introduce a dependency to this library, and I would love to merge your changes if you'll decide to contribute 🙌🏻

@bachittle
Author

They just released support for audio input and output in the chat completions endpoint, using the gpt-4o-audio-preview model. This could be supported first in the meantime: https://platform.openai.com/docs/guides/audio/quickstart
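
For reference while library support is in flight, here is a rough sketch of the request shape from that audio guide, sent as raw JSON over net/http. The field names follow the linked docs rather than any go-openai types, and the voice/format values are just examples:

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"log"
	"net/http"
	"os"
)

func main() {
	// Chat completions request asking for both text and audio output,
	// per the audio guide linked above.
	body, err := json.Marshal(map[string]any{
		"model":      "gpt-4o-audio-preview",
		"modalities": []string{"text", "audio"},
		"audio":      map[string]string{"voice": "alloy", "format": "wav"},
		"messages": []map[string]any{
			{"role": "user", "content": "Say hello in one short sentence."},
		},
	})
	if err != nil {
		log.Fatal(err)
	}

	req, err := http.NewRequest("POST", "https://api.openai.com/v1/chat/completions", bytes.NewReader(body))
	if err != nil {
		log.Fatal(err)
	}
	req.Header.Set("Authorization", "Bearer "+os.Getenv("OPENAI_API_KEY"))
	req.Header.Set("Content-Type", "application/json")

	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()

	// The audio comes back base64-encoded under choices[0].message.audio.data.
	out, _ := io.ReadAll(resp.Body)
	fmt.Println(string(out))
}
```

Since this is an ordinary REST call, it fits the existing client without any new dependency.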

@WqyJh
Contributor

WqyJh commented Nov 11, 2024

> They just released support for audio input and output in the chat completions endpoint, using the gpt-4o-audio-preview model. This could be supported first in the meantime: https://platform.openai.com/docs/guides/audio/quickstart

I just added support for gpt-4o-audio-preview. See #895

@WqyJh
Contributor

WqyJh commented Nov 11, 2024

> @WqyJh, thank you for your effort on this! I think websockets are a fair case to introduce a dependency to this library, and I would love to merge your changes if you'll decide to contribute 🙌🏻

I'd like to contribute to the project. Since the realtime support involves a lot of code and examples, it will take some time to complete. Mixing all the code together would create a mess, so I suggest organizing all the realtime code into a folder named realtime.

@sabuhigr

I don't think we need to implement the Realtime API with WebSockets/WebRTC.

The chat completions API supports audio input now.
Ref: https://platform.openai.com/docs/guides/audio

The project concentrates on API-level functionality over REST, not on different protocols like WebRTC or WebSocket.
