Separate fast and smart llm providers #813
Conversation
Hey @kesamet, sorry it took me so long to reply here. First of all, this is super valuable!! We've been getting a lot of requests to use different LLMs for different actions. A bit of feedback:
This approach is probably in the right direction and would be optimal. Lmk if you're up for it, or I can take it from here!
Hey @assafelovic, thanks for the feedback and sorry for the late reply. Taking inspiration from AWS Bedrock, where the model name is something like "anthropic.claude-3-sonnet-20240229-v1:0", I think it is best to combine the LLM_PROVIDER and LLM_MODEL env vars into a single name of the form "<llm_provider>:<llm_model>", joined with a colon. I call them "FAST_LLM_NAME" and "SMART_LLM_NAME", e.g.
What do you think?
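A minimal sketch of how such a combined name might be split back into its provider and model parts. The env values and the `parse_llm_name` helper are illustrative assumptions, not code from the PR itself:

```python
import os

# Hypothetical example values for the proposed combined env vars;
# the provider/model names here are placeholders, not from the PR.
os.environ.setdefault("FAST_LLM_NAME", "openai:gpt-3.5-turbo")
os.environ.setdefault("SMART_LLM_NAME", "anthropic:claude-3-sonnet-20240229-v1:0")

def parse_llm_name(env_var: str) -> tuple[str, str]:
    """Split a combined '<llm_provider>:<llm_model>' name into its parts.

    Only the first colon separates provider from model, so model IDs that
    themselves contain colons (e.g. Bedrock-style version suffixes like
    '...-v1:0') survive intact.
    """
    name = os.environ[env_var]
    provider, _, model = name.partition(":")
    return provider, model

print(parse_llm_name("FAST_LLM_NAME"))   # ('openai', 'gpt-3.5-turbo')
print(parse_llm_name("SMART_LLM_NAME"))  # ('anthropic', 'claude-3-sonnet-20240229-v1:0')
```

Splitting on the first colon only is the key design choice: it keeps the provider prefix unambiguous while leaving the model identifier free to contain colons of its own.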
This actually sounds great! Will require some refactoring, but love it
Please help review the PR.
Embeddings would be amazing @kesamet as well. Thank you for your contributions, I will dive into this PR in the coming days |
Hey @kesamet is this ready for review? |
@assafelovic yes |
But I noticed that
@kesamet it was used to summarize retrieved articles but since we're using embedding retrieval it's in no use right now. Still worth having the configs in case we find other use cases for it |
@kesamet everything looks great, may I ask for one last revision and remove the
@assafelovic Done!
Thank you @kesamet, this is huge and will open many opportunities! I'll send an update on this to the community in a few days.
Different LLM sources for "SMART" and "FAST".
For issues #702, #598