[RFC] Dynamic tool support in skills #459

zane-neo · 2024-11-05T08:25:34Z

Dynamic tool support in skills

Problem statement

Tools are important part in ml-commons agent framework, but currently there are several pain points in tool using for both user and developer. Tool implementation is not elegant as the tool copies the corresponding core/plugin code to it and apply certain transformation to make it suitable for tool’s purpose, this brings several pain points:

The tool needs to keep tracking the code source to make it adapt to changes which takes a lot of maintenance effort, e.g. when _cat/indices API is been changed to _list/indices API: [Enhancement] Change CatIndexTool implementation from _cat/index action to _list/index action ml-commons#3182, tool maintainer needs to copy the latest code.
The tool needs to be versioned for different AOS versions and user needs to be aware of the version difference to ensure the tool runs as expected.
A user can’t use a tool that hasn't been implemented in skills or ml-commons which is not a good user experience.

Purposed solution

If the tools can be configured dynamically and if the execution run the tool based on configuration, then it can eliminate the pain points above, the high level solution looks like below:

A new tool named DynamicTool will be created and this tool in charged for executing tools defined with configuration.
In /agents/_register API, the request_body needs a little modification to support dynamic tool, the uri, request_body, tool_steps in parameters map are critical keys to identify if it’s dynamic.
- uri: the actual uri of the REST API the tool uses, e.g. CatIndexTool uses _cat/indices API and MLModelTool uses _ml/{model_id}/_predict
- request_body: the request body of the corresponding REST API.
- tool_steps: A tool could be a simple tool or a composite tool, e.g. a RAGTool is composite with two steps, first to retrieve context from knowledge base, second step is to invoke LLM to generate response with context and user question.

{
  "name": "dynamic tool for composite tool",
  "type": "conversational_flow",
  "description": "this is a test agent",
  "memory": {
    "type": "conversation_index"
  },
  "tools": [
    {
      "name": "text embedding RAG tool",
      "type": "RAGTool",
      "tool_steps": [
        {
          "type": "DynamicTool",
          "name": "text embedding model query tool",
          "parameters": {
            "uri": "/_ml/${parameters.textEmbeddingModelId}/_predict",
            "textEmbeddingModelId": "FSdp4ZIBKOcmWBSuKJGR",
            "request_body": "{\"query\": {\"nested\": {\"path\": \"${parameters.nestedPath:-null}\",\"score_mode\": \"${parameters.score_mode:-null}\",\"query\": {\"neural\": {\"embeddingField\": {\"query_text\": \"${parameters.query_text:-null}\",\"model_id\": \"${parameters.model_id:-null}\",\"k\": \"${parameters.k:-null}\"}}}}}}"
          }
        },
        {
          "type": "MLModelTool",
          "name": "LLM interaction tool",
          "parameters": {
            "uri": "/${parameters.index}/_search",
            "textEmbeddingModelId": "FSdp4ZIBKOcmWBSuKJGR",
            "request_body": "{\"parameters\": {\"prompt\": \"You're an political expert can answer any questions related to politics.\",\"question\": \"${parameters.question}\",\"max_token\": 10}}"
          }
        }
      ]
    }
  ],
  "app_type": "rag"
}

During agent runtime, the tools are being created with the configuration, the tool itself doesn’t have to be implemented in code base, the only constraints is user needs to ensure the uri exists in OpenSearch.
During tool execution, a dummy RestRequest will be created and the corresponding TransportHandler will be selected to handle the request. With this, it doesn’t need API level operations like AuthN/AuthZ, and the API response can be returned to tools and agent.

Future plans

Phase1

In first phase we will provide user this capability to use dynamic tools, the mixed use of dynamic tool and existing tool will not be supported.

Phase2

In second phase we will migrate the existing tools to dynamic tools and old tools will be deprecated, there’ll be several build-in functions created for their output/input processing.

The text was updated successfully, but these errors were encountered:

yuye-aws · 2024-11-21T06:24:10Z

It's really good to see you create this RFC. It provides users with a Tool to call any API and saves developer much time to develop a new tool for API.

dblock · 2024-11-25T17:09:35Z

[Catch All Triage - 1, 2, 3, 4, 5]

zane-neo added enhancement New feature or request untriaged labels Nov 5, 2024

zane-neo mentioned this issue Nov 5, 2024

[RFC] Dynamic tool support in agent framework opensearch-project/ml-commons#3202

Open

mingshl added untriaged and removed untriaged labels Nov 5, 2024

zane-neo self-assigned this Nov 6, 2024

dblock removed the untriaged label Nov 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC] Dynamic tool support in skills #459

[RFC] Dynamic tool support in skills #459

zane-neo commented Nov 5, 2024 •

edited

Loading

yuye-aws commented Nov 21, 2024

dblock commented Nov 25, 2024

[RFC] Dynamic tool support in skills #459

[RFC] Dynamic tool support in skills #459

Comments

zane-neo commented Nov 5, 2024 • edited Loading

Dynamic tool support in skills

Problem statement

Purposed solution

Future plans

Phase1

Phase2

yuye-aws commented Nov 21, 2024

dblock commented Nov 25, 2024

zane-neo commented Nov 5, 2024 •

edited

Loading