feat: remove special handling of builtin::rag tool #1015

ehhuang · 2025-02-08T00:04:18Z

Summary:

Lets the model decide which tool it needs to call to respond to a query.

Test Plan:

LLAMA_STACK_CONFIG=fireworks pytest -s -v tests/client-sdk/ --safety-shield meta-llama/Llama-Guard-3-8B

Stack created with Sapling. Best reviewed with ReviewStack.

yanxi0830 · 2025-02-11T17:39:50Z

tests/client-sdk/agents/test_agents.py

@@ -466,5 +503,8 @@ def test_rag_and_code_agent(llama_stack_client, agent_config):
            documents=docs,
        )
        logs = [str(log) for log in EventLogger().log(response) if log is not None]
-        logs_str = "".join(logs)
+        logs_str = "\n".join(logs)
+        print(logs_str)


yanxi0830 · 2025-02-11T17:42:43Z

llama_stack/providers/inline/agents/meta_reference/agent_instance.py

@@ -381,92 +379,6 @@ async def _run(
        if documents:
            await self.handle_documents(session_id, documents, input_messages, tool_defs)

-        if RAG_TOOL_GROUP in toolgroups and len(input_messages) > 0:


Wondering what happens if we explicitly specify the builtin::rag toolgroups now?

https://github.com/meta-llama/llama-stack-apps/blob/7119ea1d4064ab774f10bb6c0f292bc517cb49b7/examples/agents/rag_with_vector_db.py#L83-L88

I'm wondering if there's a way we can specify how to force retrieve v.s. retrieve based on model tool call.

There's a knowledge_search tool being exposed (see list_runtime_tools). The model chooses which tools to call among all the tools. The user can include in instructions that they would like certain tools called. The tool_choice option will also support this, coming next.

hardikjshah · 2025-02-13T04:32:29Z

llama_stack/providers/inline/agents/meta_reference/agent_instance.py

-                    tools=[
-                        tool for tool in tool_defs.values() if tool_to_group.get(tool.tool_name, None) != RAG_TOOL_GROUP
-                    ],
+                    tools=[tool for tool in tool_defs.values()],


For builtin tools we do not have a description in the system prompt besides

Environment: ipython Tools: brave_search, wolfram_alpha

for all other tools , we have a prompt saying here is the tool and the description of when to use it.
Now, we are neither explicitly calling the tool (sine this was never a builtin tool that the model was trained with) nor does the model know when to invoke this since its not in the client tools for which we include descriptions.

Thus, its not clear how this will get invoked if at all. Can you test our RAG examples to see if this works ?

But the client_sdk tests do have rag in multiple tests and it seems like they passed ? So what am i missing here ?

Yea by removing this logic to remove RAG_TOOL_GROUP before, we actually are passing the rag tool to model now, with descriptions. So the model does call it. And you're right the client_tests do test it.

llama_stack/providers/inline/tool_runtime/rag/memory.py

tests/client-sdk/agents/test_agents.py

hardikjshah · 2025-02-13T04:51:10Z

Also reminder - we will need to update getting_started and other places where we have RAG agent.

llama_stack/providers/inline/tool_runtime/rag/memory.py

hardikjshah

can we update the tests to not hard code but manage other models properly please ?

Summary: Test Plan: Summary: Test Plan:

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Feb 8, 2025

ehhuang changed the title ~~memory~~ feat: remove special handling of builtin::rag tool Feb 8, 2025

ehhuang changed the title ~~feat: remove special handling of builtin::rag tool~~ [RFC] feat: remove special handling of builtin::rag tool Feb 8, 2025

ehhuang force-pushed the pr1014 branch from 6be512a to 0280234 Compare February 8, 2025 01:00

ehhuang changed the title ~~[RFC] feat: remove special handling of builtin::rag tool~~ memory Feb 8, 2025

ehhuang changed the title ~~memory~~ [RFC] feat: remove special handling of builtin::rag tool Feb 8, 2025

ehhuang force-pushed the pr1014 branch 3 times, most recently from 17c3d05 to 01cc4c0 Compare February 11, 2025 07:16

ehhuang changed the title ~~[RFC] feat: remove special handling of builtin::rag tool~~ feat: remove special handling of builtin::rag tool Feb 11, 2025

ehhuang force-pushed the pr1014 branch from 01cc4c0 to b067e63 Compare February 11, 2025 07:17

ehhuang marked this pull request as ready for review February 11, 2025 07:25

ehhuang requested review from ashwinb, yanxi0830, hardikjshah, dltn, raghotham, dineshyv, vladimirivic, sixianyi0721 and terrytangyuan as code owners February 11, 2025 07:25

yanxi0830 reviewed Feb 11, 2025

View reviewed changes

hardikjshah reviewed Feb 13, 2025

View reviewed changes

hardikjshah modified the milestones: v0.1.3, v0.1.4 Feb 13, 2025

hardikjshah assigned ehhuang Feb 14, 2025

ehhuang force-pushed the pr1014 branch from b067e63 to 5a958a0 Compare February 19, 2025 18:44

This was referenced Feb 19, 2025

feat: tool outputs metadata #1155

Merged

[RFC] feat: log model input #1126

Open

ashwinb reviewed Feb 19, 2025

View reviewed changes

llama_stack/providers/inline/tool_runtime/rag/memory.py Show resolved Hide resolved

ehhuang force-pushed the pr1014 branch from 5a958a0 to c7a32ea Compare February 20, 2025 22:45

hardikjshah requested changes Feb 20, 2025

View reviewed changes

ehhuang modified the milestones: v0.1.4, v0.1.5 Feb 21, 2025

wukaixingxp mentioned this pull request Feb 24, 2025

Make DocQA a one-clickable app implementation meta-llama/llama-stack-apps#151

Open

5 tasks

ehhuang force-pushed the pr1014 branch from c7a32ea to 1e3cfba Compare February 24, 2025 23:21

ehhuang mentioned this pull request Feb 24, 2025

feat: allow specifying specific tool within toolgroup #1239

Open

feat: remove special handling of builtin::rag tool

71eb6e1

Summary: Test Plan: Summary: Test Plan:

ehhuang force-pushed the pr1014 branch from 1e3cfba to 71eb6e1 Compare February 25, 2025 07:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: remove special handling of builtin::rag tool #1015

feat: remove special handling of builtin::rag tool #1015

ehhuang commented Feb 8, 2025 •

edited

Loading

yanxi0830 Feb 11, 2025

yanxi0830 Feb 11, 2025 •

edited

Loading

ehhuang Feb 11, 2025

hardikjshah Feb 13, 2025

hardikjshah Feb 13, 2025

ehhuang Feb 18, 2025

hardikjshah commented Feb 13, 2025

hardikjshah left a comment

feat: remove special handling of builtin::rag tool #1015

Are you sure you want to change the base?

feat: remove special handling of builtin::rag tool #1015

Conversation

ehhuang commented Feb 8, 2025 • edited Loading

yanxi0830 Feb 11, 2025

Choose a reason for hiding this comment

yanxi0830 Feb 11, 2025 • edited Loading

Choose a reason for hiding this comment

ehhuang Feb 11, 2025

Choose a reason for hiding this comment

hardikjshah Feb 13, 2025

Choose a reason for hiding this comment

hardikjshah Feb 13, 2025

Choose a reason for hiding this comment

ehhuang Feb 18, 2025

Choose a reason for hiding this comment

hardikjshah commented Feb 13, 2025

hardikjshah left a comment

Choose a reason for hiding this comment

ehhuang commented Feb 8, 2025 •

edited

Loading

yanxi0830 Feb 11, 2025 •

edited

Loading