
fix(weave): Fixes Async integration timing #3653

Open · wants to merge 9 commits into master
Conversation

tssweeney (Collaborator) commented Feb 11, 2025

Bug Observation: Many integrations (notably OpenAI) would report very long runtimes for async methods, specifically inside an eval loop. Initial triage showed that the start event was being emitted long before the actual function started executing.

Bug Root Cause: The root cause is the widespread use of the following pattern inside the integrations: an async function is wrapped with an outer async function, and the op is bound to the outer function:

async def _async_wrapper(*args: Any, **kwargs: Any) -> Any:
    # Possible pre-processing
    res = await fn(*args, **kwargs)  # fn is the underlying client method
    # Possible post-processing
    return res

wrapped_op = weave.op(_async_wrapper)

Explanation: This is particularly bad because the weave tracing engine thinks that the user-defined method is _async_wrapper, while the function of interest is actually fn. We therefore start our wall clock at the entry of _async_wrapper, which immediately suspends back to the event loop. This means the reported time includes all of the event loop's scheduling delay!
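As a minimal, self-contained illustration of the effect (not code from this PR): if the start timestamp is captured when the call is made, but the coroutine only gets to run once the event loop is free, the reported runtime absorbs all of the queueing delay:

import asyncio
import time

async def fn(started_at: float) -> None:
    lag = time.perf_counter() - started_at
    await asyncio.sleep(0.01)  # ~10 ms of actual work
    total = time.perf_counter() - started_at
    print(f"scheduling lag: {lag:.3f}s, reported runtime: {total:.3f}s")

async def eval_loop() -> None:
    tasks = []
    for _ in range(3):
        # The "start event" fires here, but the coroutine has not run yet.
        tasks.append(asyncio.ensure_future(fn(time.perf_counter())))
        time.sleep(0.1)  # stand-in for blocking work elsewhere in the eval
    await asyncio.gather(*tasks)

asyncio.run(eval_loop())

Each call does ~10 ms of real work, but the reported runtime is dominated by the scheduling lag.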

Fix: This is an anti-pattern in our integrations; the pre- and post-processing logic should move to different mechanisms. There are a few classes of changes:

  1. Cases where no additional logic was even needed and this pattern spread via copy-pasta:
    • weave/integrations/anthropic/anthropic_sdk.py
    • weave/integrations/cerebras/cerebras_sdk.py
    • weave/integrations/google_ai_studio/google_ai_studio_sdk.py
    • weave/integrations/instructor/instructor_iterable_utils.py
    • weave/integrations/vertexai/vertexai_sdk.py
  2. Cases where post-processing was used to change the output sent to weave. In these cases we can simply use the new postprocess_output option (see the sketch after this list):
    • weave/integrations/cohere/cohere_sdk.py
  3. Harder cases (OpenAI). Here we do some really weird logic where we inject a param to get the costs out. This required two changes: 1) do the param injection outside the op; 2) use a context var for the skipping logic instead of inspecting inputs. Note: this is definitely not pretty... but it works
    • weave/integrations/openai/openai_sdk.py
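For classes 1 and 2, the shape of the fix is roughly the following (a sketch, not the exact diff; fn stands for the underlying client method, op_kwargs for the op settings, and _strip_raw_fields is a hypothetical post-processor):

from typing import Any

import weave

# Class 1: bind the op directly to fn, so the wall clock starts when fn
# itself starts executing rather than at an outer wrapper's entry.
wrapped_op = weave.op(fn, **op_kwargs)

# Class 2: hand the output transformation to weave instead of awaiting fn
# inside an outer async wrapper.
def _strip_raw_fields(output: Any) -> Any:  # hypothetical post-processor
    return output

wrapped_op = weave.op(fn, postprocess_output=_strip_raw_fields)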

Finally, weave/trace/op_extensions/accumulator.py needed to be enhanced so that on_output can now handle Coroutines (:
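The idea is roughly the following (a sketch with illustrative names, not the actual accumulator code):

import inspect
from typing import Any, Callable

def make_on_output(accumulate: Callable[[Any], Any]) -> Callable[[Any], Any]:
    def on_output(value: Any) -> Any:
        # If the wrapped function handed back a coroutine, defer
        # accumulation until it resolves instead of treating the
        # coroutine object itself as the output.
        if inspect.iscoroutine(value):
            async def _resolved() -> Any:
                return accumulate(await value)
            return _resolved()
        return accumulate(value)
    return on_output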

Testing: The existing integration tests served as a pretty good suite.

@tssweeney tssweeney requested a review from a team as a code owner February 11, 2025 02:59
op_kwargs = settings.model_dump()
op = weave.op(_post_process_response(fn), **op_kwargs)
user_provided_postprocess_output = op_kwargs.pop("postprocess_output", None)
Member: how were we using this before?

@@ -311,32 +312,41 @@ def openai_on_input_handler(
def create_wrapper_sync(settings: OpSettings) -> Callable[[Callable], Callable]:
def wrapper(fn: Callable) -> Callable:
"We need to do this so we can check if `stream` is used"
should_skip_last = contextvars.ContextVar("should_skip_last", default=False)
Member: the comment is now confusing; also, single quotes are weird while we're here.

@gtarpenning gtarpenning (Member) left a comment:

I think @andrewtruong should look at the accumulator bit

@@ -311,32 +312,41 @@ def openai_on_input_handler(
def create_wrapper_sync(settings: OpSettings) -> Callable[[Callable], Callable]:
def wrapper(fn: Callable) -> Callable:
"We need to do this so we can check if `stream` is used"
should_skip_last = contextvars.ContextVar("should_skip_last", default=False)
Collaborator: What does skip_last mean? Maybe a comment here would be helpful.

@@ -347,33 +357,45 @@ def _openai_stream_options_is_set(inputs: dict) -> bool:
def create_wrapper_async(settings: OpSettings) -> Callable[[Callable], Callable]:
def wrapper(fn: Callable) -> Callable:
"We need to do this so we can check if `stream` is used"
should_skip_last = contextvars.ContextVar(
"should_skip_last_async", default=False
Collaborator: Same here; I'm not sure what skip_last means. Implicitly it has something to do with streaming?

Member: This is because OpenAI appends token usage information to the end of the stream. We need that info, so we always try to grab it. If the user asked for it, we don't skip it; but if the user hasn't requested it, we skip that final chunk so the output looks as expected.
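To illustrate the two pieces (a hypothetical sketch, not the PR's exact code): the wrapper injects OpenAI's stream_options={"include_usage": True} when the caller didn't request it, marks the call via the context var, and the stream handler later hides the injected final chunk:

import contextvars
from typing import Any, Callable, Iterator

should_skip_last = contextvars.ContextVar("should_skip_last", default=False)

def call_with_usage(create: Callable[..., Any], **kwargs: Any) -> Any:
    # Inject usage reporting if the caller streams without requesting it,
    # and remember that the final (usage-only) chunk must be hidden.
    if kwargs.get("stream") and "stream_options" not in kwargs:
        kwargs["stream_options"] = {"include_usage": True}
        should_skip_last.set(True)
    return create(**kwargs)

def visible_chunks(chunks: Iterator[Any]) -> Iterator[Any]:
    # Yield everything except the last chunk when we injected it ourselves.
    prev = None
    for chunk in chunks:
        if prev is not None:
            yield prev
        prev = chunk
    if prev is not None and not should_skip_last.get():
        yield prev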
