
Fix: Handle Error for CUDA Configuration and API Request #6572

Open · wants to merge 1 commit into base: main
Conversation

@skywinder skywinder commented Dec 12, 2024

Issue

Running the following curl request:

curl -X POST http://localhost:7860/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "Hello, how are you?",
    "max_tokens": 0
  }'

results in an error:

raise AssertionError("Torch not compiled with CUDA enabled")

Steps to Reproduce
1. Start the API using the following command:

./start_macos.sh --api --api-port 7860 --verbose

2. Test the endpoint with:

curl -X POST http://localhost:7860/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "Hello, how are you?",
    "max_tokens": 0
  }'

Expected Behavior

The request should return a clear error message, or the API should handle the case where CUDA is not available (for example, by falling back to CPU) instead of crashing with an unhandled assertion.
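One way to meet this expectation is to catch the assertion at the request boundary and return a structured error instead of crashing. The sketch below is illustrative, not the project's actual code; `safe_generate` and `generate_fn` are hypothetical names standing in for the real handler and model call.

```python
# Hedged sketch: convert the CUDA assertion into a structured
# error response instead of letting it crash the request handler.
# `generate_fn` is a hypothetical stand-in for the model call.
def safe_generate(generate_fn, prompt: str) -> dict:
    try:
        return {"ok": True, "text": generate_fn(prompt)}
    except (AssertionError, RuntimeError) as exc:
        # "Torch not compiled with CUDA enabled" is raised as an
        # AssertionError by torch when CUDA ops hit a CPU-only build.
        return {"ok": False, "error": str(exc)}
```

With this shape, the API can map the `"ok": False` case to an HTTP error response with a readable message rather than a stack trace.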

Proposed Solution
• Add a preflight check during the startup process to verify CUDA compatibility.
• Ensure the API gracefully handles requests even when CUDA is unavailable.
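A minimal sketch of what the preflight check could look like (assuming PyTorch; the function name `pick_device` is illustrative, not from the codebase). Note that `torch.cuda.is_available()` returns False on CPU-only builds rather than raising, so it is safe to call before any `.cuda()` or `.to("cuda")` operation.

```python
# Sketch of a startup preflight check (illustrative, not the
# project's actual code). Selects "cuda" only when PyTorch is
# installed and was compiled with CUDA support; otherwise falls
# back to "cpu" with a warning instead of asserting later.
def pick_device() -> str:
    try:
        import torch
    except ImportError:
        # PyTorch missing entirely: still fall back to CPU.
        return "cpu"
    if torch.cuda.is_available():
        return "cuda"
    print("Warning: Torch not compiled with CUDA enabled; falling back to CPU.")
    return "cpu"
```

Running this once at startup (e.g., from `start_macos.sh`'s entry point) would surface the CUDA problem before the first request arrives.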

Impact
• Prevents crashes due to unhandled assertions.
• Improves debugging experience with clear error messages.


@jfmherokiller

I applied your fix locally because it allows the system to fall back to CPU when I am playing a game that makes heavy use of my GPU.

@skywinder
Author

> I applied your fix locally because it allows the system to fall back to CPU when I am playing a game that makes heavy use of my GPU.

Thanks, that's one more useful use case. I hope it will be merged.
