Quickstart
Prerequisites
Step 1: Authenticate
predictor loginStep 2: Serve a Model
Option A: Local Model File
predictor up ./llama-7b-q4.ggufOption B: HuggingFace Model
Step 3: Use Your Endpoint
Test with curl
Use with OpenAI SDK
Step 4: Monitor Your Endpoint
Step 5: Shutdown
Next Steps
Last updated