Fastest LLM inference on the planet (Llama 3.1, Mixtral, Gemma) — Spawnpay handles the receipts, you handle the throughput.
Real-time agents where latency matters. Bulk-summary pipelines that need to bill end-users per-call. Multi-tenant tools that want one billing surface across Groq + OpenAI.
POST/openai/v1/chat/completionsAuthorization$0.001 USDC per upstream 2xx (4xx is free)curl -X POST https://spawnpay.ai/api/proxy \
-H 'Authorization: Bearer spk_live_…' \
-H 'X-Provider-Key: $GROQ_KEY' \
-H 'Content-Type: application/json' \
-d '{
"provider": "groq",
"path": "/openai/v1/chat/completions",
"method": "POST",
"body": {
"model": "llama-3.1-70b-versatile",
"messages": [
{
"role": "user",
"content": "hello"
}
]
}
}'
import { proxy } from 'spawnpay-paywall';
const out = await proxy({
"provider": "groq",
"path": "/openai/v1/chat/completions",
"method": "POST",
"body": {
"model": "llama-3.1-70b-versatile",
"messages": [
{
"role": "user",
"content": "hello"
}
]
},
"providerKey": process.env.GROQ_KEY
});
console.log(out.status, out.body);
console.log('charged', out.charged); // txHash + $0.001 USDC receipt
The full upstream JSON body, the upstream status code, the upstream response headers, plus a charged field with the USDC receipt (tx hash) when the call succeeded.