Inference
Generate Streaming Chat Completions
Generate conversational responses with real-time streaming support for immediate response chunks
POST
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Body
application/json
Response
200
application/json
Streaming chat completion responses or complete response depending on stream parameter
The response is of type any
.