Check the deployment status of an aligned model.
This endpoint retrieves the current deployment status of a model, including progress and completion information.
Path Parameters:
model_id: Unique identifier of the model to check deployment status forReturns:
model_id: The model identifierstatus: Deployment status
PENDING: Deployment in progressCOMPLETED: Successfully deployed (provider confirms DEPLOYED/READY)FAILED: Deployment failedresult: Task result details (if completed)start_time: Timestamp when deployment startedend_time: Timestamp when deployment completed (null if still pending)Raises:
404: If model not found or doesn’t belong to userExample Request:
GET /api/v3/models/deploy-model/alignment-xyz789/status
Headers: {"Authorization": "Bearer <api_key>"}
Example Response (Pending):
{
"model_id": "alignment-xyz789",
"status": "PENDING",
"result": null,
"start_time": "2024-02-24T10:00:00Z",
"end_time": null
}
Example Response (Completed):
{
"model_id": "aligned-model-01kmqm4nrn9fw6r",
"status": "COMPLETED",
"result": {
"deployment_id": "deploy-abc123",
"endpoint_url": "https://api.example.com/v1/models/alignment-xyz789"
},
"start_time": "2024-02-24T10:00:00Z",
"end_time": "2024-02-24T10:05:00Z"
}
Example Response (Failed):
{
"model_id": "aligned-model-01kmqm4nrn9fw6r",
"status": "FAILED",
"result": {
"error": "Deployment failed due to insufficient resources"
},
"start_time": "2024-02-24T10:00:00Z",
"end_time": "2024-02-24T10:02:30Z"
}
Notes:
COMPLETED means the model is ready for inference/models/aligned to get the inference endpoint URLBearer authentication header of the form Bearer <token>, where <token> is your auth token.
Returns the current deployment status of an aligned model.
model ID that you want to deploy
"model_abc123"
Task status: PENDING (in progress), COMPLETED (finished), or FAILED (error)
"completed"
Task result if COMPLETED
Timestamp when deployment process started
"2024-02-24T10:00:00Z"
Timestamp when deployment process completed
"2024-02-24T10:05:00Z"