Check the deployment status of an aligned model.
This endpoint retrieves the current deployment status of a model, including progress and completion information.
Path Parameters:
model_id: Unique identifier of the model to check deployment status for
Returns:
model_id: The model identifier
status: Deployment status
    pending: Deployment in progress
    completed: Successfully deployed (provider confirms DEPLOYED/READY)
    failed: Deployment failed
result: Task result details (if completed)
start_time: Timestamp when deployment started
end_time: Timestamp when deployment completed (null if still pending)
Raises:
404: If model not found or doesn’t belong to user
Example Request:
GET /api/v3/models/deploy-model/alignment-xyz789/status
Headers: {"Authorization": "Bearer <api_key>"}
Example Response (Pending):
{
  "model_id": "alignment-xyz789",
  "status": "pending",
  "result": null,
  "start_time": "2024-02-24T10:00:00Z",
  "end_time": null
}
Example Response (Completed):
{
  "model_id": "alignment-xyz789",
  "status": "completed",
  "result": {
    "deployment_id": "deploy-abc123",
    "endpoint_url": "https://api.example.com/v1/models/alignment-xyz789"
  },
  "start_time": "2024-02-24T10:00:00Z",
  "end_time": "2024-02-24T10:05:00Z"
}
Example Response (Failed):
{
  "model_id": "alignment-xyz789",
  "status": "failed",
  "result": {
    "error": "Deployment failed due to insufficient resources"
  },
  "start_time": "2024-02-24T10:00:00Z",
  "end_time": "2024-02-24T10:02:30Z"
}
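Because a deployment stays in pending until it either completes or fails, a client will usually poll this endpoint. The following is a minimal sketch of such a loop, not an official client. BASE_URL is a placeholder host, and the function names, the 10-second interval, and the 60-poll limit are all assumptions chosen for illustration. Only the path, the Authorization header, and the status values come from the examples above.

```python
import json
import time
import urllib.request
from typing import Callable

BASE_URL = "https://api.example.com"  # assumption: placeholder host, replace with your own


def fetch_status(model_id: str, api_key: str) -> dict:
    """GET the deployment status and decode the JSON body."""
    url = f"{BASE_URL}/api/v3/models/deploy-model/{model_id}/status"
    req = urllib.request.Request(url, headers={"Authorization": f"Bearer {api_key}"})
    # urlopen raises urllib.error.HTTPError for a 404 (model not found or not owned by the user)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def wait_for_deployment(
    model_id: str,
    api_key: str,
    fetch: Callable[[str, str], dict] = fetch_status,
    interval_s: float = 10.0,   # hypothetical polling interval
    max_polls: int = 60,        # hypothetical upper bound on attempts
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll until the status is no longer 'pending' and return that response."""
    for _ in range(max_polls):
        body = fetch(model_id, api_key)
        if body["status"] != "pending":
            return body  # either "completed" or "failed"
        sleep(interval_s)
    raise TimeoutError(f"{model_id} still pending after {max_polls} polls")
```

The fetch and sleep parameters are injectable so the loop can be exercised without network access. A caller would typically branch on the returned status and read result for either the deployment details or the error message.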
Notes:
completed means the model is ready for inference
Use /models/aligned to get the inference endpoint URL
Authorization: Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Returns the current deployment status of an aligned model.
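Path Parameters:
model_id: ID of the model whose deployment status you want to check. Example: "model_abc123"
Returns:
status: Task status: pending (in progress), completed (finished), or failed (error). Example: "completed"
result: Task result, if completed
start_time: Timestamp when the deployment process started. Example: "2024-02-24T10:00:00Z"
end_time: Timestamp when the deployment process completed. Example: "2024-02-24T10:05:00Z"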
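The response fields described above can be loaded into a small typed structure. This is a sketch under assumptions: DeploymentStatus and parse_ts are hypothetical names, not part of the API, and the timestamp handling assumes the UTC "Z" format shown in the example responses.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


def parse_ts(value: Optional[str]) -> Optional[datetime]:
    """Parse a timestamp such as '2024-02-24T10:00:00Z'; pass None through."""
    if value is None:
        return None
    # Replace the trailing 'Z' so fromisoformat also works before Python 3.11
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


@dataclass
class DeploymentStatus:  # hypothetical helper type, for illustration only
    model_id: str
    status: str                     # "pending", "completed", or "failed"
    result: Optional[dict]          # null while pending in the examples
    start_time: Optional[datetime]
    end_time: Optional[datetime]    # None while still pending

    @classmethod
    def from_json(cls, body: dict) -> "DeploymentStatus":
        return cls(
            model_id=body["model_id"],
            status=body["status"],
            result=body.get("result"),
            start_time=parse_ts(body.get("start_time")),
            end_time=parse_ts(body.get("end_time")),
        )

    @property
    def duration(self) -> Optional[timedelta]:
        """Elapsed deployment time, or None if either timestamp is missing."""
        if self.start_time is None or self.end_time is None:
            return None
        return self.end_time - self.start_time
```

For the completed example response, duration evaluates to five minutes. For the pending example it is None, because end_time is null.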