Instances
Last updated
Last updated
This endpoint retrieves a list of all instances. The request requires a Bearer token in the header.
Successfully retrieved all instances
This endpoint returns detailed information of the passed instance.
The unique identifier of the instance.
"instance-12345"
Successfully returned information about a specific instance.
This endpoint starts an instance based on the provided instance arguments. The full instance is not available immediately after starting, but the status can be checked using the /instances/{instance_id} endpoint. The request requires a Bearer token in the header.
The id of the model to start.
"meta-llama--Meta-Llama-3.1-8B-Instruct"
The id of hardware type to run the model on. Default is recommended_config of the model.
"NVIDIA_H100_1"
The billing interval.
"per_minute"
Context length can be reduced if the full context length is not needed for the task. Decreasing the maximum context length increases the throughput of the model. Default is min(32000, max_context_length).
8192
The instance is starting.
Status message indicating that the instance is starting.
"Instance is starting."
This endpoint restarts an instance based on the provided instance ID. The full instance is not available immediately after restarting, but the status can be checked using the /instances/{instance_id} endpoint. The request requires a Bearer token in the header.
The ID of the instance to restart.
"instance-12345"
Successfully restarted the instance.
Status message indicating that the instance is starting.
"Instance is starting."
This endpoint stops an instance based on the provided instance ID. The request requires a Bearer token in the header.
The ID of the instance to stop.
"instance-12345"
Successfully stopped the instance.
Status message indicating that the instance was stopped.
"Instance stopped successfully"
This endpoint stops all running instances. The request requires a Bearer token in the header.
Successfully stopped all instances.
Status message indicating that all instances were stopped.
"All instances stopped successfully."
The IDs of the stopped instances.
["instance-12345","instance-67890"]
This endpoint deletes an instance based on the provided instance ID. The instance must first be stopped before it can be deleted. To stop the instance, use the /instances/stop endpoint. The request requires a Bearer token in the header.
The ID of the instance to delete.
"instance-12345"
Successfully deleted the model instance.
Status message indicating that the instance was deleted.
"Instance deleted successfully."
The ID of the deleted instance.
"instance-12345"
This endpoint deletes all instances. If the force_deletion parameter is set to true, all instances will be deleted without stopping them first. Otherwise, only the stopped instances will be deleted. The request requires a Bearer token in the header.
Successfully deleted all instances.
Status message indicating that all instances were deleted.
"All instances deleted successfully."
The IDs of the deleted instances.
["instance-12345","instance-67890"]