Instances
Last updated
Last updated
This endpoint returns detailed information of the passed instance.
/instances/{instance_id}
The unique identifier of the instance.
instance-12345
This endpoint starts an instance based on the provided instance arguments. The full instance is not available immediately after starting, but the status can be checked using the /instances/{instance_id} endpoint. The request requires a Bearer token in the header.
/instances/start
The id of the model to start.
meta-llama--Meta-Llama-3.1-8B-Instruct
The id of hardware type to run the model on. Default is recommended_config of the model.
NVIDIA_H100_1
The billing interval.
per_minute
per_minute
, per_hour
Context length can be reduced if the full context length is not needed for the task. Decreasing the maximum context length increases the throughput of the model. Default is min(32000, max_context_length).
8192
The number of workers to run the model on. Default is 1.
1
This endpoint restarts an instance based on the provided instance ID. The full instance is not available immediately after restarting, but the status can be checked using the /instances/{instance_id} endpoint. The request requires a Bearer token in the header.
/instances/restart
The ID of the instance to restart.
instance-12345
This endpoint stops an instance based on the provided instance ID. The request requires a Bearer token in the header.
/instances/stop
The ID of the instance to stop.
instance-12345
This endpoint deletes an instance based on the provided instance ID. The instance must first be stopped before it can be deleted. To stop the instance, use the /instances/stop endpoint. The request requires a Bearer token in the header.
/instances/{instance_id}
The ID of the instance to delete.
instance-12345
This endpoint deletes all instances. If the force_deletion parameter is set to true, all instances will be deleted without stopping them first. Otherwise, only the stopped instances will be deleted. The request requires a Bearer token in the header.
/instances/delete-all
Whether to force delete all instances without stopping them first.
true