Models
Last updated
Last updated
This endpoint stops a model instance based on the provided instance ID. The request requires a Bearer token in the header.
The ID of the model instance to stop.
"instance-12345"
Successfully stopped the model instance
Status message indicating that the instance was stopped.
"Instance stopped successfully"
Indicates whether the instance was successfully stopped.
true
This endpoint deletes a model instance based on the provided instance ID. The request requires a Bearer token in the header.
The ID of the model instance to delete.
"instance-12345"
Successfully deleted the model instance
Status message indicating that the instance was deleted.
"Model deleted successfully"
Indicates whether the model instance was successfully deleted.
true
This endpoint restarts a model instance based on the provided instance ID. The request requires a Bearer token in the header.
The ID of the model instance to restart.
"instance-12345"
Successfully restarted the model instance
Status message indicating that the instance is starting.
"Instance is starting"
The ID of the model instance that is restarting.
"instance-12345"
This endpoint returns detailed information of the passed model including evaluations and hardware configs.
Successfully returned information about a model
This endpoint starts a model instance based on the provided model name and instance type. The request requires a Bearer token in the header.
The name of the model to start.
"meta-llama--Meta-Llama-3.1-70B-Instruct"
The type of instance to run the model on. Default is recommended instance type of the model.
"NVIDIA_H100_2"
per_minute (default) or per_hour
"per_minute"
Context length can be reduced if full context length is not needed for the current task. Default is max_context_length of model
4096
Successfully started the model instance
Status message indicating that the instance is starting.
"Instance is starting"
The ID of the started model instance.
"instance-12345"
This endpoint retrieves information about all available models. The request requires a Bearer token in the header.
A list of available models
This endpoint returns the status of all model instances (started and stopped) associated with the authenticated user. The request requires a Bearer token in the header.
Successfully retrieved model instance statuses