Class ServingEndpointsAPI
- java.lang.Object
-
- com.databricks.sdk.service.serving.ServingEndpointsAPI
-
@Generated public class ServingEndpointsAPI extends Object
The Serving Endpoints API allows you to create, update, and delete model serving endpoints.You can use a serving endpoint to serve models from the Databricks Model Registry. Endpoints expose the underlying models as scalable REST API endpoints using serverless compute. This means the endpoints and associated compute resources are fully managed by Databricks and will not appear in your cloud account. A serving endpoint can consist of one or more MLflow models from the Databricks Model Registry, called served models. A serving endpoint can have at most ten served models. You can configure traffic settings to define how requests should be routed to your served models behind an endpoint. Additionally, you can configure the scale of resources that should be applied to each served model.
-
-
Constructor Summary
Constructors Constructor Description ServingEndpointsAPI(ApiClient apiClient)
Regular-use constructorServingEndpointsAPI(ServingEndpointsService mock)
Constructor for mocks
-
Method Summary
-
-
-
Constructor Detail
-
ServingEndpointsAPI
public ServingEndpointsAPI(ApiClient apiClient)
Regular-use constructor
-
ServingEndpointsAPI
public ServingEndpointsAPI(ServingEndpointsService mock)
Constructor for mocks
-
-
Method Detail
-
waitGetServingEndpointNotUpdating
public ServingEndpointDetailed waitGetServingEndpointNotUpdating(String name) throws TimeoutException
- Throws:
TimeoutException
-
waitGetServingEndpointNotUpdating
public ServingEndpointDetailed waitGetServingEndpointNotUpdating(String name, Duration timeout, Consumer<ServingEndpointDetailed> callback) throws TimeoutException
- Throws:
TimeoutException
-
buildLogs
public BuildLogsResponse buildLogs(String name, String servedModelName)
-
buildLogs
public BuildLogsResponse buildLogs(BuildLogsRequest request)
Retrieve the logs associated with building the model's environment for a given serving endpoint's served model.Retrieves the build logs associated with the provided served model.
-
create
public Wait<ServingEndpointDetailed,ServingEndpointDetailed> create(String name, EndpointCoreConfigInput config)
-
create
public Wait<ServingEndpointDetailed,ServingEndpointDetailed> create(CreateServingEndpoint request)
Create a new serving endpoint.
-
delete
public void delete(String name)
-
delete
public void delete(DeleteServingEndpointRequest request)
Delete a serving endpoint.
-
exportMetrics
public void exportMetrics(String name)
-
exportMetrics
public void exportMetrics(ExportMetricsRequest request)
Retrieve the metrics corresponding to a serving endpoint for the current time in Prometheus or OpenMetrics exposition format.Retrieves the metrics associated with the provided serving endpoint in either Prometheus or OpenMetrics exposition format.
-
get
public ServingEndpointDetailed get(String name)
-
get
public ServingEndpointDetailed get(GetServingEndpointRequest request)
Get a single serving endpoint.Retrieves the details for a single serving endpoint.
-
list
public ListEndpointsResponse list()
Retrieve all serving endpoints.
-
logs
public ServerLogsResponse logs(String name, String servedModelName)
-
logs
public ServerLogsResponse logs(LogsRequest request)
Retrieve the most recent log lines associated with a given serving endpoint's served model.Retrieves the service logs associated with the provided served model.
-
query
public QueryEndpointResponse query(String name)
-
query
public QueryEndpointResponse query(QueryRequest request)
Query a serving endpoint with provided model input.
-
updateConfig
public Wait<ServingEndpointDetailed,ServingEndpointDetailed> updateConfig(Collection<ServedModelInput> servedModels, String name)
-
updateConfig
public Wait<ServingEndpointDetailed,ServingEndpointDetailed> updateConfig(EndpointCoreConfigInput request)
Update a serving endpoint with a new config.Updates any combination of the serving endpoint's served models, the compute configuration of those served models, and the endpoint's traffic config. An endpoint that already has an update in progress can not be updated until the current update completes or fails.
-
impl
public ServingEndpointsService impl()
-
-