Interface InferenceMetrics.Builder

    • Method Detail

      • maxInvocations

        InferenceMetrics.Builder maxInvocations(Integer maxInvocations)

        The expected maximum number of requests per minute for the instance.

        Parameters:
        maxInvocations - The expected maximum number of requests per minute for the instance.
        Returns:
        Returns a reference to this object so that method calls can be chained together.

      • modelLatency

        InferenceMetrics.Builder modelLatency(Integer modelLatency)

        The expected model latency at maximum invocations per minute for the instance.

        Parameters:
        modelLatency - The expected model latency at maximum invocations per minute for the instance.
        Returns:
        Returns a reference to this object so that method calls can be chained together.
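
Because both setters return the builder itself, they can be chained in a single expression. The following is a minimal, self-contained sketch of that pattern, not the SDK's implementation: the classes BuilderChainingExample, Metrics, and Builder are hypothetical stand-ins that mirror only the two methods documented above, and the values 120 and 85 are illustrative (this section does not state the unit of modelLatency).

```java
public class BuilderChainingExample {

    /** Hypothetical stand-in for the built object; it only stores the two values. */
    static final class Metrics {
        private final Integer maxInvocations;
        private final Integer modelLatency;

        private Metrics(Builder builder) {
            this.maxInvocations = builder.maxInvocations;
            this.modelLatency = builder.modelLatency;
        }

        static Builder builder() {
            return new Builder();
        }

        Integer maxInvocations() {
            return maxInvocations;
        }

        Integer modelLatency() {
            return modelLatency;
        }
    }

    /** Hypothetical stand-in builder: each setter returns this so calls can be chained. */
    static final class Builder {
        private Integer maxInvocations;
        private Integer modelLatency;

        Builder maxInvocations(Integer maxInvocations) {
            this.maxInvocations = maxInvocations;
            return this;
        }

        Builder modelLatency(Integer modelLatency) {
            this.modelLatency = modelLatency;
            return this;
        }

        Metrics build() {
            return new Metrics(this);
        }
    }

    public static void main(String[] args) {
        // Illustrative values only; the unit of modelLatency is an open assumption here.
        Metrics metrics = Metrics.builder()
                .maxInvocations(120)
                .modelLatency(85)
                .build();

        System.out.println("maxInvocations=" + metrics.maxInvocations()
                + ", modelLatency=" + metrics.modelLatency());
    }
}
```

With the SDK on the classpath, the same chain would presumably start from the model class's own builder factory and end with build(); check the class-level documentation for the exact entry point.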