TritonModelLatency

Screenshot of an AWS CloudWatch Metrics dashboard showing the ModelLatency metric for NVIDIA Triton DistilBERT, graphed with an average value of 1.48 milliseconds over a 1-minute period.
