agent-integration
Ray
Monitor Ray, an open-source unified compute framework for scaling AI and Python workloads, to track health, performance, and resource utilization.
-
Datadog help@datadoghq.com
https://www.datadoghq.com
agent-integration
Monitor Ray, an open-source unified compute framework for scaling AI and Python workloads, to track health, performance, and resource utilization.
Datadog help@datadoghq.com
agent-integration
The Datadog TorchServe integration enables comprehensive monitoring of your TorchServe instances by collecting metrics, events, and logs from the Inference API, Management API, and OpenMetrics endpoints. Track the overall health status, model performance, and custom metrics, and receive alerts on key events such as model additions or removals. This integration supports flexible configuration for hosts, Docker, and Kubernetes environments, helping you ensure your TorchServe deployments are performing optimally and issues are detected quickly.
Datadog help@datadoghq.com
agent-integration
vLLM is a library for LLM inference and serving
Datadog help@datadoghq.com
agent-integration
Monitors Weaviate, an AI-native open-source vector database, to provide real-time visibility into query performance, data ingestion, and storage operations for optimizing AI application workloads.
Datadog help@datadoghq.com
agent-integration
Stream operational and inference-related metrics from Algorithmia's MLOps platform to Datadog for comprehensive monitoring of machine learning models, including detection of model drift, data drift, and model bias.
Algorithmia support@algorithmia.io