Discover community-driven integrations, libraries, and resources to extend Datadog across your stack. Filter by platform, data type, use case, and more.
Collects TokuMX database metrics such as operation counters, replication lag, cache table utilization, and storage size. Note: This integration is only compatible with Python 2 and Agent versions up to 7.37.
The Datadog TorchServe integration enables comprehensive monitoring of your TorchServe instances by collecting metrics, events, and logs from the Inference API, Management API, and OpenMetrics endpoints. Track the overall health status, model performance, and custom metrics, and receive alerts on key events such as model additions or removals. This integration supports flexible configuration for hosts, Docker, and Kubernetes environments, helping you ensure your TorchServe deployments are performing optimally and issues are detected quickly.
Gain visibility into your Velero deployments by monitoring backup, restore, and snapshot operations to track the health, performance, and reliability of your disaster recovery processes.
Monitors Weaviate, an AI-native open-source vector database, to provide real-time visibility into query performance, data ingestion, and storage operations for optimizing AI application workloads.