aws_sdk_client allowlist, so the integration cannot mutate your RDS resources.
Prerequisites
- AWS credentials configured per the AWS integration (role ARN recommended)
- An RDS DB instance you want OpenSRE to investigate
- IAM permissions for the two RDS describe actions listed below
Setup
Environment variables
| Variable | Default | Description |
|---|---|---|
RDS_DB_INSTANCE_IDENTIFIER | — | Required. The DB instance identifier OpenSRE should investigate. |
AWS_REGION | us-east-1 | AWS region the instance lives in. Used by both the integration config and per-tool param extraction. |
RDS_REGION | us-east-1 | Fallback used only when AWS_REGION is not set. |
regionfield on the source dict (when configured via the integrations store)AWS_REGIONenvironment variableRDS_REGIONenvironment variableus-east-1(default)
IAM permissions
The integration only needs two read-only RDS actions:ReadOnlyAccess policy, both actions are already covered.
Tools
| Tool | AWS API call | What it returns |
|---|---|---|
describe_rds_instance | rds:DescribeDBInstances | Instance status, engine + version, instance class, Multi-AZ flag, endpoint address/port, storage type and size, availability zone, and backup window. |
describe_rds_events | rds:DescribeEvents | Recent events for the DB instance — failovers, maintenance, parameter group changes, and backup activity. Defaults to the last 60 minutes; bounded to 20160 minutes (14 days, the AWS limit). |
rds.db_instance_identifier is present in the resolved sources.
Use cases
- Verifying RDS instance status (
available,modifying,failed) when an alert fires - Detecting Multi-AZ failover events around an incident timestamp
- Tracing recent maintenance, parameter group changes, or backup activity that may correlate with the incident
Troubleshooting
| Symptom | Fix |
|---|---|
AccessDenied on rds:DescribeDBInstances | Add the IAM policy above to the role or user used by the AWS integration. |
| DBInstanceNotFound | Confirm RDS_DB_INSTANCE_IDENTIFIER matches an instance in AWS_REGION. |
| Tool reports the wrong region | Either AWS_REGION is set to a different region, or the source dict has a stale region field. Check the resolution order above. |
Upstream correlation validation
OpenSRE also includes a deterministic upstream-correlation smoke validation path for no-trace-ID RDS CPU spike investigations. This allows validating correlation output locally without requiring live Datadog credentials or full LLM investigation setup.Local smoke validation
Run:- correlated signals
- most likely causal driver(s)
Live investigation validation
For live validation, configure Datadog and trigger an investigation against an RDS CPU spike alert. The upstream correlation runtime automatically scopes RDS metrics to the alerting DB instance using thedbinstanceidentifier tag to avoid cross-instance aggregation in multi-RDS environments.
Recommended alert fields:
- correlated signals
- most likely causal driver(s)
Tracer