summaryrefslogtreecommitdiffstats
path: root/health/guides/redis/redis_master_link_down.md
diff options
context:
space:
mode:
authorDaniel Baumann <daniel.baumann@progress-linux.org>2024-03-09 13:19:22 +0000
committerDaniel Baumann <daniel.baumann@progress-linux.org>2024-03-09 13:19:22 +0000
commitc21c3b0befeb46a51b6bf3758ffa30813bea0ff0 (patch)
tree9754ff1ca740f6346cf8483ec915d4054bc5da2d /health/guides/redis/redis_master_link_down.md
parentAdding upstream version 1.43.2. (diff)
downloadnetdata-c21c3b0befeb46a51b6bf3758ffa30813bea0ff0.tar.xz
netdata-c21c3b0befeb46a51b6bf3758ffa30813bea0ff0.zip
Adding upstream version 1.44.3.upstream/1.44.3
Signed-off-by: Daniel Baumann <daniel.baumann@progress-linux.org>
Diffstat (limited to 'health/guides/redis/redis_master_link_down.md')
-rw-r--r--health/guides/redis/redis_master_link_down.md50
1 files changed, 50 insertions, 0 deletions
diff --git a/health/guides/redis/redis_master_link_down.md b/health/guides/redis/redis_master_link_down.md
new file mode 100644
index 000000000..5a2d24293
--- /dev/null
+++ b/health/guides/redis/redis_master_link_down.md
@@ -0,0 +1,50 @@
+### Understand the alert
+
+The `redis_master_link_down` alert is triggered when there is a disconnection between a Redis master and its slave for more than 10 seconds. This alert indicates a potential problem with the replication process and can impact the data consistency across multiple instances.
+
+### Troubleshoot the alert
+
+1. Check the Redis logs
+
+ Examine the Redis logs for any errors or issues regarding the disconnection between the master and slave instances. By default, Redis log files are located at `/var/log/redis/redis.log`. Look for messages related to replication, network errors or timeouts.
+
+ ```
+ grep -i "replication" /var/log/redis/redis.log
+ grep -i "timeout" /var/log/redis/redis.log
+ ```
+
+2. Check the Redis replication status
+
+ Connect to the Redis master using the `redis-cli` tool, and execute the `INFO` command to get the detailed information about the master instance:
+
+ ```
+ redis-cli
+ INFO REPLICATION
+ ```
+
+ Also, check the replication status on the slave instance. If you have access to the IP address and port of the slave, connect to it and run the same `INFO` command.
+
+3. Verify the network connection between the master and slave instances
+
+ Test the network connectivity using `ping` and `telnet` or `nc` commands, ensuring that the connection between the master and slave instances is stable and there are no issues with firewalls or network policies.
+
+ ```
+ ping <slave_ip_address>
+ telnet <slave_ip_address> <redis_port>
+ ```
+
+4. Restart the Redis instances (if needed)
+
+ If Redis instances are experiencing issues or are unable to reconnect, consider restarting them. Be cautious as restarting instances might result in data loss or consistency issues.
+
+ ```
+ sudo systemctl restart redis
+ ```
+
+5. Monitor the situation
+
+ After addressing the potential issues, keep an eye on the Redis instances to ensure that the problem doesn't reoccur.
+
+### Useful resources
+
+1. [Redis Replication Documentation](https://redis.io/topics/replication)