author Daniel Baumann <daniel.baumann@progress-linux.org> 2024-04-19 02:57:58 +0000
committer Daniel Baumann <daniel.baumann@progress-linux.org> 2024-04-19 02:57:58 +0000
commit be1c7e50e1e8809ea56f2c9d472eccd8ffd73a97
tree 9754ff1ca740f6346cf8483ec915d4054bc5da2d /health/guides/hdfs/hdfs_capacity_usage.md
parent Initial commit.
Adding upstream version 1.44.3. (upstream/1.44.3, upstream)
Signed-off-by: Daniel Baumann <daniel.baumann@progress-linux.org>
Diffstat (limited to 'health/guides/hdfs/hdfs_capacity_usage.md')
-rw-r--r-- health/guides/hdfs/hdfs_capacity_usage.md 42
1 files changed, 42 insertions, 0 deletions
diff --git a/health/guides/hdfs/hdfs_capacity_usage.md b/health/guides/hdfs/hdfs_capacity_usage.md
new file mode 100644
index 00000000..666dcdc2
--- /dev/null
+++ b/health/guides/hdfs/hdfs_capacity_usage.md
@@ -0,0 +1,42 @@
+### Understand the alert
+
+This alert calculates the percentage of used space capacity across all DataNodes in the Hadoop Distributed File System (HDFS). If you receive this alert, space utilization across your HDFS DataNodes is high.
+
+The alert enters the warning state when the percentage of used space capacity across all DataNodes is between 70% and 80%, and the critical state when it is between 80% and 90%.
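A minimal sketch of how the percentage and the two states described above relate. The function name `hdfs_usage_status` is hypothetical, and treating the lower bound of each range (70 and 80) as the trigger is an assumption taken from the ranges in the text; the actual alert definition may apply hysteresis between the bounds.

```shell
#!/bin/sh
# Hypothetical helper: classify HDFS capacity usage from two integer byte counts.
#   $1 = used bytes across all DataNodes
#   $2 = total configured capacity in bytes
hdfs_usage_status() {
    used=$1
    capacity=$2
    if [ "$capacity" -le 0 ]; then
        echo "capacity must be positive" >&2
        return 1
    fi
    # Integer percentage of used capacity, rounded down.
    pct=$(( used * 100 / capacity ))
    # Assumed triggers: 80 for critical, 70 for warning (lower bounds from the text).
    if [ "$pct" -ge 80 ]; then
        echo "critical ${pct}%"
    elif [ "$pct" -ge 70 ]; then
        echo "warning ${pct}%"
    else
        echo "clear ${pct}%"
    fi
}

hdfs_usage_status 750 1000   # prints: warning 75%
```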
+
+### Troubleshoot the alert
+
+Data is priceless. Before you perform any action, make sure that you have taken any necessary backup steps. Netdata is not liable for any loss or corruption of any data, database, or software.
+
+#### Check your Disk Usage across the cluster
+
+1. Inspect the Disk Usage for each DataNode:
+
+ ```
+ root@netdata # hdfs dfsadmin -report
+ ```
+
+ If all the DataNodes are under disk pressure, you should consider adding more disk space. Otherwise, you can rebalance the data across the DataNodes.
+
+2. Perform a balance:
+
+ ```
+ root@netdata # hdfs balancer -threshold 15
+ ```
+
+ With this threshold, the balancer moves blocks from over-utilized to under-utilized nodes until each DataNode's disk usage differs from the overall cluster usage by no more than 15 percentage points in either direction.
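The two steps above can be sketched as a small filter over the report output. It assumes the report prints a cluster-wide `DFS Used%:` line first and then, for each DataNode, a `Name:` line followed by its own `DFS Used%:` line; the exact layout can differ between Hadoop releases. The function name `balance_check` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: read `hdfs dfsadmin -report` output on stdin and label
# each DataNode relative to the cluster-wide usage.
#   $1 = threshold in percentage points (default 15, as in the balancer example)
balance_check() {
    threshold=${1:-15}
    awk -v t="$threshold" '
        # A "Name:" line opens a DataNode section; remember its address.
        /^Name:/ { node = $2; next }
        /^DFS Used%:/ {
            pct = $3 + 0                              # "85.00%" becomes 85
            # The first DFS Used% line (before any Name:) is the cluster summary.
            if (node == "") { cluster = pct; next }
            diff = pct - cluster
            if (diff > t) {
                state = "over-utilized"
            } else if (diff < -t) {
                state = "under-utilized"
            } else {
                state = "balanced"
            }
            print node, $3, state
            node = ""
        }
    '
}

# Example (requires a running cluster):
#   hdfs dfsadmin -report | balance_check 15
```

If every node is reported as balanced and usage is still high, rebalancing will not help and adding capacity or removing data is the remaining option.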
+
+#### Investigate high disk usage
+
+1. Review your Hadoop applications, jobs, and scripts that write data to HDFS. Identify the ones with excessive disk usage or logging.
+
+2. Optimize or refactor these applications, jobs, or scripts to reduce their disk usage.
+
+3. Delete any unnecessary or temporary files from HDFS, if safe to do so.
+
+4. Consider data compression or deduplication strategies, if applicable, to reduce storage usage in HDFS.
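As a starting point for the review above, the largest paths can be ranked from the output of `hdfs dfs -du`. This sketch assumes the size in bytes is the first column of each output line; the function name `largest_paths` and the `/user` path in the example are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: read `hdfs dfs -du <path>` output on stdin and print the
# N largest entries, sorted by the first column (size in bytes), descending.
#   $1 = number of entries to print (default 5)
largest_paths() {
    n=${1:-5}
    sort -k1,1nr | head -n "$n"
}

# Example (requires a running cluster). Do not pass -h to du here:
# human-readable sizes such as "1.2 G" do not sort numerically.
#   hdfs dfs -du /user | largest_paths 3
```

When deleting, keep in mind that `hdfs dfs -rm -r` moves files to the trash when trash is enabled, so space may not be freed until the trash is emptied with `hdfs dfs -expunge` or the files are removed with `-skipTrash`.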
+
+### Useful resources
+
+1. [Apache Hadoop on Wikipedia](https://en.wikipedia.org/wiki/Apache_Hadoop)
+2. [HDFS architecture](https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html)
\ No newline at end of file