felix0221 wrote:
https://github.com/felix0221/H_log
Here you go... sorry for the trouble.
The DataNode on slave-2 looks very unstable.
Code:
~$ grep ERR hadoop-hadoop-datanode-slave-2.log.txt
2016-06-23 01:22:57,570 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 01:22:57,660 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 01:22:58,099 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 01:46:04,123 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 01:46:04,131 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 01:46:04,522 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 01:56:42,471 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 01:56:42,499 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 01:56:43,202 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 03:18:41,230 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
2016-06-23 03:18:41,303 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.1.150:50010, storageID=DS-1471367236-120.97.32.113-50010-1464657472281, infoPort=50075, ipcPort=50020):DataXceiver
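To make the pattern easier to see, the errors can be grouped by minute. This is only a rough sketch: the function name count_errors_per_minute is something I made up, and it assumes the usual log4j layout where the first two fields are date and time and the third is the log level.

```shell
# Hypothetical helper: count ERROR lines per minute in a Hadoop log file.
# Assumes each line starts with "YYYY-MM-DD HH:MM:SS,mmm LEVEL ...".
count_errors_per_minute() {
  awk '$3 == "ERROR" {
         # key = date plus HH:MM (the first 5 characters of the time field)
         n[$1 " " substr($2, 1, 5)]++
       }
       END { for (k in n) print n[k], k }' "$1" | sort -k2,3
}

# Usage:
#   count_errors_per_minute hadoop-hadoop-datanode-slave-2.log.txt
```

In the excerpt above the errors come in short bursts (around 01:22, 01:46, 01:56 and 03:18) rather than being spread evenly, which is the kind of thing this grouping makes visible.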
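Also, judging from the DataNode storage IDs (DS-XXX-YYY-50010-ZZZ), each VM seems to have two IPs, one private and one public: the nodes register with the NameNode as 10.1.1.x, but the storage IDs embed 120.97.32.x.
Code: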
~$ grep "node registration" hadoop-hadoop-namenode-master.log.txt | awk '{ print $10","$12 }' | sort -n | uniq -c
5 10.1.1.118:50010,DS-2036442461-120.97.32.115-50010-1465748226065
13 10.1.1.150:50010,DS-1471367236-120.97.32.113-50010-1464657472281
3 10.1.1.189:50010,DS-1061459805-120.97.32.114-50010-1464657472333
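The two addresses can also be put side by side. This is a minimal sketch, assuming input lines of the form "IP:PORT,DS-XXX-YYY-PORT-ZZZ" as produced by the grep and awk stages above (before uniq -c); the function name split_registration_ips is hypothetical.

```shell
# Hypothetical helper: read "IP:PORT,DS-XXX-YYY-PORT-ZZZ" lines on stdin and
# print the registration IP next to the IP embedded in the storage ID.
split_registration_ips() {
  # Split on ':', ',' and '-': $1 is the registration IP, $5 is the YYY part.
  awk -F'[:,-]' 'NF >= 7 { print $1, $5 }'
}

# Usage:
#   grep "node registration" hadoop-hadoop-namenode-master.log.txt \
#     | awk '{ print $10","$12 }' | sort -u | split_registration_ips
```

If the two columns differ for every node, as they do in the output above, the hostname/IP resolution on each VM is worth checking.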
- Jazz