We ran into the following sequence of events:
- master startup failed after only ROOT had been assigned (for another reason)
- restarted the master without restarting other servers. Since there was at least one region assigned, it went through the failover code path
- master scanned META and inserted every region into /hbase/unassigned in ZK.
- then, it called "listChildren" on the /hbase/unassigned znode, and crashed with "Packet len6080218 is out of range!" since the IPC response was larger than the default maximum.
- is broken by
-
ZOOKEEPER-1162 consistent handling of jute.maxbuffer when attempting to read large zk "directories"
-
- Open
-
- is related to
-
HBASE-24544 Recommend upping zk jute.maxbuffer in all but minor installs
-
- Open
-
-
HBASE-6625 If we have hundreds of thousands of regions getChildren will encouter zk exception
-
- Resolved
-