Description
We ran into the following sequence of events:
- master startup failed after only ROOT had been assigned (for another reason)
- restarted the master without restarting other servers. Since there was at least one region assigned, it went through the failover code path
- master scanned META and inserted every region into /hbase/unassigned in ZK.
- then, it called "listChildren" on the /hbase/unassigned znode, and crashed with "Packet len6080218 is out of range!" since the IPC response was larger than the default maximum.
Attachments
Issue Links
- is broken by
-
ZOOKEEPER-1162 consistent handling of jute.maxbuffer when attempting to read large zk "directories"
- Open
- is related to
-
HBASE-24544 Recommend upping zk jute.maxbuffer in all but minor installs
- Open
-
HBASE-6625 If we have hundreds of thousands of regions getChildren will encouter zk exception
- Closed