Currently the troubleshooting docs are blank. As much as I like to believe Cassandra never has any problems I was thinking of writing up a troubleshooting page focussing on:
- Finding the hosts(s) that are behaving badly (common error messages)
- Which logs exist, where they are, and what to look for in which log (common error messages, gc logs, etc)
- Which nodetool commands can give you more information
- Java/Operating systems tools that can help dive deep into performance issues (jstat, top, iostat, cachestat, etc)
Since this is going to be a fairly lengthy page I wanted to get a jira going in case someone else had ideas or had already started. Also if there are any large areas I missed above please comment here and I can include them.