Thanks for your comments Chris
The Nutch webapp is just a set of web pages that let someone know that Search is working. They are decent web pages, have a great look and feel and are something I've seen nearly every newbie Nutch user I've been around leverage to tell whether or not Nutch installed correctly.
well the SOLR webapps would be just as good if not better for debugging. You get all sorts of stats + can debug your queries etc... The front end and its configuration is also a common source of trouble for beginners.
I'm also a fan of the "let's not loose functionality on a technology upgrade task" mantra. That is, we are reorganizing the architecture of Nutch to improve it, not to take away functionality. We should at least support the baseline of functionality that was present in 1.x.
I don't think it is completely lost we still do have the webapps from SOLR
Regardless of the debug aspect mentioned earlier I really think that any real application based on Nutch would customise the front end anyway.
That said, I'm not sure the existing webapp should be maintained in its current form. Maybe we should take a pass at updating the webapp to work with the Nutch 2.0 architecture underneath. I'm happy to pick up a shovel and dig on that one.
This would need doing indeed i.e. get the cached data or inlinks straight from the webtable via GORA. Speaking of which we should probably think in terms of "what functionalities do we have in Nutch that are currently missing in SOLR", one of them being to be able to get the cache from HDFS/GORA/etc... without having to store the content in the index.