Not sure if this is a bug or a missing feature. I'm trying to make Solr work on Alluxio, as described by Timothy Potter in https://www.slideshare.net/thelabdude/running-solr-in-the-cloud-at-memory-speed-with-alluxio/1
The problem I'm facing here is with autoAddReplicas. If I have replicationFactor=1 and the node with that replica dies, the node taking over incorrectly assigns the data directory. For example:
The same happens for ulogDir. Apparently, this has to do with this bit from HDFSDirectoryFactory:
If I add "alluxio:/" in there, the paths are correct and the index is recovered.
I see a few options here:
- add "alluxio:/" to the list there
- add a regular expression along the lines of [a-z]*:/ instead. I hope that's not too expensive, but I'm not sure how often this method is called
- don't do anything and expect Alluxio to work with an "hdfs:/" path? I actually tried that and couldn't get it to work
- have a different DirectoryFactory or something else?
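To make the first two options concrete, here is a minimal standalone sketch of both checks. It is not the actual HDFSDirectoryFactory code: the class name SchemePrefixCheck, the two method names, and the sample paths are all hypothetical, chosen only for illustration. The regex variant uses [a-z]+ rather than [a-z]* so that a bare ":/" is not accepted, and it compiles the pattern once, which should keep the per-call cost low even if the method is called often (I have not measured this).

```java
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Solr: compares the two proposed checks.
public class SchemePrefixCheck {

    // Option 1: an explicit list of prefixes treated as absolute.
    private static final List<String> KNOWN_PREFIXES =
            Arrays.asList("hdfs:/", "alluxio:/");

    // Option 2: any lowercase scheme followed by ":/", compiled once.
    private static final Pattern SCHEME_PREFIX = Pattern.compile("[a-z]+:/");

    static boolean isAbsoluteByList(String path) {
        for (String prefix : KNOWN_PREFIXES) {
            if (path.startsWith(prefix)) {
                return true;
            }
        }
        return false;
    }

    static boolean isAbsoluteByRegex(String path) {
        // lookingAt() anchors the match at the start of the string only.
        return SCHEME_PREFIX.matcher(path).lookingAt();
    }

    public static void main(String[] args) {
        // Sample paths are made up for the demonstration.
        String[] samples = {
            "hdfs://namenode:8020/solr/core1/data",
            "alluxio://master:19998/solr/core1/data",
            "s3a://bucket/solr/core1/data",
            "/var/solr/core1/data"
        };
        for (String path : samples) {
            System.out.println(path
                    + " list=" + isAbsoluteByList(path)
                    + " regex=" + isAbsoluteByRegex(path));
        }
    }
}
```

The trade-off is that the list only accepts schemes someone remembered to add, while the regex accepts any scheme, including ones nobody has tested with Solr.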
What do you think?