|
Since it's in the examples, a few brief notes on the initialization of HaltonSequence and on nextPoint would help orient readers. It doesn't need to be a course in statistics- the existing code isn't, either- but even a sentence or two on why PiEstimator is using it and perhaps a couple comments identifying the variables would be helpful.
+1 on the patch, though; documentation is just a suggestion. 4437_20081103.patch: added javadoc, no code changes.
This is nice. I understand it is just an example but if we run more maps for a longer period of time can we get more Pi digits?
Now, as an example it should imo have much better documentation. -1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12393275/4437_20081103.patch against trunk revision 709609. +1 @author. The patch does not contain any @author tags. -1 tests included. The patch doesn't appear to include any new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs warnings. +1 Eclipse classpath. The patch retains Eclipse classpath integrity. +1 core tests. The patch passed core unit tests. +1 contrib tests. The patch passed contrib unit tests. Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3527/testReport/ This message is automatically generated. > This is nice. I understand it is just an example but if we run more maps for a longer period of time can we get more Pi digits?
Yes, the more samples used the more digits will get in both Monte Carlo method (java.util.Random) and qMC method (Halton sequence). However, the discrepancy for Halton sequence is smaller than java.util.Random. The expected error of java.util.Random is O(1/sqrt(N)) while the expected error of using Halton sequence is O((ln N)/N), where N is the number for samples. For estimating Pi with 100,000,000 samples, the accuracy of Halton is ~7 digits but java.util.Random is only ~4 digits as shown previously. > Now, as an example it should imo have much better documentation. This is a good point. I plan to further improve the PiEstimator. Let me also improve the documentation in the next issue. Tested manually with a 100-nodes cluster. The biggest sample set I have run was
I just committed this. |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Try totally 100000000 samples
Job Finished in 22.422 seconds
Estimated value of PI is 3.14145832
Job Finished in 13.375 seconds
Estimated value of PI is 3.14159256000000000000