Details

Type: Bug

Status: Closed

Priority: Major

Resolution: Fixed

Affects Version/s: None

Fix Version/s: None

Labels: None

Environment:
Operating System: Windows XP
Platform: PC
Description
Hi, for anyone who might use this class:
I found several problems when using it to calculate the
cumulative probability. My code is attached for reference. Three things:
1. When I used my code to calculate cumulativeProbability(50) for
5000 200 100 (population size, number of successes, sample size), the
result was greater than 1 (1.0000000000134985).
2. When I calculated cumulativeProbability(50) and
cumulativeProbability(51) for the distribution 5000 200 100, I got the
same result, but the two values should differ.
3. cumulativeProbability returns "for this distribution, X,
P(X<=x)", but most of the time (at least in my case) what I care about
is the upper tail, P(X>=x). Given the findings above, I can't simply
use 1 - cumulativeProbability(x - 1) to get what I want.
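For reference, the complement relation mentioned in item 3 can be written out as below. This is only a sketch of the standard identity for an integer-valued hypergeometric variable; the symbols N, K, and n (population size, number of successes, sample size) are notation assumed here, not names taken from the library.

```latex
P(X \ge x) \;=\; 1 - P(X \le x - 1)
          \;=\; \sum_{k=x}^{\min(n,\,K)} \frac{\binom{K}{k}\,\binom{N-K}{\,n-k\,}}{\binom{N}{n}}
```

The two right-hand sides are equal mathematically, but only the sum avoids subtracting a value close to 1.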
Here's what I think might be related to the problem: since
cumulativeProbability calculates the lower tail, P(X<=x), a
distribution like the one above often has this probability very close to 1.
It is then difficult to store a number equal to 1 - 1E-50, because all you
can store is something like 0.9999..., and the remaining digits are
rounded away. To avoid this, I suggest adding a new function that calculates
the upper tail, or changing this one to calculate the probability of x
falling in a range (n<=x<=m), in addition to fixing the overflow in the
current function.
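The rounding effect described above can be illustrated with a small standalone sketch. It does not use Commons Math and is not how the library computes anything; the class name UpperTailSketch and its helper methods are hypothetical, written only to show that a double cannot hold 1 - 1E-50, while summing the upper-tail terms directly keeps the small value.

```java
public class UpperTailSketch {

    // Natural log of the binomial coefficient C(n, k), built up as a sum of logs.
    // Adequate for n in the thousands; not tuned for very large inputs.
    public static double logChoose(int n, int k) {
        double s = 0.0;
        for (int i = 1; i <= k; i++) {
            s += Math.log(n - k + i) - Math.log(i);
        }
        return s;
    }

    // P(X = k) for a population of size N with K successes and a sample of size n.
    public static double pmf(int N, int K, int n, int k) {
        int lo = Math.max(0, n - (N - K));
        int hi = Math.min(n, K);
        if (k < lo || k > hi) {
            return 0.0;
        }
        return Math.exp(logChoose(K, k) + logChoose(N - K, n - k) - logChoose(N, n));
    }

    // P(X >= x), summed term by term starting from the far end of the tail.
    public static double upperTail(int N, int K, int n, int x) {
        double sum = 0.0;
        for (int k = Math.min(n, K); k >= Math.max(x, 0); k--) {
            sum += pmf(N, K, n, k);
        }
        return sum;
    }

    public static void main(String[] args) {
        // A double has about 16 significant digits, so 1 - 1E-50 rounds to exactly 1.
        System.out.println(1.0 - 1e-50 == 1.0); // prints true
        System.out.println("spacing of doubles near 1: " + Math.ulp(1.0));

        // Upper tails for the distribution 5000 200 100 from the report.
        System.out.println("P(X >= 50) = " + upperTail(5000, 200, 100, 50));
        System.out.println("P(X >= 51) = " + upperTail(5000, 200, 100, 51));
    }
}
```

Summed this way, the tails at 50 and 51 come out as distinct, very small positive numbers, whereas subtracting either from 1 gives exactly 1 in double arithmetic.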
Thank you for your patience in reading this far. I'm a newbie, but I've asked
Java experts in our lab about this. Looking into the source code really
is beyond me... I hope someone can fix it. BTW, I'm using Cygwin under
WinXP Pro SP2 with Java SDK 1.4.2_09 build b05, and the versions of
commons-math I used are both 1.0 and the nightly build of 8-15-05.
The code:

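import org.apache.commons.math.distribution.HypergeometricDistributionImpl;

class HyperGeometricProbability {
    public static void main(String args[]) {
        if (args.length != 4) {
            System.out.println("USAGE: java HyperGeometricProbability [population] [numsuccess] [sample] [overlap]");
        } else {
            // Read the four command-line arguments.
            String population = args[0];
            String numsuccess = args[1];
            String sample = args[2];
            String overlap = args[3];
            int populationI = Integer.parseInt(population);
            int numsuccessI = Integer.parseInt(numsuccess);
            int sampleI = Integer.parseInt(sample);
            int overlapI = Integer.parseInt(overlap);
            HypergeometricDistributionImpl hDist =
                new HypergeometricDistributionImpl(populationI, numsuccessI, sampleI);
            double raw_probability = 1.0;
            double cumPro = 1.0;
            double real_cumPro = 1.0;
            try {
                if (0 < overlapI && 0 < numsuccessI && 0 < sampleI) {
                    // ASSUMPTION: the statements inside this block and the
                    // catch block are missing from the report as posted; the
                    // lines below are a guess based on the description.
                    raw_probability = hDist.probability(overlapI);          // P(X = overlap)
                    cumPro = hDist.cumulativeProbability(overlapI - 1);     // P(X <= overlap - 1)
                    real_cumPro = 1.0 - cumPro;                             // P(X >= overlap)
                }
                System.out.println("raw=" + raw_probability
                        + " lower=" + cumPro + " upper=" + real_cumPro);
            } catch (Exception e) {
                System.out.println("Error: " + e);
            }
        }
    }
}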

I tested the nightly build of 8-29-05. It seems the bugs I mentioned above are
all fixed. I will post again if I run into further problems.
Thanks a lot!