Can you explain why you want this? When sampling is done the way it normally is in e-discovery (i.e., without replacement), the correct probability distribution is the hypergeometric distribution, which is what my calculator uses. As far as I know, the only reason the binomial distribution is used in e-discovery is that it is an easier-to-calculate approximation to the hypergeometric distribution. Granted, the binomial distribution is a very good approximation in almost all situations you are likely to encounter in e-discovery (one possible exception being stratified sampling where some strata are rather small), but why use an approximate result when the exact result is already available to you, with no need to do the hard math (since I already did it for you when I built the calculator)?
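
To make the comparison concrete, here is a minimal Python sketch that puts the two distributions side by side. It is only an illustration and not the code behind the calculator; the function names and the population, prevalence, and sample-size figures are made up for the example. It computes the exact hypergeometric probability of finding k relevant documents in a sample drawn without replacement, the binomial approximation to it, and the largest gap between the two over all k.

```python
from math import comb


def hypergeom_pmf(k, N, K, n):
    """P(exactly k relevant docs in a sample of n drawn WITHOUT replacement
    from a population of N docs, K of which are relevant)."""
    if k < max(0, n - (N - K)) or k > min(n, K):
        return 0.0  # outcome is impossible for this population
    return comb(K, k) * comb(N - K, n - k) / comb(N, n)


def binom_pmf(k, n, p):
    """P(exactly k relevant docs in n independent draws, each relevant
    with probability p), i.e. sampling WITH replacement."""
    if k < 0 or k > n:
        return 0.0
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def max_abs_diff(N, K, n):
    """Largest absolute gap between the exact hypergeometric pmf and its
    binomial approximation (with p = K / N) over every possible k."""
    p = K / N
    return max(
        abs(hypergeom_pmf(k, N, K, n) - binom_pmf(k, n, p))
        for k in range(n + 1)
    )


# Hypothetical figures: 20% prevalence and a sample of 25 in both cases.
# A large collection, where the sample is a tiny fraction of the population:
print("large population:", max_abs_diff(N=100_000, K=20_000, n=25))
# A small stratum, where the sample is half of the population:
print("small stratum:   ", max_abs_diff(N=50, K=10, n=25))
```

With figures like these, the gap is negligible when the sample is a small fraction of the population and much larger when the sample covers a big share of a small stratum, which is the situation mentioned above.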
