Cumulative distribution function of a binomial random variable

Gennaro · December 13, 2021, 4:20pm

I've a random variable X/n, where X is a binomial random variable with p=1/2 and n= 10, 100, 1000. The goal is to plot the cumulative distribution function of X/n.

I'm wondering why the first argument of the command pbinom is x*n.

delta = 1/1000
x = seq(0, 1, by=delta)

plot(x=NULL, y=NULL, xlim=c(0,1), ylim=c(0,1), xlab='x', ylab='CDF')

n = 10
y = pbinom(x*n, n, 1/2)
lines(x, y, type='s', col='red')

n = 100
y = pbinom(x*n, n, 1/2)
lines(x, y, type='s', col='blue')

n = 1000
y = pbinom(x*n, n, 1/2)
lines(x, y, type='s')

FJCC · December 13, 2021, 5:40pm

If we label the states of X as 0 and 1, x represents the fraction of the n samples that had X = 1. When x = 0, there were no observations of X = 1 and when x = 1, all n of the samples had X = 1. The other 999 values of x * n represent intermediate cases. When n = 10, almost all of the values of x * n do not represent actually realizable results of integer values of observations. The resulting value remains constant until the next integer threshold, making the stair step appearance of the plot.

system · January 3, 2022, 5:40pm

This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.

If you have a query related to it or one of the replies, start a new topic and refer back with a link.