PROBLEM
Consider the following population with 196 elements stratified into 6 strata.
| NUMBER | No of elements in different strata | sd |
| 1 | 13 | 325 |
| 2 | 18 | 190 |
| 3 | 26 | 186 |
| 4 | 42 | 82 |
| 5 | 73 | 86 |
| 6 | 24 | 190 |
i) Allocate the sample size 56 to different strata using the proportional and neyman allocation .
ii) Which sampling strategy would be suggested? Justify using the values of variance of population mean for these sampling designs.
Theory-
Sample size allocation to different strata in proportional allocation is given by-
nh = Nh.(n/N)
Sample size allocation to different strata in neyman allocation is given by-
Variance of population mean in proportional allocation is given by-
Variance of population mean in neyman allocation is given by-
Where,
nh = sample size allocation in hth stratum
Nh = total number of elements in hth stratum
n = total sample size
N = population size
Sh = variability in hth stratum.
L = number of strata
The allocation method having minimum variance will be considered as better allocation method.
R Code-
#command to remove previous objects
rm(list=ls())
#loading and viewing data set
Population <- read_excel("Practical STA304/prob 10.xlsx")
View(Population)
#checking the class of the dataset
data.class(Population)
#loading dataset on environment
attach(Population)
#extracting the variables
Nh = `No of elements in different strata`
Sdh = `sd`
#calculating some variables
n=56 #given
N=sum(Nh)
f=n/N
Wh=as.numeric(format(Nh/N,digits=4))
Sh=as.numeric(format(sqrt((N/(Nh-1))*Sdh^2),digits=4))
L=length(NUMBER)
Stratum=NUMBER
#defining some vector
n_h_prop = c()
n_h_neyman = c()
#loop to calculate sample allocation in prop. and neyman allocation
for (h in 1:L ){
n_h_prop[h] = round(Nh[h]*n/N)
n_h_neyman[h] = round((n*Nh[h]*Sh[h])/sum(Nh*Sh))
}
#calculating the variance of estimate of population mean (i.e. y bar st.)
y_bar_st_prop=((1-f)/n)*sum(Wh*Sh^2) #for proportional allocation
y_bar_st_Neyman=(((1/n)*(sum(Wh*Sh))^2) - ((1/N)*sum(Wh*Sh^2)) ) #for neyman
#filling data in 7th row of dataframe for different variable
Stratum[7]="Total"
Nh[7]=N
Wh[7]=sum(Wh[1:6])
Sdh[7]="-"
Sh[7]="-"
n_h_prop[7]=sum(n_h_prop[1:6])
n_h_neyman[7]=sum(n_h_neyman[1:6])
#priting the table of different variable along with sample allocation
View(data.frame(Stratum,Nh,Wh,Sdh,Sh,n_h_prop,n_h_neyman))
#printing the variance of estimate of population mean (i.e. y bar st.)
cat("variance of estimate of population mean in Proportional allocation is: ",y_bar_st_prop,"\n")
cat("variance of estimate of population mean in Neyman allocation is: ",y_bar_st_Neyman,"\n")
Some Insights From The R Console-
Imported Dataset On R Console-
| NUMBER | No of elements in different strata | sd |
| 1 | 13 | 325 |
| 2 | 18 | 190 |
| 3 | 26 | 186 |
| 4 | 42 | 82 |
| 5 | 73 | 86 |
| 6 | 24 | 190 |
Output Table With Proportional And Neyman Allocation-
| Stratum | Nh | Wh | Sdh | Sh | n_h_prop | n_h_neyman |
| 1 | 13 | 0.06633 | 325 | 1313.5 | 4 | 13 |
| 2 | 18 | 0.09184 | 190 | 645.1 | 5 | 9 |
| 3 | 26 | 0.13265 | 186 | 520.8 | 7 | 10 |
| 4 | 42 | 0.21429 | 82 | 179.3 | 12 | 6 |
| 5 | 73 | 0.37245 | 86 | 141.9 | 21 | 8 |
| 6 | 24 | 0.12245 | 190 | 554.6 | 7 | 10 |
| Total | 196 | 1.00001 | – | – | 56 | 56 |
And Obtained Variances are-
> #printing the variance of estimate of population mean (i.e. y bar st.)
> cat("variance of estimate of population mean in Proportional allocation is: ",y_bar_st_prop,"\n")
variance of estimate of population mean in Proportional allocation is: 3070.006
> cat("variance of estimate of population mean in Neyman allocation is: ",y_bar_st_Neyman,"\n")
variance of estimate of population mean in Neyman allocation is: 1278.315
Conclusion-
From the above output on R console we conclude that in proportional allocation, the sample allocation to different strata serial wise are- 4,5,7,12,21,7
And in neyman allocation, the sample allocation to different strata serial wise are- 13,9,10,6,8,10 The variance of the estimate of population mean in neyman allocation is 1278.315 which is far less than variance of the estimate of population mean in proportional allocation which is 3070.006. hence we conclude that neyman allocation is more efficient than proportional allocation.