The term cluster sampling is used when random groups
(clusters), rather than random individuals are sampled. For
example,
to sample the students in large school, one might pick at random
4, say, classes. If each class has, say, 30 students, each of who
receives a questionairre, is the sample size 120, or is it 4?
Items in the same cluster are not independent.
- Suppose one of the questions is, "What is your program year?"
and at least three of the classes are 3-rd or 4-th year classes?
(Which could happen in a sample as small as 4)
- Suppose science classes are over represented,
or under-represented, (again, plausible with only 4 classes)
and the questionairre probes attitudes toward science?
- Suppose one of the courses is, Auto Mechanics, or Home
Economics, or Nursing, and some of the questions are gender
related?
Cluster sampling is not always recognized as such:
In Development of Survey Methods to Assess Survey Practices
by Barbara Bailar and Micheal Lanphier, © American Statistical
Association, 36 surveys were investigated. Of these, 24 had
"at least partially clustered" samples. From page 45 of the report:
"In the clustered samples included in the pilot study, . . . .
With only four exceptions, estimates of variance, when made at
all, were computed as though the data had come from a simple
random sample."
Tale heard during a party at a meeting of the Statistal Society of Canada:
Each ocean bottom sample cost some thousands of dollars to dredge.
Then, the sample required hours processing in the lab to reduce it
to 1 c.c. of purified material to be inserted into an instrument
from which a value is read:
All that money and effort for one number.
Despite the cost there were pages and pages of data
Statistician: "How could you ever afford all that data"?
Oceanographer: "What's the problem?
After the specimen is collected and processed, I put it in the
instrument, and Push the button and get a number, and
Push the button again and get another number, and
Push the button and get another number, and
Push the button and get another number,
and . . . "