# Cluster sampling

﻿
Cluster sampling

Cluster Sampling is a sampling technique used when "natural" groupings are evident in a statistical population. It is often used in marketing research. In this technique, the total population is divided into these groups (or clusters) and a sample of the groups is selected. Then the required information is collected from the elements within each selected group. This may be done for every element in these groups or a subsample of elements may be selected within each of these groups. A common motivation for cluster sampling is to reduce the average cost per interview. Given a fixed budget, this can allow an increased sample size. Assuming a fixed sample size, the technique gives more accurate results when most of the variation in the population is within the groups, not between them.

## Cluster elements

Elements within a cluster should ideally be as heterogeneous as possible, but there should be homogeneity between cluster means. Each cluster should be a small scale representation of the total population. The clusters should be mutually exclusive and collectively exhaustive. A random sampling technique is then used on any relevant clusters to choose which clusters to include in the study. In single-stage cluster sampling, all the elements from each of the selected clusters are used. In two-stage cluster sampling, a random sampling technique is applied to the elements from each of the selected clusters.

The main difference between cluster sampling and stratified sampling is that in cluster sampling the cluster is treated as the sampling unit so analysis is done on a population of clusters (at least in the first stage). In stratified sampling, the analysis is done on elements within strata. In stratified sampling, a random sample is drawn from each of the strata, whereas in cluster sampling only the selected clusters are studied. The main objective of cluster sampling is to reduce costs by increasing sampling efficiency. This contrasts with stratified sampling where the main objective is to increase precision.

There also exists multistage sampling, where more than two steps are taken in selecting clusters from clusters.

## Aspects of cluster sampling

One version of cluster sampling is area sampling or geographical cluster sampling. Clusters consist of geographical areas. Because a geographically dispersed population can be expensive to survey, greater economy than simple random sampling can be achieved by treating several respondents within a local area as a cluster. It is usually necessary to increase the total sample size to achieve equivalent precision in the estimators, but cost savings may make that feasible.

In some situations, cluster analysis is only appropriate when the clusters are approximately the same size. This can be achieved by combining clusters. If this is not possible, probability proportionate to size sampling is used. In this method, the probability of selecting any cluster varies with the size of the cluster, giving larger clusters a greater probability of selection and smaller clusters a lower probability. However, if clusters are selected with probability proportionate to size, the same number of interviews should be carried out in each sampled cluster so that each unit sampled has the same probability of selection.

Cluster sampling is used to estimate high mortalities in cases such as wars, famines and natural disasters.[1]

• Can be cheaper than other methods - e.g. fewer travel expenses, administration costs

• Higher sampling error, which can be expressed in the so-called "design effect", the ratio between the number of subjects in the cluster study and the number of subjects in an equally reliable, randomly sampled unclustered study.[2]

## References

1. ^ David Brown, Study Claims Iraq's 'Excess' Death Toll Has Reached 655,000, Washington Post, Wednesday, October 11, 2006. Retrieved September 14, 2010.
2. ^ Kerry and Bland (1998). Statistics notes: The intracluster correlation coefficient in cluster randomisation. British Medical Journal, 316, 1455-1460.

Wikimedia Foundation. 2010.

### Look at other dictionaries:

• cluster sampling — grupinis ėmimas statusas T sritis Standartizacija ir metrologija apibrėžtis Ėmimo būdas, kai visa tiriamoji visuma tam tikru būdu padalijama į nesanklotines grupes arba lizdus, o matavimai atliekami ėminių, kurie sudaryti iš tų atskirų grupių ar… …   Penkiakalbis aiškinamasis metrologijos terminų žodynas

• cluster sampling — / klʌstə ˌsɑ:mplɪŋ/ noun sampling on the basis of well defined groups ● Cluster sampling was used in the survey since there were several very distinct groups in the population under study …   Marketing dictionary in english

• cluster sampling — grupinis ėmimas statusas T sritis chemija apibrėžtis Ėmimas iš nesanklotinių grupių, į kurias tam tikru būdu suskirstyta tiriamoji visuma. atitikmenys: angl. cluster sampling rus. групповая выборка; групповой выбор …   Chemijos terminų aiškinamasis žodynas

• cluster sampling — A method of selecting a sample in which the members of the sample are chosen from one or more groups (clusters), rather than from the population as a whole. This technique is often used when random sampling from the whole population would be… …   Big dictionary of business and management

• cluster sampling — (Statistics) technique of sampling where the whole population is divided into groups and a random sample of these clusters are picked out …   English contemporary dictionary

• cluster sampling — Ops see random sampling …   The ultimate business dictionary

• cluster sampling — A method of selecting a sample in which the population is divided into clusters (groups) from each of which a random sample is taken. This technique is often used in auditing. For example, groups of, say, invoices are chosen at random by the… …   Accounting dictionary

• multi-stage cluster sampling — daugiapakopis grupinis ėmimas statusas T sritis Standartizacija ir metrologija apibrėžtis Grupinis dvipakopis ar daugiapakopis ėmimas, kai imama iš kiekvienos pakopos grupių, į kurias suskaidomos ankstesniu ėmimu gautos grupės. atitikmenys: angl …   Penkiakalbis aiškinamasis metrologijos terminų žodynas

• Sampling (statistics) — Sampling is that part of statistical practice concerned with the selection of individual observations intended to yield some knowledge about a population of concern, especially for the purposes of statistical inference. Each observation measures… …   Wikipedia

• Cluster — A cluster is a small group or bunch of something. Contents 1 In science 2 In astrophysics 3 In biology and health sciences 4 In computing …   Wikipedia