Response to the Office of Management and Budget
2011 California Health Interview Survey (CHIS 2011)
OMB #: 0925-0598
Would you please ask the Program if there is an update regarding section B2.2. copied below?
B.2.2. Estimation Procedure
CHIS data will be statistically weighted to account for the differential probability of selecting persons into the sample, and the weights will be raked to the various domains of California population totals. Specific strategy for drawing and weighting the CHIS sample will depend on the results of the recently fielded CHIS 2009, and as such, final specifications are not available at this time. The methods detailed below, therefore, enumerate the anticipated estimation procedure.
Unlike CHIS 2007 where the cell sample was screened and data collected from cell-only households, CHIS 2009 collects information from all households identified in the cell sample. These include cell only adults, adults that receive most of their calls in their cell phone, adults who receive some of their cells and some on their landlines, and adults who received few or no calls on their cell phone. The same groups are also identified in the landline sample if the adult has a cell phone.
Since the landline and cell phone populations and samples overlap and the drawn samples are probability samples, we can use a multiple frame estimation approach to combine and create weights for these samples. This approach follows the ideas of Hartley (1962) and is different from the approach used to combine the landline and surname samples. The proposed method is needed because we cannot determine the multiple probabilities of selection of all units in the sample. The method we plan to use is outlined in Brick, Flores Cervantes, Norman, and Lee (submitted).
There
are three domains of interest in the overlapping frames. The first
domain called a includes all adults in households with only landline
service, the second domain called b includes all adults in cell-only
households, and the third domain called ab includes all adults in
households with both landline and cell phones. Let
be a characteristic for adults in a domain (e.g., the number of
adults with health insurance). Let
be the estimate of
computed using the landline sample, and let
the estimate of
computed using the records in cell phone sample. An estimate of
using the landline sample is
,
where
is the estimate computed using the records from landline only
households and
is the estimate computed using the adults with a landline and cell
phone from in the landline sample. In a similar way, an estimate of
based
on the cell phone sample is
where
is the estimate computed using the adults with a landline and cell
phone from the cell phone sample and
is the estimate computed using the records from cell only households.
Notice
that neither
nor
are unbiased estimates of
.
However, an unbiased estimate of
can be computed as
,
where
(
)
is the composite or weighting factor. The value of
was chosen to minimize the bias of
.
Before
creating the composite weights, both samples (landline and cell) were
poststratified separately to control totals defined by telephone
service or type (i.e., persons in landline only households, persons
in cell phone only households, and persons in households with both
services). The distribution of telephone usage for California was
derived from the National Health Interview Survey for January to June
2010 for the West region. The poststratified person weight,
is computed as
where
is the person weight (i.e., adult, child, or adolescent) and
is the control total by telephone service.
Once
the samples were poststratified, a composite weight that combined the
landline and cell phone sample was created. Based on the research by
Brick et. al (submitted), using the composite factor
was used to reduce the bias of estimates computed using both samples.
This factor and its complement (
)
can be seen as an additional weighting adjustment factors to apply to
the poststratified weights. The expression of the composite weight,
,
is
where
is the poststratified person weight above. The total sum of weights
is an estimate of the total eligible population in California.
Since the landline and cell phone samples are independent samples, the estimates of variance can be computed using replication or linearization (i.e., Taylor series approximation).
In summary, the supplemental samples (i.e., geographic and surnames samples) are combined with the landline sample at the beginning of the weighting process. The cell phone sample and the combined landline-supplemental samples are first poststratified to telephone status or usage, combined through a composite factor, and then raked altogether.
April 8, 2011
File Type | application/vnd.openxmlformats-officedocument.wordprocessingml.document |
Author | Royce Park |
File Modified | 0000-00-00 |
File Created | 2021-02-01 |