Attachment E - Methodology

Attachment E Methodology for Conducting Perceived Benefits Analysis.doc

Assessing the Diabetes Detection Initiative for Policy Decisions

Attachment E - Methodology

OMB: 0920-0791

Document [doc]
Download: doc | pdf

Assessing the Diabetes Detection Initiative for Policy Decisions





Attachment E. Technical Supplement: Methodology for Conducting Perceived Benefits Analysis



Methodology for conducting the perceived benefits analysis


The underlying behavioural model is a random utility model of consumer decision-making. The standard structure of the random utility model consists of two parts. The first part is a systematic component. The systematic component captures the effect of observable attributes of the choice, such as time to clinic, on choices. The second part of the model is the random component. The random component captures the unobservable portion of the decision. The complete indirect utility function is:


(1)


where V is the systematic portion of utility, y represents income, aj represents an attribute at level j, s is a vector of other individual characteristics and ε represents the random portion. When confronted with a cost $P, the individual will agree to pay if the value of the change in the attributes aj is more valuable than the lost income (P). Specifically:


(2)


Otherwise, the individual will choose to keep the income and live with the original set of attributes. The process, however, is stochastic. Since the researcher cannot know everything each individual's choice is a random variable from the researcher’s perspective. The observed choices are observed with a probability distribution represented by:


(3)


and Prob(not paying) = 1 - Prob(paying). The maximum willingness to pay (P*) for a change in attributes for a representative individual is found by setting ΔV=0 and solving for P. Hanemann and Kanninen (1999) show that if one assumes that the difference between random terms is generated by a standard normal CDF, the probability that individual i pays for a change in the level of an attribute corresponds to a standard probit model.1


A more sophisticated model, which accounts for a data generating process that acquires multiple responses for each respondent would be a random effects probit model. One can envision a data collection process where several stated preference questions could be asked in a single interview. The data collection process would survey individuals using several questions changing the attributes from question to question. This method requires the researcher to replicate equation 2 for each question. In the case where there are j survey questions and n attributes the behavioural model structure would be:











where the superscript denotes an attribute level offered by that pair of questions. For example, the superscript 1 attributes form one bundle of attributes and a cost, P1, which are compared to attribute bundle denoted with superscript 0 and cost P0. If one assumes that each observation is independently distributed N(0,σ2), one can pool the observations across questions and use the standard probit model to estimate the parameters.


However, there can be individual-specific responses to changing attribute levels. Modeling this feature requires the researcher to distinguish three sources of variation among individual responses: i) purely random factors that arise independently in each question; ii) random factors that are individual specific; and iii) deterministic variables such as travel distance that affect the probability of paying to change attribute levels. This correlation pattern can be captured in a random effects model (after Heckman and Willis 1975)2 :


(5)


where Yit* is an unobserved latent variable, Xi is a 1 x k vector of variables representing attribute levels, and β is a k x 1 vector of coefficients. The other two terms represent error components that are mutually independent. The first, μi, represents an unobservable characteristic specific to individual i that does not vary among the observations from i and is N(0,σμ2). The second, εit, is N(0,σε2) and is a component that varies among individuals and across the observations from each individual. The observed random variable, Yit is defined by:


(6)


Define σ2 = σε2 + σμ2, ρ = σμ2/σ2, and impose the normalization that σ2 = 1. Then if Yi = [Yi1, Yi2, ..., Yij] is the observed sequence of choices for i, and defining vi = μiμ then the likelihood function for this model is:


(7)


where Φ(.) is the normal cumulative distribution function. In this expression, ρ represents the correlation coefficient between responses from respondent i across the questions and allows the measurement of the proportion of total variance explained by systematic correlated components. If no correlation is present, ρ=0 and the information can be pooled across questions without regard for the particular respondent who answered and the model parameters estimated using the standard probit model. If ρ>0, then use of the standard probit results in biased standard errors of the coefficients (Guilkey and Murphy 1993) and one must consider the random effects in the estimation.3


The likelihood function for the random effects probit for an observed sample of N Yit`s is simply:



(8)



where P(Yi) represents equation (6). As numerous authors note, the estimation of parameters is difficult due to the computation of the joint probabilities of a T-variate normal distribution, which involves evaluating T-dimensional integrals. However, by conditioning on the individual effects, this problem can be reduced to a single integral involving the product of a standard normal density and the difference of two normal cumulative density functions and solved using Gaussian quadrature procedures (Butler and Moffit 1982).4



Data Development


The data to support estimation of these models will be derived from the choices indicated by DDI participants when presented with sets of alternatives. To illustrate the process for the data base construction, suppose attributes have the following five dimensions:


  • Time Spent Taking Test and Waiting

  • Type of Test

  • Test performance

  • Cost of Test to Participant

  • Location of Test


Hypothetical Choice Set


Dimension

Choice A

Choice B

Choice C

Choice D

Time Spent Taking Test and Waiting

4 hours

2 hours

45 minutes

0

Type of Test

Drink concentrated sugar water and use needle to draw a sample of blood from a vein into a small vial-twice

Answer seven questions

Prick finger and smear a drop of blood on test paper

No Screening

Test performance

Test is not very accurate but will help you decide whether to have a better test

Test is not very accurate but will help you decide whether to have a better test

Test is very accurate but might need another test some of the time

Not Applicable

Cost of Test to You

$10

$35

$35

No Cost

Location of Test

At work or at home if you do not work

At this health clinic

At work or at home if you do not work

Not Applicable




For this example, initially five variables will be sufficient to describe each attribute selected and not selected. However, in the analysis, other variables will need to be constructed because while time and cost are continuous variables, the others are categorical. Thus, we will transform each categorical variable into a set of dummy variables for each level of the attribute except one. For example, the type of test would include three dummy variables for each of the different blood tests and paper test would be the omitted category. Once the complete set of dummy variables has been defined and coded, there will be a vector of characteristics used to describe of each of the choices A through D. It should be noted that choice D remains invariant across sets as it represents the do nothing option. In addition, each set of vectors will also include the respondent ID to facilitate estimation of random effects models.


Perceived Benefit Component


The model developed above will be fitted to the data collected during the data collection phase of the project. It is expected that geographic, gender, age and ethnic differences will be explored using standard econometric methods and the results reported in the Final Report. For simplicity this section will work from a single representative individual.

There are two aspects to the analysis. First, there is the benefit of the program to the users. Second, the models will predict participation in the program and can, therefore, be used to predict the rate of participation in the program as the attributes of the program are changed. This is particularly useful in the context of changing costs because the model will also predict the relationship between the participation and the cost of the program as cost varies while holding attributes constant.


(9)

It is expected that the modeling will examine both linear and non-linear specifications of the attributes. Linear attributes will simply follow the usual procedures. The Marginal willingness-to-pay (MWTP) for an attribute (such as waiting time) to the representative member of the sample is simply:




Marginal willingness-to-pay (MWTP) for a quadratic attribute is found using the following equation:


(10)


where β1 is the parameter on congestions and β2 congestion squared. βTC is the bid coefficient. It should be noted that the same method can be applied to ethnic, age and gender effects by interacting the attribute with the demographic variable of interest. If for example one were interested in gender differences in perceived program benefits a variable that interacted gender with another attribute would be created. If this variable was significantly different from zero

there would be a statistically significant difference between the sexes. The two perceived benefits (if male were coded 0 and female 1) would be:



(11)



The second aspect of the analysis is the development of penetration curves, or participation curves. Each attribute of the program, including cost has an impact on the proportion of people who participate. Using a methodology similar to the one shown for estimating differential MWTP the probability of participation by different demographic groups for a given program can be estimated using the parameters of the model. This can be provided in a tabular of graphical format. It is useful to policymakers as it provides a very clear picture of the results of policy decisions on the success of the program. Figure 1 shows a hypothetical participation graph relating cost to participation:







































1 Hanemann, W.M. and B. Kanninen (1999): "The Statistical Analysis of Discrete-Response CV Data", in I.J. Bateman and K.G. Willis (1999).


2 Heckman, J. J., and R. J. Willis. 1975. Estimation of a stochastic model of reproduction: An econometric approach. In N. Terleckyj (Ed.), Household production and consumption. New York: Columbia University Press.


3 Guilkey, D.K. and Murphy, J.L. (1993) - `Estimation and testing in the random effects probit model', Journal of Econometrics, 59, 301-317.


4Butler, J.S. & Moffit, R. (1982). Notes and comments: A computationally efficient quadrature procedure for the one-factor multinomial probit model.


File Typeapplication/msword
File TitleAugust 17, 2004
AuthorBattelle
Last Modified Bycww6
File Modified2008-01-08
File Created2005-09-28

© 2025 OMB.report | Privacy Policy