__Probit Analysis__


__Introduction__

Probit Analysis is a method of analyzing the relationship between a stimulus (dose) and the quantal (all or nothing) response. Quantitative responses are almost always preferred, but in many situations they are not practical. In these cases, it is only possible to determine if a certain response (such as death) has occurred. In a typical quantal response experiment, groups of animals are given different doses of a drug. The percent dying at each dose level is recorded. These data may then be analyzed using Probit Analysis.

The Probit Model assumes that the percent response is related to the log dose as the cumulative normal distribution. That is, the log doses may be used as variables to read the percent dying from the cumulative normal. Using the normal distribution, rather than other probability distributions, influences the predicted response rate at the high and low ends of possible doses, but has little influence near the middle. Hence, much of the comparison of different drugs is done using response rates of fifty percent. The probit model may be expressed mathematically as follows:

$$P = \alpha + \beta\,\log_{10}(\text{Dose})$$

where P is five plus the inverse normal transform of the response rate (called the Probit). The five is added to reduce the possibility of negative probits, a situation that caused confusion when solving the problem by hand.
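As a minimal sketch of the transform just described, the following Python uses the standard library's `statistics.NormalDist` to convert between a response rate and its probit. The function names `probit` and `response_rate` are illustrative only and are not part of NCSS.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1


def probit(rate):
    """Return five plus the inverse normal transform of a response rate in (0, 1)."""
    return 5.0 + STD_NORMAL.inv_cdf(rate)


def response_rate(probit_value):
    """Invert the transform: map a probit back to a response rate."""
    return STD_NORMAL.cdf(probit_value - 5.0)


# A 50% response rate sits at the center of the normal curve, so its probit is 5.
print(probit(0.5))
# Rates below 50% give probits below 5; the added five keeps typical values positive.
print(probit(0.10), response_rate(probit(0.10)))
```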

The popularity of the method is due in large part to the work of Finney (1971), in his book Probit Analysis. He explains the proper use and analysis of quantal response data. In **NCSS**, we have coded the algorithms given in his book, and we refer you to it for further information and background.

*Data Structure*

The data below are suitable for analysis by this procedure. The first variable, Dose, gives the dose level of the treatment. The second variable, Subjects, gives the number of individuals receiving that dose level. The third variable, Response, gives the number of treated individuals who exhibited the response of interest.

These data are contained in the Survival dataset.

Survival dataset

| Dose | Subjects | Response |
|------|----------|----------|
| 50 | 102 | 19 |
| 60 | 121 | 26 |
| 70 | 111 | 24 |
| 80 | 105 | 31 |
| 90 | 117 | 54 |
| 100 | 108 | 83 |
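To make the layout concrete, here is a small sketch that holds the Survival rows as (dose, subjects, responses) tuples and computes the observed percent responding at each dose. The names `survival` and `observed_rates` are hypothetical and chosen only for this illustration.

```python
# Rows of the Survival dataset as (dose, subjects, responses).
survival = [
    (50, 102, 19),
    (60, 121, 26),
    (70, 111, 24),
    (80, 105, 31),
    (90, 117, 54),
    (100, 108, 83),
]


def observed_rates(rows):
    """Return (dose, rate) pairs, where rate = responses / subjects."""
    rates = []
    for dose, n, r in rows:
        # The response count can never exceed the number of subjects treated.
        if not 0 <= r <= n:
            raise ValueError(f"invalid count {r} for sample size {n} at dose {dose}")
        rates.append((dose, r / n))
    return rates


for dose, rate in observed_rates(survival):
    print(f"dose {dose:>3}: {100 * rate:5.1f}% responding")
```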

__Procedure Options__

This section describes the options available in this procedure.

__Variables Tab__

This panel specifies the variables used in the analysis.

**Count Variable**

__R: Count Variable__

This variable contains the number of individuals with the desired response. It must be less than the number of animals. The analysis adds one-half t

**Sample Size Variable**

__N: Sample Size Variable__

This is the variable containing the total number of individuals sampled at a particular dose level.

**Dose Variable**

__X: Dose Variable__

This option contains the name of the variable containing the dose levels. Note that the analysis uses the log (base 10) transformation of dose levels

**Group Variable**

__Group Variable__

An optional categorical (grouping) v

__Reports Tab__

The following options control the display of reports and plots.

**Select Reports**

**Probit Estimation Report … Dose Percentiles Report**

These options specify whether to display the corresponding report.

**Percentiles**

A separate row in the Dose Percentile report is created for each percentage value given here. This is a list of numbers between 0 and 100 separated by blanks or commas.

**Report Options**

**Precision**

Specify the precision of numbers in the report. A single-precision number will show seven-place accuracy, while a double-precision number will show thirteen-place accuracy. Note that the reports are formatted for single precision. If you select double precision, some numbers may run into others. Also note that all calculations are performed in double precision regardless of which option you select here. This is for reporting purposes only.

**Variable Names**

This option lets you select whether to display only variable names, variable labels, or both.

**Value Labels**

This option lets you select whether to display only values, value labels, or both. Use this option if you want to automatically attach labels to the values of the group variable (like 1=Yes, 2=No, etc.). See the section on specifying Value Labels elsewhere in this manual.

**Plots Tab**

These options control the attributes of the corresponding plots.

**Select Plots**

**Dose – Response Plot … Probit Plot**

These options specify whether to display the corresponding plot. Click the plot format button to change the plot settings.

__Example 1 – Probit Analysis__

This section presents an example of how to perform a probit analysis using the data shown earlier, which are found in the Survival dataset.

You may follow along here by making the appropriate entries or load the completed template Example 1 by clicking on Open Example Template from the File menu of the Probit Analysis window.

**1. Open the Survival dataset.**

- From the File menu of the NCSS Data window, select Open Example Data.
- Click on the file Survival.NCSS.
- Click Open.

**2. Open the Probit Analysis window.**

- On the menus, select Analysis, then Survival / Reliability, then Probit Analysis. The Probit Analysis procedure will be displayed.
- On the menus, select File, then New Template. This will fill the procedure with the default template.

**3. Specify the variables.**

- On the Probit Analysis window, select the Variables tab.
- Double-click in the R: Count Variable box. This will bring up the variable selection window.
- Select Response from the list of variables and then click Ok.
- Double-click in the X: Dose Variable box. This will bring up the variable selection window.
- Select Dose from the list of variables and then click Ok.
- Double-click in the N: Sample Size Variable box. This will bring up the variable selection window.
- Select Subjects from the list of variables and then click Ok.

**4. Run the procedure.**

- From the Run menu, select Run Procedure. Alternatively, just click the green Run button.
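For readers who want to see roughly what such a fit involves outside NCSS, the sketch below estimates the intercept and slope of the probit model for the Survival data by ordinary maximum likelihood, using Fisher scoring. This is an assumption-laden illustration, not the NCSS routine: it does not reproduce any adjustments the procedure may apply, the function name `fit_probit` is hypothetical, and its results may differ slightly from the report that follows.

```python
from math import log10
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1

doses = [50, 60, 70, 80, 90, 100]
subjects = [102, 121, 111, 105, 117, 108]
responses = [19, 26, 24, 31, 54, 83]


def fit_probit(doses, subjects, responses, max_iter=50, tol=1e-10):
    """Fit probit = alpha + beta * log10(dose) by Fisher scoring.

    Returns (alpha, beta) on the traditional scale that includes the added five.
    """
    x = [log10(d) for d in doses]
    # Start from a flat line at the pooled response rate (slope zero).
    a = STD_NORMAL.inv_cdf(sum(responses) / sum(subjects))
    b = 0.0
    for _ in range(max_iter):
        s0 = s1 = 0.0          # score vector (gradient of the log-likelihood)
        i00 = i01 = i11 = 0.0  # expected information matrix
        for xi, n, r in zip(x, subjects, responses):
            eta = a + b * xi
            p = STD_NORMAL.cdf(eta)
            f = STD_NORMAL.pdf(eta)
            u = (r - n * p) * f / (p * (1.0 - p))
            w = n * f * f / (p * (1.0 - p))
            s0 += u
            s1 += u * xi
            i00 += w
            i01 += w * xi
            i11 += w * xi * xi
        # Solve the 2x2 system (information) * (step) = (score).
        det = i00 * i11 - i01 * i01
        da = (i11 * s0 - i01 * s1) / det
        db = (i00 * s1 - i01 * s0) / det
        a += da
        b += db
        if abs(da) + abs(db) < tol:
            break
    return a + 5.0, b  # add five to match the probit convention


alpha, beta = fit_probit(doses, subjects, responses)
ld50 = (5.0 - alpha) / beta  # log10 dose at which the fitted probit equals 5
print(f"alpha={alpha:.4f}  beta={beta:.4f}  LD50={ld50:.4f}  Dose50={10 ** ld50:.2f}")
```

Fisher scoring is used here because it needs only the normal density and distribution functions, which keeps the sketch free of third-party libraries.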

**Probit Estimation Section**


| Parameter | Estimate | Std. Error |
|-----------|----------|------------|
| Alpha | -4.545974 | 1.032341 |
| Beta | 4.901165 | 0.5483724 |
| LD50 | 1.947695 | 1.304145E-02 |
| Dose50 | 88.65325 | 2.662173 |

**Alpha**

The estimated value of the intercept, with its associated standard error.

**Beta**

The estimated value of the slope, with its associated standard error.

**LD50**

The estimated value, on the log10(dose) scale, at which 50% responded.

**Dose50**

The estimated value, on the dose scale, at which 50% responded.
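The four estimates are linked through the model. As a worked check, assuming the model $P = \alpha + \beta\,\log_{10}(\text{Dose})$ from the introduction, a 50% response rate corresponds to a probit of 5, so solving for the log dose with the reported Alpha and Beta gives values that agree with the reported LD50 and Dose50:

```latex
\begin{aligned}
\text{LD50} &= \frac{5 - \alpha}{\beta}
             = \frac{5 - (-4.545974)}{4.901165}
             = \frac{9.545974}{4.901165} \approx 1.9477, \\
\text{Dose50} &= 10^{\text{LD50}} \approx 10^{1.9477} \approx 88.65 .
\end{aligned}
```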

This report displays a table that would have been used if the calculations were carried out by hand. It is presented more for completeness than for any analytic purpose. It does, however, let you investigate the goodness-of-fit of the dose-response model to the data by considering the Chi-square values.

**Dose**

The dose level.

**Actual Percent**

The ratio of the count to the sample size (R/N).

**Probit Percent**

The estimated ratio (R/N) based on the probit model.

**N**

The sample size.

**R**

The count (number responding).

**E(R)**

The expected count based on the probit model.

**Difference**

The difference between the actual and the expected counts.

**Chi-Square**

The Chi-Square statistic for testing whether the difference differs significantly from zero. Since each of these is a test with a single degree of freedom, the value must exceed 3.84 to be significant at the 0.05 level.

**Total Chi-Square**

The total of the Chi-Square values, used to test the overall significance of the differences from the model.

**D.F.**

The degrees of freedom of the Chi-Square test.

**Prob Level**

The probability to the right of the above Chi-Square value. This is the significance level of the Total Chi-Square test.
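The quantities defined above can be sketched in Python as follows. Several things here are assumptions rather than documented NCSS behavior: the per-dose statistic is taken to be the binomial form (R - E)² / (E(1 - P)), the degrees of freedom are taken to be the number of dose levels minus the two estimated parameters, and the names `predicted_rate`, `detail_rows`, and `chi_square_upper_tail` are hypothetical. Alpha and Beta are copied from the Probit Estimation Section.

```python
from math import exp, log10
from statistics import NormalDist

ALPHA, BETA = -4.545974, 4.901165  # estimates from the Probit Estimation Section

doses = [50, 60, 70, 80, 90, 100]
subjects = [102, 121, 111, 105, 117, 108]
responses = [19, 26, 24, 31, 54, 83]


def predicted_rate(dose):
    """Response rate implied by the probit model at a given dose."""
    return NormalDist().cdf(ALPHA + BETA * log10(dose) - 5.0)


def detail_rows(doses, subjects, responses):
    """Return (dose, actual, probit rate, N, R, E(R), difference, chi-square) per dose."""
    rows = []
    for d, n, r in zip(doses, subjects, responses):
        p = predicted_rate(d)
        expected = n * p
        # Assumed form: squared difference over the binomial variance N*P*(1-P).
        chi_sq = (r - expected) ** 2 / (expected * (1.0 - p))
        rows.append((d, r / n, p, n, r, expected, r - expected, chi_sq))
    return rows


def chi_square_upper_tail(x, df):
    """Upper-tail chi-square probability; this closed form holds for even df only."""
    if df <= 0 or df % 2:
        raise ValueError("this sketch supports positive even df only")
    half = x / 2.0
    term = total = 1.0
    for k in range(1, df // 2):
        term *= half / k
        total += term
    return exp(-half) * total


rows = detail_rows(doses, subjects, responses)
total_chi_sq = sum(row[-1] for row in rows)
df = len(rows) - 2  # assumed: dose levels minus the two fitted parameters
for d, actual, fitted, n, r, e, diff, chi_sq in rows:
    print(f"{d:>4} {actual:7.4f} {fitted:7.4f} {n:>4} {r:>4} {e:8.2f} {diff:8.2f} {chi_sq:8.2f}")
print(f"total={total_chi_sq:.2f}  df={df}  prob={chi_square_upper_tail(total_chi_sq, df):.6f}")
```

The closed-form tail probability avoids a dependency on a statistics package, at the cost of handling only even degrees of freedom, which is enough for six dose levels.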

**Dose Percentile Section**

This report displays the dose levels yielding various predicted response rates.

**Percentile**

The response rate times 100.

**Probit**

The inverse normal transform of the response rate, plus five. (The five is added to avoid the possibility of a negative probit. This practice was helpful when calculations were done by hand, but is based solely on tradition now that calculations are carried out by computer.)

**Log Dose**

The logarithm of the dose level (base 10).

**Std. Error Log(Dose)**

The standard error of the estimated log dose level.

**Dose**

The dose level.

**Std. Error Dose**

The standard error of the estimated dose level.
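A minimal sketch of how the Percentile, Probit, Log Dose, and Dose columns relate, using the Alpha and Beta values reported in the Probit Estimation Section. The function name `dose_percentile` is hypothetical, and the standard-error columns are not reproduced because their calculation is not described here.

```python
from statistics import NormalDist

ALPHA, BETA = -4.545974, 4.901165  # estimates from the Probit Estimation Section


def dose_percentile(percent):
    """Return (probit, log10 dose, dose) for a response percentage strictly between 0 and 100."""
    probit = 5.0 + NormalDist().inv_cdf(percent / 100.0)
    log_dose = (probit - ALPHA) / BETA  # invert P = alpha + beta * log10(dose)
    return probit, log_dose, 10.0 ** log_dose


for percent in (10, 25, 50, 75, 90):
    probit, log_dose, dose = dose_percentile(percent)
    print(f"{percent:>3}%  probit={probit:.4f}  log dose={log_dose:.4f}  dose={dose:.2f}")
```

At 50 percent the probit is 5, so this reduces to the LD50 and Dose50 rows of the estimation report.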

This plot lets you look at the relationship between percent response and dose. Usually, this plot will be nonlinear.

This plot lets you look at the relationship between percent response and log dose. Usually, this plot will be nonlinear.