Document

Introduction

We trained various classes of instrument-level models on the normative sample at V5. We refer to these classes of models as full, race, edu, adi, wrat, adi+wrat.

Each model-class was used to generate global z-Scores for the full V5 sample.

We followed the full V5 sample to see whether or not they had ARIC-adjudicated dementia at V6 (cogstatus61 from derive61).

For each model-class, we fit a logistic regression model on the full V5 sample. For each model, the outcome was V6 dementia and the predictor was the V5 global z-Score from a model-class. (Example Stata command: logistic v6_dem v5_globz_full.)

We then used each logistic regression model to predict a "risk score": probability for dementia at visit 6. The basic idea is something like "the higher your V5 global z-Score, the lower your risk for dementia at V6."

Suppose that we wanted to place risk scores into "low risk" and "high risk" categories. This categorization would depend on a "risk-cutoff." Based on this cutoff, we could count many participants were below and above the cutoff for a given batch of risk scores.

Then we could compare these counts with the counts corresponding to the batch of risk scores from the full model. This idea is devloped in more detail with these tables.

Comparison Model

Controls
Did not have Dementia at V6

Cases
Had Dementia at V6