finegray

Analysis

Fine-Gray subdistribution hazard models for competing risks

 . net install finegray, from(...)  

View on GitHub →

Version 1.0.0 | 2026-04-06

finegray fits the Fine and Gray (1999) subdistribution hazards model for competing risks data. It uses a native Mata forward-backward scan implementation that avoids data expansion, so it remains practical on datasets where stcrreg becomes slow or infeasible.

The package also includes post-estimation tools for prediction and proportional subdistribution hazards diagnostics. The intended workflow is finegray for estimation, finegray_predict for xb, CIF, or Schoenfeld residuals, and finegray_phtest for the proportional hazards check.

Requirements

Stata 16 or later
Data must be stset with id()
The estimation sample must contain one observation per subject

Installation

capture ado uninstall finegray
net install finegray, from("https://raw.githubusercontent.com/tpcopeland/Stata-Tools/main/finegray") replace

Commands

Command	Description
`finegray`	Fit a Fine-Gray subdistribution hazards model
`finegray_predict`	Generate `xb`, CIF, or Schoenfeld residuals after `finegray`
`finegray_phtest`	Test the proportional subdistribution hazards assumption

How It Works

The workflow has three parts:

stset the data with one record per subject and an id() variable.
Fit finegray with a compete() event-type variable and cause() for the event of interest.
Use finegray_predict or finegray_phtest after estimation.

Operational details that matter:

compete() is usually coded as 0 = censored, 1 = cause 1, 2 = cause 2, and so on
cause(#) selects the event type of interest
finegray_predict, xb can be used on datasets that contain the model covariates
finegray_predict, cif additionally requires a time variable (_t or timevar())
finegray_predict, schoenfeld and finegray_phtest require the original stset estimation data
Factor-variable models are supported, but prediction on new data still requires the same factor-level support as the estimation sample

Worked Examples

These examples use Stata's built-in webuse hypoxia data because it is a natural competing-risks dataset for the package.

1. Fit the basic Fine-Gray model

failtype identifies competing event types. After creating a clean event-type variable, finegray estimates the subdistribution hazard ratio for cause 1.

webuse hypoxia, clear
gen byte status = failtype
stset dftime, failure(dfcens==1) id(stnum)

finegray ifp tumsize pelnode, compete(status) cause(1)

This is the canonical starting point. By default, the command reports exponentiated subdistribution hazard ratios with sandwich standard errors.

2. Predict cumulative incidence after estimation

Use finegray_predict, cif when you want the fitted cumulative incidence at each observation's event time or at an explicitly supplied time variable.

webuse hypoxia, clear
gen byte status = failtype
stset dftime, failure(dfcens==1) id(stnum)
finegray ifp tumsize pelnode, compete(status) cause(1)

finegray_predict cif_hat, cif
gen double t5 = 5
finegray_predict cif_at5, cif timevar(t5)

cif_hat uses each subject's current _t. cif_at5 instead asks for the fitted CIF at time 5 for every observation.

3. Run the proportional hazards diagnostic

finegray_phtest is the post-estimation check for time-varying effects. It uses scaled Schoenfeld residuals and therefore must be run on the original estimation data.

webuse hypoxia, clear
gen byte status = failtype
stset dftime, failure(dfcens==1) id(stnum)
finegray ifp tumsize pelnode, compete(status) cause(1)

finegray_phtest
finegray_phtest, time(log)

Use the default rank-based test first. time(log) is a sensible sensitivity check when you suspect departures later in follow-up.

4. Common model variations

The package supports factor variables, stratified censoring distributions, cluster-robust inference, and model-based standard errors.

webuse hypoxia, clear
gen byte status = failtype
stset dftime, failure(dfcens==1) id(stnum)

finegray i.pelnode##c.ifp tumsize, compete(status) cause(1)
finegray ifp tumsize, compete(status) cause(1) strata(pelnode)
finegray ifp tumsize pelnode, compete(status) cause(1) norobust
finegray ifp tumsize pelnode, compete(status) cause(1) noshr

norobust switches from the default sandwich variance to the observed-information variance. noshr reports log-SHR coefficients instead of exponentiated SHRs.

Features

Native forward-backward scan implementation without data expansion
Support for factor variables and interactions
Stratified censoring distributions via strata()
Robust, clustered, or model-based standard errors
CIF prediction on estimation data or at user-supplied times
Approximate proportional subdistribution hazards test after estimation
Support for left-truncated data handled through stset

Validation

The package QA cross-validates finegray against Stata's stcrreg and independent R implementations of Fine-Gray regression. The validation files under qa/ cover coefficients, standard errors, log pseudo-likelihoods, CIF predictions, baseline hazards, and stratified censoring behavior.

References

Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association. 1999;94(446):496-509.
Grambsch PM, Therneau TM. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika. 1994;81(3):515-526.
Kawaguchi ES, Shen JI, Suchard MA, Li G. Scalable algorithms for large competing risks data. Journal of Computational and Graphical Statistics. 2021;30(3):685-693.

Version History

1.0.0 (2026-04-06): Initial Stata-Tools release of finegray, finegray_predict, and finegray_phtest

Author

Timothy P Copeland, Karolinska Institutet

License

MIT