Result tables description
Result of PheWAS analysis
| Variable Name | Type | Description | 
|---|---|---|
|  | String | Disease code (Phecode) used in PheWAS analysis |
|  | String | Disease name corresponding to the Phecode |
|  | String | Phecode disease system corresponding to the Phecode (e.g., infectious diseases) |
|  | String | Sex-specificity of the disease (e.g., Both, Male, Female) |
|  | Integer | Number of individuals diagnosed with the disease in the exposed group |
|  | String | Descriptions of the model fitting state and removed covariates with reasons |
|  | String | Incidence rate (unit: per 1,000 person-years) in the exposed group |
|  | String | Incidence rate (unit: per 1,000 person-years) in the unexposed group |
|  | Float | Estimated coefficient from the model |
|  | Float | Standard error of the estimated coefficient |
|  | Float | P-value indicating statistical significance of the coefficient |
|  | Boolean | Indicates whether the result is statistically significant based on adjusted p-value (True/False) |
|  | Float | Adjusted p-value accounting for multiple comparisons |
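
The incidence-rate columns above are reported per 1,000 person-years. As a minimal sketch of that unit (the function name and the numbers are hypothetical, and the package may format or compute its own values differently), the rate is the number of new diagnoses divided by the total follow-up time, scaled by 1,000:

```python
def incidence_rate_per_1000(n_events: int, person_years: float) -> float:
    """Events per 1,000 person-years of follow-up (illustrative helper)."""
    if person_years <= 0:
        raise ValueError("person_years must be positive")
    return 1000.0 * n_events / person_years


# Hypothetical numbers: 42 new diagnoses over 12,500 person-years of follow-up.
rate = incidence_rate_per_1000(42, 12_500.0)
print(f"{rate:.2f} per 1,000 person-years")
```
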
Result of comorbidity strength estimation
| Variable Name | Type | Description | 
|---|---|---|
|  | Integer | Phecode for disease 1 in the disease pair |
|  | Integer | Phecode for disease 2 in the disease pair |
|  | String | Name of the disease pair (format: “D1-D2”) |
|  | Integer | Total number of individuals in the exposed group |
|  | Integer | Number of exposed individuals in the sub-cohort, i.e., those who meet the sex-specificity eligibility criteria for both diseases, after excluding individuals with a history of disease 1, disease 2, or related diseases |
|  | Integer | Number of individuals diagnosed with both diseases |
|  | Integer | Number of individuals diagnosed with disease 1 |
|  | Integer | Number of individuals diagnosed with disease 2 |
|  | Integer | Number of individuals diagnosed with both disease 1 and disease 2 but without defined temporal order (i.e., the time interval between the two diagnoses is smaller than or equal to  |
|  | Integer | Number of individuals diagnosed with disease 1 followed by disease 2 in a defined temporal order (i.e., the time interval between the two diagnoses is larger than  |
|  | Integer | Number of individuals diagnosed with disease 2 followed by disease 1 in a defined temporal order (i.e., the time interval between the two diagnoses is larger than  |
|  | Float | Phi coefficient (φ), i.e., Pearson’s correlation between two binary variables |
|  | Float | P-value for the significance of the Phi coefficient |
|  | Float | Relative risk (RR) of observing both conditions in the same individual relative to expectation |
|  | Float | P-value for the relative risk |
|  | Float | Adjusted p-value for the Phi coefficient (multiple comparisons) |
|  | Float | Adjusted p-value for the relative risk (multiple comparisons) |
|  | Boolean | Whether the Phi coefficient is statistically significant based on adjusted p-value |
|  | Boolean | Whether the relative risk is statistically significant based on adjusted p-value |
|  | String | Name of disease 1 |
|  | String | Phecode disease system related to disease 1 |
|  | String | Sex-specificity of disease 1 |
|  | String | Name of disease 2 |
|  | String | Phecode disease system related to disease 2 |
|  | String | Sex-specificity of disease 2 |
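
To make the two comorbidity-strength measures in this table concrete, the sketch below computes them from the four counts a row provides (sub-cohort size, cases of disease 1, cases of disease 2, and individuals with both). It uses one common formulation, with RR as the observed co-occurrence count divided by the count expected under independence and φ as the Pearson correlation of the two binary indicators. The function name and the example counts are hypothetical, and the package's own estimator may differ in detail.

```python
from math import sqrt


def phi_and_rr(n_total: int, n_d1: int, n_d2: int, n_both: int) -> tuple[float, float]:
    """Return (phi, relative_risk) for one disease pair from cohort counts."""
    if not (0 < n_d1 < n_total and 0 < n_d2 < n_total):
        raise ValueError("each disease needs at least one case and one non-case")
    # Co-occurrence count expected if the two diseases were independent.
    expected_both = n_d1 * n_d2 / n_total
    rr = n_both / expected_both
    # Pearson correlation of two 0/1 indicators, written in terms of counts.
    phi = (n_both * n_total - n_d1 * n_d2) / sqrt(
        n_d1 * n_d2 * (n_total - n_d1) * (n_total - n_d2)
    )
    return phi, rr


# Hypothetical counts for one disease pair.
phi, rr = phi_and_rr(n_total=1000, n_d1=100, n_d2=50, n_both=20)
print(f"phi={phi:.3f}, RR={rr:.1f}")
```

With these counts, 5 co-occurrences are expected under independence and 20 are observed, so the RR is 4. A pair with exactly the expected number of co-occurrences gives RR = 1 and φ = 0.
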
Result of binomial test
| Variable Name | Type | Description | 
|---|---|---|
|  | Float | Phecode for disease 1 in the temporal disease pair |
|  | Float | Phecode for disease 2 in the temporal disease pair |
|  | String | Name of the temporal disease pair (e.g., D1->D2) |
|  | Float | Number of individuals diagnosed with both disease 1 and disease 2 but without defined temporal order (i.e., the time interval between the two diagnoses is smaller than or equal to  |
|  | Float | Number of individuals diagnosed with disease 1 followed by disease 2 in a defined temporal order (i.e., the time interval between the two diagnoses is larger than  |
|  | Float | Number of individuals diagnosed with disease 2 followed by disease 1 in a defined temporal order (i.e., the time interval between the two diagnoses is larger than  |
|  | Float | P-value from the binomial test for directionality |
|  | Float | Proportion of successful outcomes in the binomial test |
|  | String | Confidence interval for the binomial proportion |
|  | String | Name of disease 1 |
|  | String | Phecode disease system for disease 1 |
|  | String | Sex-specificity of disease 1 |
|  | String | Name of disease 2 |
|  | String | Phecode disease system for disease 2 |
|  | String | Sex-specificity of disease 2 |
|  | Boolean | Indicates whether the result is statistically significant based on adjusted p-value |
|  | Float | Adjusted p-value for multiple comparisons |
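
The directionality test in this table can be illustrated with a small standard-library sketch. It rests on assumptions that the table itself does not state: a "success" is taken to be a D1-then-D2 ordering, the number of trials is the sum of the two ordered counts (pairs without a defined order are left out), the null proportion is 0.5, and the alternative is one-sided. The function name is hypothetical.

```python
from math import comb


def binomial_direction_test(
    n_d1_then_d2: int, n_d2_then_d1: int, p_null: float = 0.5
) -> tuple[float, float]:
    """Return (observed proportion, one-sided p-value) for D1 preceding D2."""
    n_trials = n_d1_then_d2 + n_d2_then_d1
    if n_trials == 0:
        raise ValueError("no individuals with a defined temporal order")
    proportion = n_d1_then_d2 / n_trials
    # Upper tail P(X >= observed successes) under Binomial(n_trials, p_null).
    p_value = sum(
        comb(n_trials, k) * p_null**k * (1 - p_null) ** (n_trials - k)
        for k in range(n_d1_then_d2, n_trials + 1)
    )
    return proportion, p_value


# Hypothetical counts: 8 individuals with D1 before D2, 2 with D2 before D1.
proportion, p_value = binomial_direction_test(8, 2)
print(f"proportion={proportion:.2f}, p={p_value:.4f}")
```

For realistic cohort sizes, a library routine such as `scipy.stats.binomtest` is the more robust choice, since it also returns a confidence interval for the proportion.
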
Result of comorbidity network analysis
| Variable Name | Type | Description | 
|---|---|---|
|  | Float | Phecode for disease 1 in the non-temporal disease pair |
|  | Float | Phecode for disease 2 in the non-temporal disease pair |
|  | String | Name of the non-temporal disease pair (e.g., “D1-D2”) |
|  | Integer | Total number of individuals in the exposed group |
|  | Integer | Number of exposed individuals in the sub-cohort, i.e., those who meet the sex-specificity eligibility criteria for both diseases, after excluding individuals with a history of disease 1, disease 2, or related diseases |
|  | String | Number of exposed individuals (individuals with diagnosis of D1) among cases (individuals with diagnosis of D2) |
|  | String | Number of exposed individuals (individuals with diagnosis of D1) among controls (individuals without diagnosis of D2) |
|  | String | Method used for comorbidity network analysis |
|  | String | Description of the model fitting, the covariates removed from the model, and the reasons for their removal |
|  | String | List of covariates used in the model |
|  | String | Z-values for each covariate in the model |
|  | Float | Estimated coefficient from the comorbidity model |
|  | Float | Standard error of the estimated coefficient |
|  | Float | P-value for the comorbidity coefficient |
|  | Float | Akaike information criterion for the model |
|  | String | Name of disease 1 |
|  | String | Phecode disease system for disease 1 |
|  | String | Sex-specificity of disease 1 |
|  | String | Name of disease 2 |
|  | String | Phecode disease system for disease 2 |
|  | String | Sex-specificity of disease 2 |
|  | Boolean | Whether the result is statistically significant based on adjusted p-value |
|  | Float | Adjusted p-value accounting for multiple comparisons |
| Columns for RPCN method | | |
|  | Float | Hyperparameter used for the l1-norm (weight multiplying the l1 penalty term) |
| Columns for PCN_PCA method | | |
|  | Float | Cumulative proportion of variance accounted for by the selected number of principal components in a Principal Component Analysis (sum of explained variance for the principal components) |
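
The PCN_PCA column above reports a cumulative proportion of explained variance. As a minimal sketch of that quantity (the function name and the variances are hypothetical and not taken from the package), it is the variance of the retained components divided by the total variance across all components:

```python
def cumulative_explained_variance(component_variances: list[float], n_components: int) -> float:
    """Share of total variance captured by the n largest principal components."""
    total = sum(component_variances)
    if total <= 0:
        raise ValueError("total variance must be positive")
    if not 0 <= n_components <= len(component_variances):
        raise ValueError("n_components out of range")
    # Principal components are ordered by decreasing variance.
    ordered = sorted(component_variances, reverse=True)
    return sum(ordered[:n_components]) / total


# Hypothetical per-component variances; keep the first two components.
share = cumulative_explained_variance([4.0, 2.0, 1.0, 1.0], n_components=2)
print(f"{share:.2f}")
```
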
Result of disease trajectory analysis
| Variable Name | Type | Description | 
|---|---|---|
|  | Float | Phecode for disease 1 in the temporal disease pair |
|  | Float | Phecode for disease 2 in the temporal disease pair |
|  | String | Name of the temporal disease pair (e.g., “D1 → D2”) |
|  | Integer | Total number of individuals in exposed group |
|  | Integer | Number of exposed individuals included in the nested case-control dataset, where eligible cases (with diagnosis of D2) are all selected and matched with specified number of controls using incidence density sampling |
|  | String | Number of exposed individuals (individuals with diagnosis of D1) among cases (individuals with diagnosis of D2) |
|  | String | Number of exposed individuals (individuals with diagnosis of D1) among controls (individuals without diagnosis of D2) |
|  | String | Method used for disease trajectory analysis |
|  | String | Description of the model fitting, the covariates removed from the model, and the reasons for their removal |
|  | String | List of covariates included in the model |
|  | String | Z-values for each covariate in the model |
|  | Float | Estimated coefficient from the model |
|  | Float | Standard error of the estimated coefficient |
|  | Float | P-value for the coefficient |
|  | Float | Akaike information criterion for the model |
|  | String | Name of disease 1 |
|  | String | Phecode disease system for disease 1 |
|  | String | Sex-specificity of disease 1 |
|  | String | Name of disease 2 |
|  | String | Phecode disease system for disease 2 |
|  | String | Sex-specificity of disease 2 |
|  | Boolean | Whether the result is statistically significant based on adjusted p-value |
|  | Float | Adjusted p-value accounting for multiple comparisons |
| Columns for RPCN method | | |
|  | Float | Hyperparameter used for the l1-norm (weight multiplying the l1 penalty term) |
| Columns for PCN_PCA method | | |
|  | Float | Cumulative proportion of variance in a dataset accounted for by the selected number of principal components in a Principal Component Analysis (sum of explained variance for the principal components) |
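
Each result table pairs an adjusted p-value with a Boolean significance flag. The tables do not say which correction is applied, so the sketch below assumes a Bonferroni adjustment and a 0.05 threshold purely for illustration; the function name is hypothetical. It shows how the two columns relate: the flag is the adjusted p-value compared against the chosen threshold.

```python
def bonferroni_adjust(p_values: list[float], alpha: float = 0.05) -> tuple[list[float], list[bool]]:
    """Return (adjusted p-values, significance flags) under Bonferroni correction."""
    n_tests = len(p_values)
    # Multiply each p-value by the number of tests, capped at 1.
    adjusted = [min(1.0, p * n_tests) for p in p_values]
    significant = [p_adj < alpha for p_adj in adjusted]
    return adjusted, significant


# Hypothetical raw p-values from three tests.
adjusted, significant = bonferroni_adjust([0.001, 0.02, 0.04])
for p_adj, flag in zip(adjusted, significant):
    print(f"{p_adj:.3f} {flag}")
```

Less conservative procedures, such as Benjamini-Hochberg, follow the same pattern of adjusting first and thresholding second.
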