Overstay2 scoring model for St. Boniface: Difference between revisions

From CCMDB Wiki
Jump to navigation Jump to search
JMojica (talk | contribs)
JMojica (talk | contribs)
 
Line 47: Line 47:


== Log ==
== Log ==
* 2025-10-28 - model 13 selected as final model with AUC=0.771 and HL goodness of fit =11.1321(prob=0.133)
* 2025-10-15 - model 13 selected as final model with AUC=0.771 and HL goodness of fit =11.1321 (prob=0.133)
* 2025-08-11 - found good fit model in the training data set (Nov2020 to Dec2024) as well as in the validation data set1 (Nov2020 to Dec2024) but not in the validation dataset2 (2025 data). When discussed with Dr Roberts and Tina, it was agreed to redefine the period to include Nov 2020 to July 2025 and divide into training set and one validation set and from these data sets find the good fit model. The results presented in Oct 15, 2025.
* 2025-07-18 - found good fit model but was agreed to re-do to include more parameters like Chronic Addiction, Acute OD Toxicity, geographic locations, etc as well as clarify the definitions of some parameters.


== Related articles ==  
== Related articles ==  

Latest revision as of 11:07, 18 November 2025

This page describes the Overstay2 scoring model for St. Boniface for the Project Overstay2, one of the site-based Overstay2 scoring models. The model and the Data definition for factor candidates for the Overstay2 project it uses were developed in Re-analysis and generation of Overstay2 model and is used to generate the Overstay2 colour, which in turn drives the Overstay2 processes on the units to reduce discharge delay.


Data elements and assigned coefficients

The binary logistic regression Y (Overstay GE 10 Days) which gave the AUC C-index =0.771 implying better predicted model and Hosmer and Lemeshow Goodness of fit test Chi-Square value of 11.1321 (prob=0.133) is :

LN(OS/1-OS) = Y

Y = Intercept Estimate + sum of all the rest of (the Parameter values multiplied by the Estimate) listed below:

Parameter Estimate Prob > ChiSq
Intercept -7.308524846 <.0001
Homeless 1.012402498 0.0087
Dementia 0.570758477 0.0081
MI 0.456135105 0.0067
Diabetes -0.527659475 0.0068
GCS_Motor 0.416028616 0.0323
ADL_bath 0.412497217 <.0001
ADL_dress -0.19051492 0.0034
ADL_continence 0.181464804 <.0001
ADL_Adlmean_nh -0.059389318 0.0013

Where OS is the likelihood to overstay 10 days and beyond.

How to calculate OS:

  • OS = exp(Y) / (1 + exp(Y)
  • The model gave a 0.028 optimum cut-off probability for OS GE 10 Days with sensitivity = 69.0%, the rate of identifying OS GE 10d out of the patients who actually have OS GE 10d and with specificity = 70.4 the rate of identifying OS LT 10d out of the patients who actually have OS LT 10d. This cut-off point of 0.028 will result to 29.6% rate of identifying OS GE 10d out of the patients who actually have OS LT 10d. Increasing the cut off predicted probability of (Overstay >= 10 days) to 0.039 resulted to sensitivity = 66.9%, thus reducing the rate of identifying OS GE 10d out of the patients who actually have OS LT 10d to 26.3%. Thus a predicted probability of (Overstay >= 10 days) of 0.039 is going to be used to classify patients into two groups; Overstay >= 10 days and Overstay < 10 days. See more discussion on Overstay2 colour.
  • The model has been validated using the data from the validation set of same period. The AUC C-index = 0.742, the Hosmer and Lemeshow Goodness of fit test Chi-Square value = 13.4047 (prob=0.099) and the optimum cut-off probability for OS = 0.028 with sensitivity = 67.1% and specificity = 71.6%.

Model files

  • files describing the model and records of discussion about it are at

S:\MED\MED_CCMED\Julie\MedProjects\Overstay_Project_2025\3_HSCSTB_15Oct2025\STB\STB_FinalModel

Log

  • 2025-10-15 - model 13 selected as final model with AUC=0.771 and HL goodness of fit =11.1321 (prob=0.133)
  • 2025-08-11 - found good fit model in the training data set (Nov2020 to Dec2024) as well as in the validation data set1 (Nov2020 to Dec2024) but not in the validation dataset2 (2025 data). When discussed with Dr Roberts and Tina, it was agreed to redefine the period to include Nov 2020 to July 2025 and divide into training set and one validation set and from these data sets find the good fit model. The results presented in Oct 15, 2025.
  • 2025-07-18 - found good fit model but was agreed to re-do to include more parameters like Chronic Addiction, Acute OD Toxicity, geographic locations, etc as well as clarify the definitions of some parameters.

Related articles

Related articles: