Re-analysis and generation of Overstay2 model: Difference between revisions
| Line 166: | Line 166: | ||
=== Decision on a model === | === Decision on a model === | ||
*For each site's training set and validation set, chi square test for independence between the variable OS (Overstay >= 10days and Overstay < 10d) and each factors listed [[Data definition for contributing factors for the Overstay2 project]] to identify the factors that may affect the overstay. | *For each site's training set and validation set, chi square test for independence between the variable OS (Overstay >= 10days and Overstay < 10d) and each factors listed [[Data definition for contributing factors for the Overstay2 project]] to identify the factors that may affect the overstay individually. | ||
*Methodology to find the '''best''' model involves | *Methodology to find the '''best''' model involves | ||
** Basic plan for selecting the variables for the model - | ** Basic plan for selecting the variables for the model - | ||