Re-analysis and generation of Overstay2 model: Difference between revisions
| Line 166: | Line 166: | ||
=== Decision on a model === | === Decision on a model === | ||
For each site's training set and validation set, chi square test for independence between the variable OS (Overstay >= 10days and Overstay < 10d) and each factors listed [[Data definition for contributing factors for the Overstay2 project]] to identify the factors that may affect the overstay. | *For each site's training set and validation set, chi square test for independence between the variable OS (Overstay >= 10days and Overstay < 10d) and each factors listed [[Data definition for contributing factors for the Overstay2 project]] to identify the factors that may affect the overstay. | ||
Methodology to find the '''best''' model involves | *Methodology to find the '''best''' model involves | ||
** Basic plan for selecting the variables for the model - | |||
{{DJ | | {{DJ | | ||
* the statistical tests that were done to evaluate the model | * the statistical tests that were done to evaluate the model | ||