Re-analysis and generation of Overstay2 model: Difference between revisions

JMojica (talk | contribs)


=== Decision on a model ===
*For each site's training set and validation set, a chi-square test for independence is run between the variable OS (overstay >= 10 days vs. overstay < 10 days) and each factor listed in [[Data definition for contributing factors for the Overstay2 project]], to identify the factors that may individually affect overstay.
*Methodology to find the '''best''' model involves  
** Basic plan for selecting the variables for the model -  
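
As a rough illustration of the chi-square screening described in the first bullet of this section, the sketch below cross-tabulates a binary OS flag against one factor and computes the Pearson chi-square statistic, its degrees of freedom, and a p-value. It is not the project's actual analysis code: the function names (<code>crosstab</code>, <code>chi_square_independence</code>, <code>chi2_sf</code>), the factor levels, and the counts are all hypothetical, and only the Python standard library is assumed.

```python
import math
from collections import Counter


def chi2_sf(x, df):
    """Upper-tail probability P(X >= x) of a chi-square variable, integer df >= 1."""
    if x <= 0:
        return 1.0
    z = x / 2.0
    # Closed-form starting points: df = 2 gives exp(-z), df = 1 gives erfc(sqrt(z)).
    if df % 2 == 0:
        a, q = 1.0, math.exp(-z)
    else:
        a, q = 0.5, math.erfc(math.sqrt(z))
    # Recurrence for the regularized upper incomplete gamma function:
    # Q(a + 1, z) = Q(a, z) + z**a * exp(-z) / Gamma(a + 1)
    while a < df / 2.0:
        q += z ** a * math.exp(-z) / math.gamma(a + 1.0)
        a += 1.0
    return q


def crosstab(pairs):
    """Build a contingency table (list of rows) from (os_flag, factor_level) pairs."""
    counts = Counter(pairs)
    rows = sorted({r for r, _ in counts})
    cols = sorted({c for _, c in counts})
    return [[counts[(r, c)] for c in cols] for r in rows]


def chi_square_independence(table):
    """Pearson chi-square test of independence; returns (statistic, df, p_value)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    df = (len(table) - 1) * (len(col_totals) - 1)
    if df < 1 or 0 in row_totals or 0 in col_totals:
        raise ValueError("need at least a 2x2 table with no empty row or column")
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    return stat, df, chi2_sf(stat, df)


# Hypothetical records for one site's training set: (OS >= 10 days?, factor level).
records = (
    [(True, "level_a")] * 10 + [(True, "level_b")] * 20
    + [(False, "level_a")] * 20 + [(False, "level_b")] * 10
)
stat, df, p_value = chi_square_independence(crosstab(records))
print(f"chi-square = {stat:.3f}, df = {df}, p = {p_value:.4f}")
```

In practice the same loop would be repeated for each factor and each site, and a small p-value would only flag a factor as a candidate for the model. The test looks at one factor at a time, so it says nothing about how the factors behave jointly.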