Changes to make Centralized data smaller: Difference between revisions

From CCMDB Wiki
(3 intermediate revisions by the same user not shown)
Line 1: Line 1:
As of 2022-08-10 [[Centralized data.mdb]] was 1.348GB . This is large and slowly approaching MS Access' 2GB limit. The size also makes it slow to copy down, and even slower to copy up. This page is about attempts to make the file smaller.  
As of 2022-08-10, [[Centralized data.mdb]] was 1.348 GB. This is large and slowly approaching MS Access's 2 GB limit. The size also made it slow to copy down, and even slower to copy up. We reduced the size by [[#Removing the L_Dxs table and L_Como table]] and [[#Reducing fields and field sizes in the Postal_Code_Master table]]. This initiative is now complete.


=== [[L_Dxs table]] and [[L_Como table]] ===
=== Removing the [[L_Dxs table]] and [[L_Como table]] ===
We can reduce the file size to 0.810GB by moving the legacy tables [[L_Dxs table]] and [[L_Como table]] to [[Legacy TMS Data.mdb]]. These tables are “stale” and no longer change, so we don’t really need to have them in [[Centralized data.mdb]].  
We can reduce the file size to 0.810GB by moving the legacy tables [[L_Dxs table]] and [[L_Como table]] to [[Legacy TMS Data.mdb]]. These tables are “stale” and no longer change, so we don’t really need to have them in [[Centralized data.mdb]].  
==== Next Steps ====
{{Todo
| who = Tina
| todo_added = 2022-08-10
| todo_action = 2022-08-10
| question = 
* After Pagasa has a local copy of the legacy file, remove the tables from the Centralized master.
}}


==== Log ====
==== Log ====
* 2022-08-31: Teams call with Pagasa, during which we removed the [[L_Dxs table]] and [[L_Como table]] from [[Centralized data.mdb]].
* 2022-08-10:  
* 2022-08-10:  
** emailed Pagasa to copy Legacy to local
** emailed Pagasa to copy Legacy to local
Line 21: Line 13:
** Julie copied [[Legacy_TMS_Data.mdb]] to local, where SAS will pull from (08Aug2022)
** Julie copied [[Legacy_TMS_Data.mdb]] to local, where SAS will pull from (08Aug2022)


=== [[Postal_Code_Master table]] ===
=== Reducing fields and field sizes in the [[Postal_Code_Master table]] ===
There is a [[Postal_Code_Master table]] in [[Centralized data.mdb]]. The table is likely in the correct file because Pagasa maintains it and Julie uses it.  
There is a [[Postal_Code_Master table]] in [[Centralized data.mdb]]. The table is likely in the correct Access file because Pagasa maintains it and Julie uses it.  
 
When extracted to its own file, the table is 89 MB.
 
Discussion with Julie established that she doesn't use the latitude and longitude fields, so we are eliminating them.
{{Todo
| who = Tina
| todo_added = 2022-08-11
| todo_action = 2022-08-11
| question =  _after
* reduce field sizes as much as possible
* do some quality control
* remove
** latitude and longitude
}}
 
{{DJ |
...
** yes I use the table  and particularly helpful for MB where the RHA are listed. ...  ---[[User:JMojica|JMojica]] 09:44, 2022 August 11 (CDT)
*** How about the CA, CA_Name, NC, NC_Name columns - do you use those? If you use the table we will just keep it in there, but I might still reduce the field sizes, they don't need to be 255. [[User:Ttenbergen|Ttenbergen]] 10:52, 2022 August 11 (CDT)
**** I think I have used it when asked where from winnipeg area patients come from but very rare. - --[[User:JMojica|JMojica]] 11:03, 2022 August 11 (CDT)
***** Do you use the code and name fields for them or would you be OK if we use the code fields?  }}
 
 
==== Next Steps ====
{{Todo
| who = Tina
| todo_added = 2022-08-10
| todo_action = 2022-08-10
| question =  _after
* Waiting for info from Julie:
** Can we move the file out? [[User:Ttenbergen|Ttenbergen]] 16:38, 2022 August 10 (CDT)
** Can we reduce field sizes? [[User:Ttenbergen|Ttenbergen]] 16:38, 2022 August 10 (CDT)
}}


==== Log ====
==== Log ====
* Removed the "code" fields that duplicate "name" field data and reduced field sizes as much as possible, which shrank the table from ~89 MB to ~70 MB.


== Related articles ==  
== Related articles ==  

Latest revision as of 15:19, 2022 September 1

As of 2022-08-10, Centralized data.mdb was 1.348 GB. This is large and slowly approaching MS Access's 2 GB limit. The size also made it slow to copy down, and even slower to copy up. We reduced the size by #Removing the L_Dxs table and L_Como table and #Reducing fields and field sizes in the Postal_Code_Master table. This initiative is now complete.

Removing the L_Dxs table and L_Como table

Moving the legacy tables L_Dxs table and L_Como table to Legacy TMS Data.mdb reduces the file size to 0.810 GB. These tables are “stale” and no longer change, so they do not need to be in Centralized data.mdb.
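
To put these figures in context, the following is a minimal sketch of the arithmetic behind the size reduction, using only the numbers quoted on this page (1.348 GB before, 0.810 GB after, and the 2 GB MS Access limit). The function name `summarize` and the dictionary keys are hypothetical, chosen for illustration, and the sketch assumes all three figures use the same definition of GB.

```python
# Assumption: all sizes are in the same GB unit as reported on this page.
ACCESS_LIMIT_GB = 2.0  # MS Access file size limit quoted above


def summarize(before_gb, after_gb, limit_gb=ACCESS_LIMIT_GB):
    """Return the space saved and how close the file is to the limit."""
    saved = before_gb - after_gb
    return {
        "saved_gb": round(saved, 3),
        "percent_saved": round(100 * saved / before_gb, 1),
        "percent_of_limit_before": round(100 * before_gb / limit_gb, 1),
        "percent_of_limit_after": round(100 * after_gb / limit_gb, 1),
    }


# Figures from this page: 1.348 GB before, 0.810 GB after moving the tables.
result = summarize(1.348, 0.810)
for key, value in result.items():
    print(f"{key}: {value}")
```

With these inputs, moving the two legacy tables saves about 0.538 GB, roughly 40% of the original file, and takes the file from about two thirds of the limit to about two fifths of it.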

Log

Reducing fields and field sizes in the Postal_Code_Master table

There is a Postal_Code_Master table in Centralized data.mdb. The table is likely in the correct Access file because Pagasa maintains it and Julie uses it.

Log

  • Removed the "code" fields that duplicate "name" field data and reduced field sizes as much as possible, which shrank the table from ~89 MB to ~70 MB.

Related articles
