Requirements for Re-Platforming: Difference between revisions

From CCMDB Wiki
Jump to navigation Jump to search
Relatively Hard Facts: copied over Allan's draft
 
(14 intermediate revisions by 4 users not shown)
Line 9: Line 9:


== Relatively Hard Facts ==
== Relatively Hard Facts ==
* 15-20 users on laptops, sometimes working from home, sometimes not connected to the network due to lack if [[wifi]]
* 15-20 users on laptops, sometimes working from home, sometimes not connected to the network due to lack of [[wifi]]
* About 2-2.5GB of data altogether as stored in various MS Access DBs (size may vary on other platforms)
* About 2-2.5GB of data altogether as stored in various MS Access DBs (size may vary on other platforms)
* Has several highly customized front-ends that facilitates efficient and low-error data entry
* Has several highly customized front-ends that facilitate efficient and low-error data entry and processing
** facilitates data entry from a daily dump received from ADT
** facilitates data entry from a daily dump received from ADT (and other intermittent dumps)
* Data we need to store is in [[Auto Data Dictionary]]
* Data we store is in [[Auto Data Dictionary]]
** it is currently stored in [[CCMDB Data Structure]] - this structure could be stored differently but would cause large changes
** it is currently stored in [[CCMDB Data Structure]] - this structure could be stored differently but would cause large changes
* We have (and continuously improve) [[Data Integrity Checks]]
* We have (and continuously improve) [[Data Integrity Checks]]
* Number of fields not necessarily relevant because of [[Entity–attribute–value model of the L Tmp V2 table]]
* Implemented as a system of intermittently linked MS Access databases with a fair bit of batch file and other automation facilitating their use and maintenance


== Requirements ==
== Requirements ==
''as also discussed in [[UM MedIT Re-platforming Meetings]], with decisions made tracked there and the "current master" located here. ''
''as also discussed in [[UM MedIT Re-platforming Meetings]], with decisions made tracked there and the "current master" located here. ''


Here is that draft, including for each item, a preliminary notation of its priority, on a scale of 1=lowest priority to 5=highest priority.  After the draft is completed, we will need to complete the MedIT Project Intake Process form, and then Kiran and her team will consider the best options for our needs.
Here is a draft, including for each item, a preliminary notation of its priority, on a scale of 1=lowest priority to 5=highest priority.  After the draft is completed, we will need to complete the MedIT Project Intake Process form, and then Kiran and her team will consider the best options for our needs.


*Maintaining, without change (or with minimal change), the current format/structure of the data input by the data collectors on the laptops (4)
'''This defines the functionality needed; the tools are up for discussion unless noted. '''
*Allowing us to upload data collected on patients from the laptops to the server (5) -- although this could possible occur via some intermediary program
*Allowing us to modify individual data items in individual records (5)
*Allowing us to modify multiple (and possible ALL) records in a "batch" (5)
*Allowing Julie, Pagasa and Tina to bi-directionally transfer the data (5) <-- ''this item could use some clarification from the 3 of them''
*Allowing Julie to interface with the data via SAS (5) -- although this could possible occur via some intermediary program
*To the maximum extent possible, maintaining an interface with the data that looks like, and has functionalities of, our current MS ACCESS interface (4)
*Ability, in future, to expand the capabilities of the databases by linking the data to other data obtained automatically -- e.g. Canadian Blood Services data about blood transfusions (4)


== Current Implementation ==
=== Data collection ===
* system of intermittently linked MS Access databases with a fair bit of automation  
* This would replace our current [[CCMDB.accdb]] Access front-end and would need to:
* facilitated interfaces with Shared Health systems (we request and receive data in files that we import as either temporary or permanent data)
** Provide facilitated, partly automated input of admissions from the daily Shared Health data export (5)
* number of fields not necessarily relevant because of [[Entity–attribute–value model of the L Tmp V2 table]]
** Maintain a user interface that has the general look and feel of the current one (3)
** Allow modification of individual data items in individual records (5)
** Allow modification of multiple records via queries, programming and/or automation (5)
** Needs to work with poor or non-existent [[wifi]] (5)
 
=== Data control and possibly transfer ===
* We have a "[[sending]]" process which currently includes both the movement of data from the locally-installed database to the central one, and the setting of the [[RecordStatus]] field that encodes whether the collector maintains "control" of the record or that control has been handed off to [[#Data processing]]; the collector maintains control of "incomplete" records until they "complete" the record, which triggers some mandatory final cross checks that will prevent completion unless passed.
* A new platform would need to provide this functionality:
** Make incomplete and complete data available to the [[#Data processing]] and [[#Data analysis]] stages(5)
** Allow data collectors to add and update the data they are working on -- up until they mark a record as "complete" (5)
** No longer allow them to update or view the record once it is set to "complete", or to the later stages it is set to during [[#Data processing]](5)
** If collection happens in a separate database, move and synch updated data from the [[#Data collection]] tool(5)
 
=== Data processing ===
* This would replace our current [[CFE]] Access front-end and would need to
** Maintain the current [[Data Processing]] functionality for the [[data processor]], who often works remotely (5)
** Maintain ability to run the integrity checks performed during [[Centralized data Vetting Process]], leading to the [[RecordStatus]] field being set to "vetted" if passed (5)
** Maintain the ability to browse, search, sort, filter and update (add, update, delete) the data interactively, including data validation (5)
** Allow modification of individual data items in individual records (5)
** Allow modification of multiple records via queries, programming and/or automation (5)
** Maintain a user interface that has the general look and feel of the current one (3)
 
=== Data analysis ===
* Allowing our database personnel to transfer data into and out of the database, such as an export to file or import from file, ad hoc (5)
* Retain the ability to analyze the data using other tools, including but not limited to SAS (5)
 
=== Ongoing Improvements ===
* The ability to do the following:
** add / remove / change fields and tables in the data structure (5)
** update the user interfaces to incorporate these data changes (5)
** add / remove / change data validation and cross checks (5)
* For our database personnel to do these changes independently following a reasonable change management process (4)
 
=== Miscellaneous items ===
* Maintaining the back end data format/structure as much as possible (3)
* Maintaining our various "Created_*" queries/generated data functionality that provides APACHE score and individual element score, Charlson Score, etc
* Ability  to built the ETL to manage these updates, rather than rely on other teams (4)
* Ability, in future, to expand the capabilities of the databases by linking the data to other data obtained automatically -- e.g. Canadian Blood Services data about blood transfusions (4)
 
* Ability for our team to do the update and maintenance on data structure and interface (4)


== Related articles ==  
== Related articles ==  

Latest revision as of 15:27, 8 April 2025

This is an index for content relevant to the U of M IT team in proposing a solution for hosting our database.

Repeated failure points we should address before going too far

We have had many failed attempts at re-platforming. The technical change would be tedious but doable, the sticking points have always been around the following, so we should discuss these before planning too much further.

  • governance of any implemented system
  • data ownership
  • support model
  • our team's continued ability to change this as needed

Relatively Hard Facts

  • 15-20 users on laptops, sometimes working from home, sometimes not connected to the network due to lack of wifi
  • About 2-2.5GB of data altogether as stored in various MS Access DBs (size may vary on other platforms)
  • Has several highly customized front-ends that facilitate efficient and low-error data entry and processing
    • facilitates data entry from a daily dump received from ADT (and other intermittent dumps)
  • Data we store is in Auto Data Dictionary
    • it is currently stored in CCMDB Data Structure - this structure could be stored differently but would cause large changes
  • We have (and continuously improve) Data Integrity Checks
  • Number of fields not necessarily relevant because of Entity–attribute–value model of the L Tmp V2 table
  • Implemented as a system of intermittently linked MS Access databases with a fair bit of batch file and other automation facilitating their use and maintenance

Requirements

as also discussed in UM MedIT Re-platforming Meetings, with decisions made tracked there and the "current master" located here.

Here is a draft, including for each item, a preliminary notation of its priority, on a scale of 1=lowest priority to 5=highest priority. After the draft is completed, we will need to complete the MedIT Project Intake Process form, and then Kiran and her team will consider the best options for our needs.

This defines the functionality needed; the tools are up for discussion unless noted.

Data collection

  • This would replace our current CCMDB.accdb Access front-end and would need to:
    • Provide facilitated, partly automated input of admissions from the daily Shared Health data export (5)
    • Maintain a user interface that has the general look and feel of the current one (3)
    • Allow modification of individual data items in individual records (5)
    • Allow modification of multiple records via queries, programming and/or automation (5)
    • Needs to work with poor or non-existent wifi (5)

Data control and possibly transfer

  • We have a "sending" process which currently includes both the movement of data from the locally-installed database to the central one, and the setting of the RecordStatus field that encodes whether the collector maintains "control" of the record or that control has been handed off to #Data processing; the collector maintains control of "incomplete" records until they "complete" the record, which triggers some mandatory final cross checks that will prevent completion unless passed.
  • A new platform would need to provide this functionality:
    • Make incomplete and complete data available to the #Data processing and #Data analysis stages(5)
    • Allow data collectors to add and update the data they are working on -- up until they mark a record as "complete" (5)
    • No longer allow them to update or view the record once it is set to "complete", or to the later stages it is set to during #Data processing(5)
    • If collection happens in a separate database, move and synch updated data from the #Data collection tool(5)

Data processing

  • This would replace our current CFE Access front-end and would need to
    • Maintain the current Data Processing functionality for the data processor, who often works remotely (5)
    • Maintain ability to run the integrity checks performed during Centralized data Vetting Process, leading to the RecordStatus field being set to "vetted" if passed (5)
    • Maintain the ability to browse, search, sort, filter and update (add, update, delete) the data interactively, including data validation (5)
    • Allow modification of individual data items in individual records (5)
    • Allow modification of multiple records via queries, programming and/or automation (5)
    • Maintain a user interface that has the general look and feel of the current one (3)

Data analysis

  • Allowing our database personnel to transfer data into and out of the database, such as an export to file or import from file, ad hoc (5)
  • Retain the ability to analyze the data using other tools, including but not limited to SAS (5)

Ongoing Improvements

  • The ability to do the following:
    • add / remove / change fields and tables in the data structure (5)
    • update the user interfaces to incorporate these data changes (5)
    • add / remove / change data validation and cross checks (5)
  • For our database personnel to do these changes independently following a reasonable change management process (4)

Miscellaneous items

  • Maintaining the back end data format/structure as much as possible (3)
  • Maintaining our various "Created_*" queries/generated data functionality that provides APACHE score and individual element score, Charlson Score, etc
  • Ability to built the ETL to manage these updates, rather than rely on other teams (4)
  • Ability, in future, to expand the capabilities of the databases by linking the data to other data obtained automatically -- e.g. Canadian Blood Services data about blood transfusions (4)
  • Ability for our team to do the update and maintenance on data structure and interface (4)

Related articles

Related articles: