TY - JOUR
T1 - Using the PARAFAC2 tensor factorization on EHR audit data to understand PCP desktop work
AU - Perros, Ioakeim
AU - Yan, Xiaowei
AU - Jones, J. B.
AU - Sun, Jimeng
AU - Stewart, Walter F.
N1 - Funding Information:
This work was supported by the National Science Foundation , award IIS-1418511, IIS-1838042 and CCF-1533768, the National Institute of Health award 1R01MD011682-01, R01-HL116832 and R56HL138415. Appendix A
Funding Information:
This work was supported by the National Science Foundation, award IIS-1418511, IIS-1838042 and CCF-1533768, the National Institute of Health award 1R01MD011682-01, R01-HL116832 and R56HL138415.
Publisher Copyright:
© 2019 Elsevier Inc.
PY - 2020/1
Y1 - 2020/1
N2 - Background: Activity or audit log data are required for EHR privacy and security management but may also be useful for understanding desktop workflow. Objective: We determined if the EHR audit log file, a rich source of complex time-stamped data on desktop activities, could be processed to derive primary care provider (PCP) level workflow measures. Methods: We analyzed audit log data on 876 PCPs across 17,455 ambulatory care encounters that generated 578,394 time-stamped records. Each individual record represents a user interaction (e.g., point and click) that reflects all or part of a specific activity (e.g., order entry access). No dictionary exists to define how to combine clusters of sequential audit log records to represent identifiable PCP tasks. We determined if PARAFAC2 tensor factorization could: (1) learn to identify audit log record clusters that specifically represent defined PCP tasks; and (2) identify variation in how tasks are completed without the need for ground-truth labels. To interpret the result, we used the following PARAFAC2 factors: a matrix representing the task definitions and a matrix containing the frequency measure of each task for each encounter. Results: PARAFAC2 automatically identified 4 clusters of audit log records that represent 4 common clinical encounter tasks: (1) medications’ access, (2) notes’ access, (3) order entry access, and (4) diagnosis modification. PARAFAC2 also identified the most common variants in how PCPs accomplish these tasks. It discovered variation in how the notes’ access task was done, including identification of 9 distinct variants of notes access that explained 77% of the input data variation for notes. The discovered variants mapped to two known workflows for notes’ access and to two distinct PCP user groups who accessed notes by either using the Visit Navigator or the Wrap-Up option. Conclusions: Our results demonstrate that EHR audit log data can be rapidly processed to create higher-level constructed features that represent time-stamped PCP tasks.
AB - Background: Activity or audit log data are required for EHR privacy and security management but may also be useful for understanding desktop workflow. Objective: We determined if the EHR audit log file, a rich source of complex time-stamped data on desktop activities, could be processed to derive primary care provider (PCP) level workflow measures. Methods: We analyzed audit log data on 876 PCPs across 17,455 ambulatory care encounters that generated 578,394 time-stamped records. Each individual record represents a user interaction (e.g., point and click) that reflects all or part of a specific activity (e.g., order entry access). No dictionary exists to define how to combine clusters of sequential audit log records to represent identifiable PCP tasks. We determined if PARAFAC2 tensor factorization could: (1) learn to identify audit log record clusters that specifically represent defined PCP tasks; and (2) identify variation in how tasks are completed without the need for ground-truth labels. To interpret the result, we used the following PARAFAC2 factors: a matrix representing the task definitions and a matrix containing the frequency measure of each task for each encounter. Results: PARAFAC2 automatically identified 4 clusters of audit log records that represent 4 common clinical encounter tasks: (1) medications’ access, (2) notes’ access, (3) order entry access, and (4) diagnosis modification. PARAFAC2 also identified the most common variants in how PCPs accomplish these tasks. It discovered variation in how the notes’ access task was done, including identification of 9 distinct variants of notes access that explained 77% of the input data variation for notes. The discovered variants mapped to two known workflows for notes’ access and to two distinct PCP user groups who accessed notes by either using the Visit Navigator or the Wrap-Up option. Conclusions: Our results demonstrate that EHR audit log data can be rapidly processed to create higher-level constructed features that represent time-stamped PCP tasks.
KW - Electronic health records
KW - Workflow analysis
UR - http://www.scopus.com/inward/record.url?scp=85076093220&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85076093220&partnerID=8YFLogxK
U2 - 10.1016/j.jbi.2019.103312
DO - 10.1016/j.jbi.2019.103312
M3 - Article
C2 - 31627022
AN - SCOPUS:85076093220
SN - 1532-0464
VL - 101
JO - Journal of Biomedical Informatics
JF - Journal of Biomedical Informatics
M1 - 103312
ER -