The widespread adoption of electronic medical records (EMRs) in healthcare has given statistical machine learning researchers vast amounts of new data for modeling and predicting patient health status, potentially enabling novel advances in treatment. However, significant barriers must be overcome to extract these insights from EMR data. First, EMR datasets consist of both static and dynamic observations of discrete and continuous-valued variables, many of which may be missing, precluding the application of standard multivariate analysis techniques. Second, clinical populations observed via EMRs and relevant to the study and management of debilitating conditions like sepsis are often heterogeneous; properly accounting for this heterogeneity is critical. Here, we describe a joint probabilistic framework called a composite mixture model that can simultaneously accommodate the wide variety of observation types commonly found in EMR datasets, stratify heterogeneous clinical populations into relevant subgroups, and handle missing observations. We demonstrate the efficacy of our approach by applying our framework to a large-scale sepsis cohort, identifying physiological trends and distinct subgroups of the dataset associated with elevated risk of mortality during hospitalization.
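To make the idea of a mixture over mixed-type, partially missing records concrete, here is a minimal sketch in Python. It is not the paper's implementation. It assumes that fields are conditionally independent given the subgroup and that missing values can simply be marginalized out by skipping their likelihood term; the abstract states neither assumption. The field names `lactate` and `ventilated`, the function name, and all parameter values are hypothetical.

```python
import math

def responsibilities(row, weights, components):
    """Posterior subgroup probabilities for one record with mixed-type fields.

    Assumptions (not from the source): fields are conditionally independent
    given the subgroup, and a missing field contributes no likelihood term.

    row: dict with "lactate" (float or None) and "ventilated" (0/1 or None);
         None marks a missing value. Field names are hypothetical.
    weights: list of mixing weights, one per subgroup.
    components: list of dicts with "mean" and "std" (Gaussian for "lactate")
                and "p" (Bernoulli probability for "ventilated").
    """
    log_post = []
    for w, c in zip(weights, components):
        lp = math.log(w)
        x = row.get("lactate")
        if x is not None:  # Gaussian log-density, skipped when missing
            z = (x - c["mean"]) / c["std"]
            lp += -0.5 * z * z - math.log(c["std"]) - 0.5 * math.log(2 * math.pi)
        b = row.get("ventilated")
        if b is not None:  # Bernoulli log-probability, skipped when missing
            lp += math.log(c["p"] if b == 1 else 1.0 - c["p"])
        log_post.append(lp)
    m = max(log_post)  # subtract the max before exponentiating for stability
    unnorm = [math.exp(lp - m) for lp in log_post]
    total = sum(unnorm)
    return [u / total for u in unnorm]

# Illustrative (made-up) parameters for two subgroups.
weights = [0.3, 0.7]
components = [
    {"mean": 1.0, "std": 0.5, "p": 0.1},
    {"mean": 4.0, "std": 1.0, "p": 0.6},
]
fully_missing = responsibilities({"lactate": None, "ventilated": None}, weights, components)
partly_observed = responsibilities({"lactate": 1.0, "ventilated": None}, weights, components)
```

With every field missing, the posterior falls back to the mixing weights; each observed field then shifts probability toward the subgroups that explain it best. A full fit would alternate this step with parameter updates, as in expectation-maximization.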

Source: Flexible Analysis of Electronic Medical Record Data with Composite Mixture Models | bioRxiv
