• Stars
    star
    245
  • Rank 165,304 (Top 4 %)
  • Language
    R
  • License
    Other
  • Created over 8 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

R tools for healthcare machine learning

healthcareai

Travis-CI Build Status codecov badge CRAN_Status_Badge CRAN downloads badge License: MIT DOI

Overview

The aim of healthcareai is to make machine learning in healthcare as easy as possible. It does that by providing functions to:

  • Develop customized, reliable, high-performance machine learning models with minimal code
  • Easily make and evaluate predictions and push them to a database
  • Understand how a model makes its predictions
  • Make data cleaning, manipulation, imputation, and visualization as simple as possible

Usage

healthcareai can take you from messy data to an optimized model in one line of code:

models <- machine_learn(pima_diabetes, patient_id, outcome = diabetes)
models
# > Algorithms Trained: Random Forest, eXtreme Gradient Boosting, and glmnet
# > Model Name: diabetes
# > Target: diabetes
# > Class: Classification
# > Performance Metric: AUROC
# > Number of Observations: 768
# > Number of Features: 12
# > Models Trained: 2018-09-01 18:19:44 
# > 
# > Models tuned via 5-fold cross validation over 10 combinations of hyperparameter values.
# > Best model: Random Forest
# > AUPR = 0.71, AUROC = 0.84
# > Optimal hyperparameter values:
# >   mtry = 2
# >   splitrule = extratrees
# >   min.node.size = 12

Make predictions and examine predictive performance:

predictions <- predict(models, outcome_groups = TRUE)
plot(predictions)

Learn More

For details on what’s happening under the hood and for options to customize data preparation and model training, see Getting Started with healthcareai as well as the helpfiles for individual functions such as ?machine_learn, ?predict.model_list, and ?explore.

Documentation of all functions as well as vignettes on various uses of the package are available at the package website: https://docs.healthcare.ai/.

Also, be sure to read our blog and watch our broadcasts to learn more about what’s new in healthcare machine learning and how we are using this toolkit to put machine learning to work in real healthcare systems.

Get Involved

We have a Slack community that is a great place to introduce yourself, share what you’re doing with the package, ask questions, and troubleshoot your code.

Contributing

If you are interested in contributing the package (great!), please read the contributing guide, and look for issues with the “help wanted” tag. Feel free to tackle any issue that interests you; those are a few issues that we feel would make a good place to start.

Feedback

Your feedback is hugely appreciated. It is makes the package work well and helps us make it more useful to the community. Both feature requests and bug reports should be submitted as Github issues.

Bug reports should be filed with a minimal reproducable example. The reprex package is extraordinarily helpful for this. Please also include the output of sessionInfo() or better yet, devtools::session_info().

Legacy

Version 1 of healthcareai has been retired. You can continue to use it, but its compatibility with changes in the R ecosystem are not guaranteed. You should always be able to install it from github with: install.packages("remotes"); remotes::install_github("HealthCatalyst/[email protected]").

For an example of how to adapt v1 models to the v2 API, check out the Transitioning vignettes.

More Repositories

1

healthcareai-py

Python tools for healthcare machine learning
Python
314
star
2

Fabric.Cashmere

Health Catalyst’s comprehensive design system.
TypeScript
66
star
3

Fabric.Authorization

Permissions service for applications
C#
21
star
4

documentation

Content for healthcare.ai, old posts, some hosted notebooks
HTML
14
star
5

Fabric.Identity

Identity service to provide authentication
C#
12
star
6

hcposh

HCPosh is a Powershell module that provides some useful functions and tools when working with data in the Health Catalyst Analytics Platform. Key Features include 1) Split SAM Designer hcx files into smaller files for source control using it's built-in column-level SQL Parser, developed using the Microsoft.SqlServer.TransactSql.ScriptDom library. 2) Generate a React web application for documentation that contains ERD and Data Flow Diagrams for a professional look and presentation of a subject area mart 3) Integration of Graphviz software for ERD and Data flow diagram generation (pdf, png, and svg)
PowerShell
11
star
7

PythonPowershellUtilities

The only powershell module you should ever need.
PowerShell
9
star
8

Fabric.Realtime

Provides a real-time messaging service where the client can subscribe to a queue to receive HL7 messages
C#
7
star
9

Fabric.Databus

Pipeline to convert SQL into JSON and send to ElasticSearch or other REST Api
C#
5
star
10

InstallScripts

Shell
5
star
11

Catalyst.SqlUtilities

SQL Parsers and such
C#
4
star
12

SSIS

Supporting SSIS project containing extensibile packages for R and Python.
3
star
13

Fabric.Realtime.RabbitMq

RabbitMq with configurations needed for Fabric.Realtime
Shell
3
star
14

Fabric.Terminology

Service to provide shared healthcare terminology data
C#
3
star
15

Fabric.Docker.InterfaceEngine

Docker for interface engine to use for realtime
Shell
3
star
16

dos.powershell

powershell functions to control DOS
PowerShell
2
star
17

react-cashmere

React version of @HealthCatalyst/Fiber.Cashmere styles applied to @mui/material components.
TypeScript
2
star
18

Fabric.Docker.NGINX-Kerberos

Docker container for running NGINX as a reverse proxy with Kerberos Authentication
Shell
2
star
19

DosInstallUtilities.Kube

PowerShell
1
star
20

Fabric.Realtime.Tester

C#
1
star
21

Fabric.EHR

REST service for rendering the Fabric Pane inside an EHR
JavaScript
1
star
22

Fabric.FHIR

FHIR REST service
C#
1
star