• Stars: 2
• Language: Jupyter Notebook
• Created over 5 years ago
• Updated over 5 years ago

Repository Details

Analyzing Jigsaw's toxic comments Kaggle challenge using fastai + pytorch

More Repositories

1. coral-ordinal
   Tensorflow Keras implementation of ordinal regression using consistent rank logits (CORAL) by Cao et al. (2019)
   Python, 77 stars

2. varimpact
   Variable importance through targeted causal inference, with Alan Hubbard
   R, 57 stars

3. superlearner-guide
   SuperLearner guide: fitting models, ensembling, prediction, hyperparameters, parallelization, timing, feature selection, etc.
   HTML, 36 stars

4. ck37r
   R functions for project setup, data cleaning, machine learning, SuperLearner, parallelization, and targeted learning.
   R, 18 stars

5. atlantic-causal-2017
   Targeted Learning entry in the Atlantic Causal Inference Conference's 2017 competition
   R, 12 stars

6. Predictive-Modeling-in-R
   Workshop (2-6 hours): cleaning, missing value imputation, EDA, ensemble learning, calibration, variable importance ranking, accumulated local effect plots. WIP.
   R, 12 stars

7. featurerank
   Ensemble feature ranking for SuperLearner variable selection
   R, 9 stars

8. randomize_ado
   Stata module for random assignment, including blocking, balance checking, and automated rerandomization.
   Stata, 8 stars

9. hpc-savio-xsede
   Multicore and multi-node parallel R computation via SLURM on the Savio cluster at UC Berkeley, plus XSEDE
   Shell, 8 stars

10. garden-iot
    Automated garden system for my office
    Python, 7 stars

11. htestimate
    Horvitz-Thompson estimator for RCTs, with Joel Middleton
    R, 6 stars

12. ppmi-challenge-2016
    Parkinson's Progression Marker Initiative data science challenge, 2016
    R, 4 stars

13. kp-dsc-2018
    Kaiser Permanente Data Science Competition 2018
    R, 3 stars

14. intrees-stel
    Customization of Houtao Deng's inTrees-STEL analysis
    R, 3 stars

15. clinsent
    Estimate sentiment in clinical notes via keywords or deep learning models
    Python, 3 stars

16. google_geocoder.py
    Geocodes a text file using the Google Maps API.
    Python, 3 stars

17. concise-r
    A concise introduction to R programming and data analysis
    HTML, 3 stars

18. gmail_add_emails.py
    Add a file of emails into a gmail contact list.
    Python, 2 stars

19. mimic-clinical-sentiment
    Manuscript under review
    Jupyter Notebook, 2 stars

20. vip-polling-place
    Determine the polling place for records listed in a text file, using Google's Voter Information Project API.
    Python, 1 star

21. the-matrix-154
    Text analytics project for Stat 154 Statistical Learning at UC Berkeley, Fall 2015.
    R, 1 star

22. rcteval_ado
    Stata module to evaluate randomized controlled trials using a systematic methodology.
    Stata, 1 star

23. tlmixture
    Data-adaptive creation of exposure (treatment) mixtures using targeted learning
    R, 1 star