• Stars
    star
    287
  • Rank 144,232 (Top 3 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created about 8 years ago
  • Updated about 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Code from my experiments on Numerai

Numerai Experiments

Folder structure:

  • ensemble.py - combines multiple predictions using geometric mean
  • fit_tsne.py - uses this t-SNE implementation for 2D embedding (does not work in 3D)
  • search_params.py - uses RandomSearchCV for hyperparameter search
  • tpot_test.py - runs tpot over the data
  • tpot_pipeline.py - best tpot model
  • notebooks/ - contains Jupyter notebooks
  • bh_tsne/ - is the original C++ t-SNE implementation with scripts for converting the csvs to the format the binary expects
  • models/ - various model implementations
    • adverarial/ - generative adversarial model that saves the learned features for each sample
    • autoencoder/ - simple autoencoder with regular and denoising variants (also saves learned features)
    • classifier/ - simple neural network classifier
    • pairwise/ - pairwise model implementation described in the blog post
    • pipeline/ - various scikit-learn models
      • estimators.py - custom wrappers around KernelPCA and Isomap that fit on a small portion of the training samples to avoid memory errors
      • transformers.py - contains ItemSelector which allows for selecting data by a key when building pipelines (source)
      • fm.py - factorization machines
      • lr.py - logistic regression with t-SNE features
      • pairwise.py - sklearn variant of the pairwise model
      • simple.py - simple logistic regression with polynomial features