• Stars
    star
    112
  • Rank 312,240 (Top 7 %)
  • Language
    TeX
  • Created about 7 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

fastdeepnets

The progress of the project can be followed in the in project journal(Chronological order).

Tools

Regression CLI

We provide a CLI to train regression models for people who do not want to spend the time building their own models and data loaders. The training set is expected to be a CSV file with i + o columns where i is the number of inputs and o is the number of outputs.

Example:

python tools/fit.py tools/example_regression_data.csv /tmp/model_test.data --layers 2 --max_neurons 1000 --input_features 32 --output_features 1

In order for it to work make sure the dynnet package is available, either install the package or add the root of this repository in your PYTHONPATH.

Goal

The goal of this project is to try to "learn" the number of hidden units (neurons for fully connected networks and channels for CNNs), directly during training. The main motivation is to reduce the size of set of potential hyper-parameters and avoid overfitting by killing neurons while the network is trained.

Context

This project is part of my (Guillaume Leclerc) Master thesis

More Repositories

1

treeline

An update-in-place key-value store for modern storage.
C++
130
star
2

deneva

Deneva is a distributed in-memory database framework that supports the evaluation of various concurrency control algorithms.
C++
110
star
3

aurum-datadiscovery

Python
74
star
4

palimpzest

A Declarative System for Optimizing AI Workloads
Python
48
star
5

asciiclass

Notes and Labs for Advanced Topics in Data Processing
JavaScript
38
star
6

ml-class-iap2017

20
star
7

AdaptDB

Java
16
star
8

amoeba

Java
16
star
9

bigdata

MIT Big Data Challenge
JavaScript
14
star
10

twitinfo

A timeline-based visualization of events as they are discussed on Twitter
Python
14
star
11

lazo

Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method
Java
13
star
12

imputedb

A database with automatic dynamic imputation of missing values.
Java
10
star
13

datascienceclass

Software Systems for Data Science Main Repo
Jupyter Notebook
9
star
14

genbase

Code for GenBase: complex analytics based genomics benchmark
R
6
star
15

XSystem

XSystem: Extracting Syntactical Patterns from Databases
Scala
6
star
16

modeldb-notebooks

Scala
5
star
17

ycsbr

Customizable synthetic workload generator and runner.
C++
4
star
18

brad

A virtualization layer for cloud data infrastructures.
Python
4
star
19

iap-class

Resources for the "Programming with Data" IAP class.
Jupyter Notebook
3
star
20

forecache-code

ForeCache codebase.
JavaScript
3
star
21

wifivis

Visualize some mit wifi access point data
Python
3
star
22

logos

Human-in-the-Loop Causal Analysis of Log Files
Python
3
star
23

learnedsystems-www

JavaScript
2
star
24

confo

Python
2
star
25

purk

hit layer infrastructure
1
star