  • Stars: 1,176
  • Rank: 39,761 (Top 0.8 %)
  • Language: Python
  • License: MIT License
  • Created almost 7 years ago
  • Updated 5 months ago

Repository Details

Small-scale machine learning projects to understand the core concepts. Give a star 🌟 if it helps you. BONUS: Interview Bank coming up!

Machine-Learning-with-Python

Small-scale machine learning projects to understand the core concepts (order: oldest to newest)

  • Topic Modelling using Latent Dirichlet Allocation on the 20 Newsgroups dataset, implemented with Python and Scikit-Learn
  • Implemented a simple neural network built with Keras on MNIST dataset
  • Stock Price Forecasting on Google using Linear Regression
  • Implemented a simple social network to learn the basics of Python
  • Implemented a Naive Bayes Classifier to filter spam messages on the SpamAssassin Public Corpus
  • Churn Prediction Model for banking dataset using Keras and Scikit-Learn
  • Implemented Random Forest from scratch and built a classifier on Sonar dataset from UCI repository
  • Simple Linear Regression in Python on sample dataset
  • Multiple Regression in Python on sample dataset
  • PCA and scaling sample stock data in Python [working_with_data]
  • Decision Trees in Python on sample dataset
  • Logistic Regression in Python on sample dataset
  • Built a neural network in Python to defeat a captcha system
  • Helper methods for common operations used in Statistics, Probability, Linear Algebra and Data Analysis
  • K-means clustering with example data; clustering colors with k-means; Bottom-up Hierarchical Clustering
  • Generating Word Clouds
  • Sentence generation using n-grams
  • Sentence generation using Grammars and Automata Theory; Gibbs Sampling
  • Topic Modelling using Latent Dirichlet Allocation (LDA)
  • Wrapper for using Scikit-Learn's GridSearchCV for a Keras Neural Network
  • Recommender system using cosine similarity, recommending new interests to users as well as matching users as per common interests
  • Implementing different methods for network analysis such as PageRank, Betweenness Centrality, Closeness Centrality, Eigenvector Centrality
  • Implementing methods used for Hypothesis Inference such as P-hacking, A/B Testing, Bayesian Inference
  • Implemented K-nearest neighbors for the next presidential election, predicting voting behavior based on nearest neighbors
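
To make one of the listed ideas concrete, here is a minimal sketch of recommending interests with cosine similarity. It is not the repository's code: the user names, the interest data, and the function names (`interest_vector`, `most_similar_users`, `suggest_interests`) are all hypothetical, chosen only to illustrate the technique on toy data.

```python
import math
from collections import defaultdict


def cosine_similarity(v, w):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(v, w))
    norm_v = math.sqrt(sum(a * a for a in v))
    norm_w = math.sqrt(sum(b * b for b in w))
    if norm_v == 0 or norm_w == 0:
        return 0.0
    return dot / (norm_v * norm_w)


# Hypothetical toy data: each user has a list of interests.
users_interests = {
    "alice": ["python", "statistics", "regression"],
    "bob": ["python", "regression", "probability"],
    "carol": ["java", "databases"],
}

# A fixed ordering of all distinct interests, so every user maps to a vector of equal length.
all_interests = sorted({i for interests in users_interests.values() for i in interests})


def interest_vector(interests):
    """Binary vector: 1 where the user has the interest, 0 otherwise."""
    return [1 if interest in interests else 0 for interest in all_interests]


def most_similar_users(user):
    """Other users ranked by cosine similarity, most similar first; zero matches are dropped."""
    target = interest_vector(users_interests[user])
    scores = [
        (other, cosine_similarity(target, interest_vector(interests)))
        for other, interests in users_interests.items()
        if other != user
    ]
    return sorted([pair for pair in scores if pair[1] > 0],
                  key=lambda pair: pair[1], reverse=True)


def suggest_interests(user):
    """Interests the user lacks, weighted by the similarity of the users who have them."""
    weights = defaultdict(float)
    for other, similarity in most_similar_users(user):
        for interest in users_interests[other]:
            if interest not in users_interests[user]:
                weights[interest] += similarity
    return sorted(weights.items(), key=lambda pair: pair[1], reverse=True)
```

With this toy data, `most_similar_users("alice")` ranks only `bob` (they share two of three interests), and `suggest_interests("alice")` proposes `probability`, the one interest `bob` has that `alice` lacks.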

Installation notes

MLwP is built using Python 3.5. The easiest way to set up a compatible environment is to use Conda. This will set up a virtual environment with the exact version of Python used for development along with all the dependencies needed to run MLwP.

  1. Download and install Conda.
  2. Create a Conda environment with Python 3.

(Note: enter cd ~ to go to $HOME, then run these commands.)

```
conda create --name *your env name* python=3.5
```

You will get the following output; mlwp-test is the environment name used in this example:

    Solving environment: done

    ## Package Plan ##

    environment location: /home/user/anaconda3/envs/mlwp-test

    added / updated specs:
     - python=3.5


    The following NEW packages will be INSTALLED:

     ca-certificates: 2018.12.5-0
     certifi:         2018.8.24-py35_1
     libedit:         3.1.20181209-hc058e9b_0
     libffi:          3.2.1-hd88cf55_4
     libgcc-ng:       8.2.0-hdf63c60_1
     libstdcxx-ng:    8.2.0-hdf63c60_1
     ncurses:         6.1-he6710b0_1
     openssl:         1.0.2p-h14c3975_0
     pip:             10.0.1-py35_0
     python:          3.5.6-hc3d631a_0
     readline:        7.0-h7b6447c_5
     setuptools:      40.2.0-py35_0
     sqlite:          3.26.0-h7b6447c_0
     tk:              8.6.8-hbc83047_0
     wheel:           0.31.1-py35_0
     xz:              5.2.4-h14c3975_4
     zlib:            1.2.11-h7b6447c_3

    Proceed ([y]/n)?  *Press y*

    Preparing transaction: done
    Verifying transaction: done
    Executing transaction: done
    #
    # To activate this environment, use:
    # > source activate mlwp-test
    #
    # To deactivate an active environment, use:
    # > source deactivate
    #

The environment is successfully created.

  3. Now activate the Conda environment.

    source activate *your env name*

    Your prompt will then show the environment name:

    (mlwp-test) amogh@hp15X34:~$ 

    Enter conda list to get the list of installed packages:

    (mlwp-test) amogh@hp15X34:~$ conda list
    # packages in environment at /home/amogh/anaconda3/envs/mlwp-test:
    #
    # Name                    Version                   Build  Channel
    ca-certificates           2018.12.5                     0  
    certifi                   2018.8.24                py35_1  
    libedit                   3.1.20181209         hc058e9b_0  
    libffi                    3.2.1                hd88cf55_4  
    libgcc-ng                 8.2.0                hdf63c60_1  
    libstdcxx-ng              8.2.0                hdf63c60_1  
    ncurses                   6.1                  he6710b0_1  
    openssl                   1.0.2p               h14c3975_0  
    pip                       10.0.1                   py35_0  
    python                    3.5.6                hc3d631a_0  
    readline                  7.0                  h7b6447c_5  
    setuptools                40.2.0                   py35_0  
    sqlite                    3.26.0               h7b6447c_0  
    tk                        8.6.8                hbc83047_0  
    wheel                     0.31.1                   py35_0  
    xz                        5.2.4                h14c3975_4  
    zlib                      1.2.11               h7b6447c_3 

  4. Install the required dependencies.

    (mlwp-test) amogh@hp15X34:~$ conda install --yes --file *path to requirements.txt*

  5. If you cannot install the packages, or you get a PackagesNotFoundError, use the following command: conda install -c conda-forge *list of packages separated by space*. For more information, refer to issue #3, "Unable to install requirements".
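
Once the environment is activated, a quick check from Python can confirm that the interpreter matches the version named above. The following is a minimal sketch, not part of MLwP: the function names are hypothetical, and it assumes Conda exposes the active environment name through the `CONDA_DEFAULT_ENV` environment variable.

```python
import os
import sys

# Python version named in the installation notes.
EXPECTED_PYTHON = (3, 5)


def is_expected_python(version_info, expected=EXPECTED_PYTHON):
    """Return True if the major.minor part of version_info equals expected."""
    return tuple(version_info[:2]) == tuple(expected)


def describe_environment(version_info=None, environ=None):
    """Summarize the active Conda environment name and the Python version in one line."""
    version_info = sys.version_info if version_info is None else version_info
    environ = os.environ if environ is None else environ
    # Assumption: Conda sets CONDA_DEFAULT_ENV when an environment is active.
    env_name = environ.get("CONDA_DEFAULT_ENV", "<no conda environment active>")
    status = "OK" if is_expected_python(version_info) else "unexpected version"
    return "env={} python={}.{} ({})".format(
        env_name, version_info[0], version_info[1], status
    )


if __name__ == "__main__":
    print(describe_environment())
```

Run inside the activated environment, the script prints one summary line; the arguments exist only so the check can be exercised without depending on the interpreter that runs it.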

How good is the code?

  • It is well tested
  • It passes style checks (PEP8 compliant)
  • It can compile in its current state (and there are relatively few issues)

How much support is available?

  • FAQs (coming soon)
  • Documentation (coming soon)

Issues

Feel free to submit issues and enhancement requests.

Contributing

Please refer to each project's style guidelines and guidelines for submitting patches and additions. In general, we follow the "fork-and-pull" Git workflow.

  1. Fork the repo on GitHub
  2. Clone the project to your own machine
  3. Commit changes to your own branch
  4. Push your work back up to your fork
  5. Submit a pull request so that we can review your changes

NOTE: Be sure to merge the latest from "upstream" before making a pull request!
