Koh Jia Xuan (@kohjiaxuan)

Top repositories

1

Wikipedia-Article-Scraper

A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive code.
Python
17
star
2

Stock-Market-Dashboard

Creating a stock market dashboard from an external API that tracks daily performance of stocks
Python
14
star
3

NLP-Model-for-Corpus-Similarity

A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
Python
9
star
4

Visualisation-of-Gradient-Descent

By visualizing the gradient descent algorithm applied on a set of points that fits a quadratic equation, we understand better how the algorithm works in machine learning
Jupyter Notebook
2
star
5

Predicting-HDB-Price-with-Machine-Learning

Data Project of Predicting HDB Resale Flat Prices with data cleaning, feature engineering and machine learning. Models used: Random Forest, XGBoost, Neural Networks, Decision Tree, Support Vector Regressors, Linear Regression
Jupyter Notebook
2
star
6

Data-Science-Competition-for-Revenue-Maximization

Data Science Competition that challenged teams to come up with creative ways to increase the revenue of an e-commerce company. Won 1st place! Write-up in repository
1
star
7

Fraud-Detection-Pipeline

A structured data science pipeline for classification problems that does scaling, sampling, k-fold cross validation with evaluation metrics
Jupyter Notebook
1
star