Discover @Wittline Open Source projects

Ramses Alexander Coraspe Valdez (@Wittline)

Stars 340
Global Rank 80,470 (Top 3 %)
Followers 90
Following 116
Registered about 10 years ago
Most used languages

Python 45.9 %
HTML 29.7 %
Jupyter Notebook 16.2 %
VBA 2.7 %
C# 2.7 %
SCSS 2.7 %
Location 🇲🇽 Mexico
Country Total Rank 153
Country Ranking

VBA 1
Jupyter Notebook 11
Python 40
HTML 42
C# 90
SCSS 125

uber-expenses-tracking

The goal of this project is to track the expenses of Uber Rides and Uber Eats through data engineering processes using technologies such as Apache Airflow, AWS Redshift, and Power BI.

Jupyter Notebook

apache-spark-docker

Dockerizing an Apache Spark Standalone Cluster

csv-schema-inference

A tool to automatically infer column data types in .csv files

Jupyter Notebook

data-engineer-challenge

Challenge Data Engineer

pyspark-on-aws-emr

The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can start using quickly, so you can focus on writing PySpark code.

pyDag

Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag

Dropout-Students-Prediction

The goal of this project is to identify students at risk of dropping out of school

data-engineering-challenge-th

Dockerizing a Python script for web scraping and consuming the scraped data using FastAPI (www.metroscubicos.com)

D3JS-Dashboard

Building a responsive dashboard with D3.js and ASP.NET MVC from scratch (SQL Server, SSIS, REST API)

wbz

A parallel implementation of the bzip2 data compressor in Python. The compression pipeline uses algorithms such as the Burrows–Wheeler transform (BWT) and move-to-front (MTF) to improve Huffman compression. For now, the tool focuses only on compressing .csv files and other files in tabular format.
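
The two transforms named in this description can be illustrated with a minimal, single-threaded sketch. This is not code from the wbz repository: the function names, the naive sorted-rotations approach, and the `"\0"` sentinel are all assumptions chosen for readability, and a real compressor would use a far more efficient construction.

```python
def bwt(text, sentinel="\0"):
    """Naive Burrows-Wheeler transform: sort all rotations, keep the last column.

    Assumes the sentinel does not occur in `text` and sorts before every
    other character. This is quadratic in memory; illustration only.
    """
    s = text + sentinel
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(rotation[-1] for rotation in rotations)


def inverse_bwt(last_column, sentinel="\0"):
    """Invert the BWT by repeatedly prepending the last column and sorting."""
    table = [""] * len(last_column)
    for _ in range(len(last_column)):
        table = sorted(ch + row for ch, row in zip(last_column, table))
    # The row that ends with the sentinel is the original text plus sentinel.
    original = next(row for row in table if row.endswith(sentinel))
    return original[:-1]


def mtf_encode(text, alphabet):
    """Move-to-front: emit each symbol's current index, then move it to the front."""
    symbols = list(alphabet)
    indices = []
    for ch in text:
        idx = symbols.index(ch)
        indices.append(idx)
        symbols.insert(0, symbols.pop(idx))
    return indices


def mtf_decode(indices, alphabet):
    """Inverse of mtf_encode, starting from the same initial alphabet."""
    symbols = list(alphabet)
    out = []
    for idx in indices:
        ch = symbols.pop(idx)
        out.append(ch)
        symbols.insert(0, ch)
    return "".join(out)


transformed = bwt("banana")
alphabet = sorted(set(transformed))
codes = mtf_encode(transformed, alphabet)
restored = inverse_bwt(mtf_decode(codes, alphabet))
```

The BWT tends to group identical characters together, so the move-to-front step produces many small indices, which is the kind of skewed distribution a Huffman coder compresses well.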

recommendation-system

Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)

docker-livy

Dockerizing and Consuming an Apache Livy environment

text-analysis-speeches-amlo

Text analysis of the speeches, conferences and interviews of the current president of Mexico

Jupyter Notebook

tf-idf

Term Frequency-Inverse Document Frequency from Scratch

Huffman-decoding

A New Approach for Efficient Sequential Decoding of Static Huffman Codes

dataengineering-assignment

Prescreening Tasks for Data Engineer

Jupyter Notebook

distance-metrics

Distance metrics are one of the most important parts of some machine learning algorithms, both supervised and unsupervised. They help us calculate and measure similarities between numerical values expressed as data points.

Jupyter Notebook
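
As a rough illustration of what such metrics compute, here is a minimal sketch of three common ones. The selection of metrics and the function names are assumptions made for this example and are not taken from the distance-metrics repository.

```python
import math


def euclidean(a, b):
    """Straight-line distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


p, q = (0, 0), (3, 4)
d_euclid = euclidean(p, q)   # 5.0
d_manhat = manhattan(p, q)   # 7
```

Euclidean and Manhattan distances grow as points move apart, while cosine similarity compares direction only and ignores magnitude.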

csv-estimate-rows

csv-shuffler

A tool to automatically shuffle lines in .csv files

livyc

Apache Spark as a Service with Apache Livy Client

MachineLearning

The repository contains basic experiments using machine learning algorithms with Python

RESTful-APIs-Nodejs

Building fast, scalable and secure RESTful services with Node, Express and MongoDB

Moving-Average-Spark

How to Compute Moving Average with Spark

SparkSQL-with-Python

This repository has some examples of using Spark and SparkSQL with Python through PySpark

Wittline

Take a look at my repository

GPU-Programming-with-Python

GPU programming with Python lets you take advantage of the computing power of your graphics processing unit (GPU). We will work with NVIDIA’s CUDA library.

csv-columnar

apache-spark-course

Apache Spark with Python

Jupyter Notebook

Data-Analytics-with-R

Repository for data analytics course using R

optimizing-public-transportation

Streaming event pipeline around Apache Kafka and its ecosystem. Using public data from the Chicago Transit Authority, we will construct an event pipeline around Kafka that allows us to simulate and display the status of train lines in real time.

Contextual-Data-Transforms

This repository contains the most important contextual data transformation algorithms, which help improve the compression rate achieved by statistical encoders. Ramses Alexander Coraspe Valdez

Computer-Vision-and-Deep-Learning

This repository contains information on the basic techniques and algorithms used in computer image processing, in addition to some projects related to pattern recognition using deep learning.

csv-generator

wittline.github.io

My GitHub profile

Python

Software Analysis, Design and Construction with Python

model-catalog-grpc

A gRPC service to consume any machine learning model stored in a model catalog through a single endpoint.

csv-splitter

Python-recursion

This repository shows the implementation of the most common recursive algorithms

Multiprocessing

Improving the Performance of the Statistical Redistribution of Message Symbols Using Architectural Patterns for Parallel Programming

code_challenges

Scripts for different purposes

burrows-wheeler-transform

Implementation of the Burrows–Wheeler transform algorithm in Python for data compression