There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Agile_Data_Code_2
Code for Agile Data Science 2.0, O'Reilly 2017, Second EditionAgile_Data_Code
Chapter-wise code for Agile Data the O'Reilly bookweakly_supervised_learning_code
The source code to the book Weakly Supervised Learning (O'Reilly, 2020) by Russell JurneyCollecting-Data
This is a HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka.enron-avro
Code for creating and querying an Avro encoded repository of the UC Berkeley Enron email archivegithub-explorer
Recommender system for Github projects using the github archive dataenron-python-flask-cassandra-pig
Hortonworks demo of Enron emails with Pig, Cassandra, Python and FlaskCloud-Stenography
Main Repoenron-node-mongo
Building a simple Node application with Pig, MongoDB, Node.js and the Enron Emailspig-to-json
A Pig to JSON UDF for Pig that converts tuples and bags to JSON stringsenron-elasticsearch
Pig/ElasticSearch/Wonderdog example with the Enron Emailscoursera_machine_learning
Python examples of the homework examples for Andrew Ng's Stanford Machine Learning class on Courseragithub_network
Experimentation with Github data as a networkamazon_open_source
Analyzing Amazon's Free and Open Source Software (FOSS) contributionsBooting-the-Analytics-Application
Data Syndrome HOWTOdisco
A library for company name parsing based on cleancoenron-hive
Working with the Enron emails in Pig and HIVEenron-pig-tojson-redis-node
Enron Emails -> Pig ->ToJson -> RedisStorer -> Node.jsdruid-application-development
A Realtime Chart Web Application Development with Druidlibpostal-reborn
Code to go with my blog post, Libpostal, Reborn!paas_blog
A series of blog posts exploring PaaS for automating data science taskstimeseriesserde
A time series serde for HIVEdeep_products
A book on building products using deep learning and natural language processingenron-hcatalog
Using HCatalog with the Enron Avro datasethive_tweets
Process your tweets in Apache Hiveproperty_graph_analytics
A forthcoming book on property graph analyticsnltk_exercises
Working through the nltk bookDattack
commoncrawl-pig-arcfileloader-udf-storefunc
Pig ArcFileLoader examples for loading the Common Crawl internet dataenron-pig-accumulo
Example of using Pig with Accumulo on the Berkely enron emailsbaby_names
Project for US Baby Names example dashboard on Apache Supersetopen_business_graph
Code relating to the Relato Business Graph on data.worlddeep_learning
Deep learning tools and utilitiessuperset_postgres_github
A project to wrangle github event data into Postgres for Superset to analyzeLinearAlgebra
A Processing project to visualize all of Linear Algebra! :)druid-python-demo
Demonstration of druid, pyDruid, Flask and d3.jsaddressbook_extensions
Titanium AddressBook extensions for iOS.quantum_ai_readme
README cataloging resources for learning about Quantum Computing applications in Artificial Intelligenceatlanta-directory-project
Processing Atlanta Directories from Emory University to understand the demographics of race and class in Atlanta in the Late 19th and early 20th centuriesLove Open Source and this site? Check out how you can help us