There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Agile_Data_Code_2
Code for Agile Data Science 2.0, O'Reilly 2017, Second EditionAgile_Data_Code
Chapter-wise code for Agile Data the O'Reilly bookweakly_supervised_learning_code
The source code to the book Weakly Supervised Learning (O'Reilly, 2020) by Russell JurneyCollecting-Data
This is a HOWTO for collecting data in Ruby and Python applications and sending it to S3 via Kafka.enron-avro
Code for creating and querying an Avro encoded repository of the UC Berkeley Enron email archivegithub-explorer
Recommender system for Github projects using the github archive dataenron-python-flask-cassandra-pig
Hortonworks demo of Enron emails with Pig, Cassandra, Python and FlaskCloud-Stenography
Main Repoenron-node-mongo
Building a simple Node application with Pig, MongoDB, Node.js and the Enron Emailspig-to-json
A Pig to JSON UDF for Pig that converts tuples and bags to JSON stringsenron-elasticsearch
Pig/ElasticSearch/Wonderdog example with the Enron Emailscoursera_machine_learning
Python examples of the homework examples for Andrew Ng's Stanford Machine Learning class on Courseragithub_network
Experimentation with Github data as a networkamazon_open_source
Analyzing Amazon's Free and Open Source Software (FOSS) contributionsBooting-the-Analytics-Application
Data Syndrome HOWTOdisco
A library for company name parsing based on cleancoenron-hive
Working with the Enron emails in Pig and HIVEenron-jruby-sinatra-hbase-pig
Hortonworks demo of Enron emails using Hadoop, Pig, HBase, JRuby, Sinatraenron-pig-tojson-redis-node
Enron Emails -> Pig ->ToJson -> RedisStorer -> Node.jsdruid-application-development
A Realtime Chart Web Application Development with Druidlibpostal-reborn
Code to go with my blog post, Libpostal, Reborn!paas_blog
A series of blog posts exploring PaaS for automating data science taskstimeseriesserde
A time series serde for HIVEdeep_products
A book on building products using deep learning and natural language processingenron-hcatalog
Using HCatalog with the Enron Avro datasethive_tweets
Process your tweets in Apache Hiveproperty_graph_analytics
A forthcoming book on property graph analyticsnltk_exercises
Working through the nltk bookDattack
commoncrawl-pig-arcfileloader-udf-storefunc
Pig ArcFileLoader examples for loading the Common Crawl internet dataenron-pig-accumulo
Example of using Pig with Accumulo on the Berkely enron emailsbaby_names
Project for US Baby Names example dashboard on Apache Supersetopen_business_graph
Code relating to the Relato Business Graph on data.worlddeep_learning
Deep learning tools and utilitiessuperset_postgres_github
A project to wrangle github event data into Postgres for Superset to analyzeLinearAlgebra
A Processing project to visualize all of Linear Algebra! :)druid-python-demo
Demonstration of druid, pyDruid, Flask and d3.jsaddressbook_extensions
Titanium AddressBook extensions for iOS.quantum_ai_readme
README cataloging resources for learning about Quantum Computing applications in Artificial IntelligenceLove Open Source and this site? Check out how you can help us