There are no reviews yet. Be the first to send feedback to the community and the maintainers!
spark-distributed-louvain-modularity
Spark / graphX implementation of the distributed louvain modularity algorithmdistributed-graph-analytics
Distributed Graph Analytics (DGA) is a compendium of graph analytics written for Bulk-Synchronous-Parallel (BSP) processing frameworks such as Giraph and GraphX. The analytics included are High Betweenness Set Extraction, Weakly Connected Components, Page Rank, Leaf Compression, and Louvain Modularity.correlation-approximation
Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasetsmitie-trainer
Model Training tool for MITIEnewman
Quickly analyze and explore email with advanced analytics and visualization.pst-extraction
PST extraction and analytic pipelinedistributed-louvain-modularity
Community Detection and Compression Analytic for Big Graph Datagraphene
zephyr
Zephyr is a big data, platform agnostic ETL API, with Hadoop MapReduce, Storm, and other big data bindings.watchman
Watchman: An open-source social-media event-detection systemtrack-communities
A series of analytics for creating networks from geo-temporal track data based on time/space co-occurrence. Includes UI for visualization of communities and tracks.Datawake
Browser add-on and web server to support collection and analysis of web browsing data.Datawake-Legacy
This project is superseded by the current Datawake project but is maintained here for existing users. Browser extension and backend services aimed at enhancing Internet search with domain specific knowledge, collaboration, and analysis.DatawakeDepot
Loopback web application for administration of Datawake networkshigh-betweenness-set-extraction
Approximate Betweenness Centrality computation for big graph data.rhipe-arima
An R/Hadoop Arima analytic using Rhipe to submit mapreduce jobs.GEQE
Geo Event Quey by Example - Leverage geo-located temporal text data in order to identify similar locations or events.firmament
NodeJS script and Docker files to create MySQL/MongoDB backed AngularJS/Bootstrap web applicationdatawake-prefetch
page-rank
social-sandbox
Geo-temporal scraping of social media, unsupervised event detectionxdata-vm
Vagrant-Ubuntu VM serving as a platform for XDATA performer software integrationxdata-nba
Tools to mine nba dataleaf-compression
DatawakeManager-WebApp
DatawakeManager Web Servernewman-vm
newman vminteractive-graph-viewer
An R Shiny app for interactively viewing the results of the Louvain method for community detection.hive-common-udf
A collection of common Apache Hive UDFstriangle-counting
A port of the work at Sandia National Laboratories on approximate triangle counting via wedge sampling.merlin-stack
graphene-enron
go_watchman
github.com/watchman apps for which go is specifically well suitedgraphene-walker
Rmmtsne
A native R implementation of multiple maps t-distributed stochastic neighbor embedding (mmtsne).twitter-cacher
Twitter Scraperzephyr-sample-project
A sample project (or, rather, sample projects) to show various ways of using Zephyr - generally a good starting point for your own Zephyr implementations.vande
sotera.github.io
DatawakeManager-Loopback
DatawakeManager Data Layernewman-research
Tools to be evaluated prior to integration into Newmangraphene-instagram
A version of Graphene that runs on scraped Instagram data.DatawakeFFPlugin
JMI based Datawake plugin for Firefox 38+zephyr-contrib
Useful classes for functions outside the scope of Zephyr's ETL, but still used in many scenarios (generally with extensive dependencies that probably shouldn't be in the core API).DatawakeSuite
micropath-kml
For creating kml to visualize aggregate micro-path output.Love Open Source and this site? Check out how you can help us