• Stars
    star
    1
  • Language
    Scala
  • License
    GNU General Publi...
  • Created over 9 years ago
  • Updated over 9 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

extracting Tweets from Hadoop HDFS using Apache Scala

More Repositories

1

ebot

Ebot, an Opensource Web Crawler built on top of a nosql database (apache couchdb, riak), AMQP database (rabbitmq), webmachine and mochiweb. Ebot is written in Erlang and it is a very scalable, distribuited and highly configurable web cawler. See wiki pages for more details
Erlang
326
star
2

pyspark-examples

pyspark sample scripts
Python
17
star
3

twitterBot

A simple Twitter bot written in Elixir
Elixir
15
star
4

predictoR

A forecasting tool for time series, Long Term Prediction.
R
12
star
5

twitter-r-utils

Twitter R utils
R
8
star
6

botkit-starter-web-rasa-nlu

botkit starter template using the web interface and rasa-nlu middleware
JavaScript
6
star
7

docker-mongodb-rpi

Docker image for Mongodb on Raspberry PI
5
star
8

qsense

Qsense is a python library and command line tool for QlikSense
Python
4
star
9

ewg

Erlang Wordlist Generator
Erlang
4
star
10

proloGraph

A demo for a REST server developed in prolog
Prolog
3
star
11

ltp

Time series forecasting - log term predictions over timeseries
R
3
star
12

ragno

Simple web domains crawler written in erlang
Erlang
3
star
13

r-istat

Some example R scripts for istat data
R
3
star
14

dump_tweets.R

dump_tweets.R is a tool for searching tweets and crawl (recursively) users from twitter. Data are then saved to a MySQL database and can finally be exported to .RData files
R
3
star
15

msad

msad is a python library and commandline for ActiveDirectory
Python
2
star
16

apache-camel-blueprint-samples

2
star
17

node-express-rest-oracle-docker

sample of sharing (oracle) database tables via rest api using node.js, express in a docker container
JavaScript
2
star
18

dump_tweets.py

Incremental Dumper (to DB) of tweets from Twitters
Python
2
star
19

strategico

Automatically exported from code.google.com/p/strategico
R
1
star
20

oreste

Automatically exported from code.google.com/p/oreste
Erlang
1
star
21

BOTeo

JavaScript
1
star
22

etsdb

TimeSeries database
1
star
23

cryptoBot

Python
1
star
24

mdm.py

A generic and flexible Master Data Management tool written in Python
Python
1
star
25

apollo-datasource-qliksense

Apollo datasource for Qliksense
JavaScript
1
star
26

talendcloud

1
star
27

twitteRutils

Useful functions on twitter data (timelines, search, people,..)
R
1
star
28

steampipe-plugin-talend

steampipe-plugin-talend
Go
1
star
29

scrapy_web

Scrapy spiders for the web
Python
1
star
30

qlik-cloud-devops-terraform-opentofu

Qlik Cloud devops with OpenTofu or Terraform
HCL
1
star
31

TwitterPopularTags

TwitterPopularTags: the famous example of Spark Streaming in a standalone project
Scala
1
star
32

talendcloud-go

Talend cloud API wrapper
Go
1
star
33

extractTweets.py

Python
1
star
34

apollo-datasource-ldap

LDAP datasource for apollo graphql server
JavaScript
1
star
35

aws_ext

Python
1
star
36

tfbot

bot with tensorflow ML
Python
1
star
37

dmcommunity-challenges

My solutions for challenges from https://dmcommunity.org/challenge/
Prolog
1
star