• Stars
    star
    1,020
  • Rank 45,113 (Top 0.9 %)
  • Language
    Shell
  • Created over 8 years ago
  • Updated 5 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Gitter chat

docker-hive

This is a docker container for Apache Hive 2.3.2. It is based on https://github.com/big-data-europe/docker-hadoop so check there for Hadoop configurations. This deploys Hive and starts a hiveserver2 on port 10000. Metastore is running with a connection to postgresql database. The hive configuration is performed with HIVE_SITE_CONF_ variables (see hadoop-hive.env for an example).

To run Hive with postgresql metastore:

    docker-compose up -d

To deploy in Docker Swarm:

    docker stack deploy -c docker-compose.yml hive

To run a PrestoDB 0.181 with Hive connector:

  docker-compose up -d presto-coordinator

This deploys a Presto server listens on port 8080

Testing

Load data into Hive:

  $ docker-compose exec hive-server bash
  # /opt/hive/bin/beeline -u jdbc:hive2://localhost:10000
  > CREATE TABLE pokes (foo INT, bar STRING);
  > LOAD DATA LOCAL INPATH '/opt/hive/examples/files/kv1.txt' OVERWRITE INTO TABLE pokes;

Then query it from PrestoDB. You can get presto.jar from PrestoDB website:

  $ wget https://repo1.maven.org/maven2/io/prestosql/presto-cli/308/presto-cli-308-executable.jar
  $ mv presto-cli-308-executable.jar presto.jar
  $ chmod +x presto.jar
  $ ./presto.jar --server localhost:8080 --catalog hive --schema default
  presto> select * from pokes;

Contributors

More Repositories

1

docker-hadoop

Apache Hadoop docker image
Shell
2,196
star
2

docker-spark

Apache Spark docker image
Shell
2,036
star
3

docker-hadoop-spark-workbench

[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook and HDFS FileBrowser.
Makefile
688
star
4

docker-hbase

Makefile
246
star
5

docker-flink

Apache Flink docker image
Shell
191
star
6

README

General README for the Big Data Europe project's sources
83
star
7

demo-spark-sensor-data

Demo Spark application to transform data gathered on sensors for a heatmap application
Java
33
star
8

docker-kafka

Shell
31
star
9

docker-hive-metastore-postgresql

Postgresql configured to work as metastore for Hive.
TSQL
30
star
10

app-bde-pipeline

Bootstrap a pipeline on the BDE platform
Elixir
26
star
11

docker-zeppelin

Makefile
25
star
12

docker-hdfs-filebrowser

A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only.
Mako
11
star
13

docker-spark-notebook

Spark Notebook docker image
Makefile
10
star
14

docker-flume

Python
8
star
15

docker-zookeeper

[DEPRECATED]
Shell
8
star
16

docker-elasticsearch

Start Elasticsearch instance, initiate an index and submit the index schema (mappings)
Shell
8
star
17

app-bdi-ide

Common Lisp
7
star
18

WorkFlow-Builder

Application to build and export Big Data pipelines
Elixir
7
star
19

demo-integrator-ui

Showcase the demo for integrator UI with Hadoop, HDFS browser, Spark, Flink, Strabon, Sextant, Solr.
Shell
6
star
20

docker-ontario

Ontario: Ontology-based Architecture for Semantic Data Lakes
5
star
21

app-integrator-ui

Wrapping user interface for embedding pipeline component interfaces
JavaScript
5
star
22

app-stack-builder

Application which helps in the construction of docker-compose.yml files
Common Lisp
4
star
23

mu-init-daemon-service

Microservice to report the progress of a service's initialization process
Ruby
4
star
24

docker-event-detection

Shell
4
star
25

docker-strabon

Shell
4
star
26

pilot-sc6-cycle2

Shell
3
star
27

mu-swarm-admin-service

A microservice that allows BDE pipelines to be managed through a graph database
Python
3
star
28

app-http-logger

Logging system to observe running containers, inspect their traffic and make it available for visualization in ElasticSearch
Shell
3
star
29

graph-acl-basics

Testing environment for graph-based ACL using the Mu Query Rewriter
Common Lisp
2
star
30

docker-postgres

Dockerized postgres
Shell
2
star
31

vagrant-mesos-multinode

[DEPRECATED] Boot Mesos with Vagrant
Shell
2
star
32

pilot-sc7-change-detector

Java
2
star
33

mu-query-rewriter

Scheme
2
star
34

app-swarm-ui

Swarm User Interface based on docker-compose, mu.semte.ch and EmberJS
Common Lisp
2
star
35

ember-stack-builder-frontend

Frontend for the Stack Builder
JavaScript
2
star
36

demo-d3js-with-sparqlendpoint

JavaScript
2
star
37

docker-nginx-proxy-with-css

Nginx proxy topping pages with a BDE CSS style
CSS
2
star
38

docker-elk-stack

ELK stack Dockers for BDE pipelines
2
star
39

docker-4store

Shell
2
star
40

mu-event-query-service

Microservice to query a DB for docker container events and return information in json format.
Python
1
star
41

pilot-sc2-cycle1

Scala
1
star
42

mu-swarm-admin-proxy

The entrypoint of all pipelines
1
star
43

docker-solr

1
star
44

WorkFlow-Monitor

Ember frontend to monitor a BDE pipeline
JavaScript
1
star
45

mu-swarm-logger-service

Writes docker logs into the triplestore and/or into files
Python
1
star
46

docker-kafkasail

1
star
47

vagrant-hadoop-singlenode

[DEPRECATED] Boot Hadoop with Vagrant
Shell
1
star
48

docker-geotriples-ws

1
star
49

mu-docker-stats

Microservice to fetch statistics data about the running containers to show it in the frontend for visual feedback.
Python
1
star
50

mu-query-rewriter-sandbox

A sandbox application that allows people to check the query rewriter
JavaScript
1
star
51

pilot-sc7-geotriples

Java
1
star
52

mu-pipeline-service

Provides resources to describe a Big Data pipeline in mu.semte.ch
Common Lisp
1
star
53

mu-har-transformation-service

Transforms each pcap file in a given directory into .har files (json) and pushes them into an ELK instance
Python
1
star
54

docker-kibana

Extended Kibana docker image with several plugins installed by default
1
star