• Stars
    star
    104
  • Rank 330,604 (Top 7 %)
  • Language
    Scala
  • License
    Apache License 2.0
  • Created about 7 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Seahorse

Seahorse is an open-source visual framework allowing you to create Apache Spark applications in a fast, simple and interactive way.

Seahorse is distributed under the Apache 2.0 License.

Building Seahorse from source

Prerequisites:

  • docker 1.30
  • docker-compose 1.9
  • JDK 8
  • sbt 0.13
  • python 2.7
    • PyYAML
  • npm 4.6
  • jekyll 3.2
    • pygments.rb
    • jekyll-sass-converter
    • jekyll-watch
  • PhantomJS

Run

./build/build_all.sh

This will build all the needed docker images and create a docker-compose.yml file. You can now run it using docker-compose up. Seahorse will start at http://localhost:33321.

A good place to start using Seahorse is the basic examples section of the documentation.

Development

Note that in order to contribute to Seahorse you have to sign the Contributor License Agreement.

Before submitting a PR, please run the Scala style check:

sbt scalastylebackend && (cd ./seahorse-workflow-executor && sbt scalastyle)

Running tests

Initialize the submodules before running the tests:

git submodule init
git submodule update

Backend tests:

./build/build_and_run_tests.sh

Frontend tests:

./frontend/run_unit_tests.sh

End-to-end integration tests:

./build/e2e_tests.sh -a

In order for Seahorse to compile and run correctly on Mac OS, you need to increase memory for Docker engine to at least 6GB.

Bash completion for Python scripts

Some of our Python scripts used by devs support bash autocompletion using argcomplete.

pip install argcomplete
activate-global-python-argcomplete --user

See this for global completion support.

Mac OS

Note, that bash 4.2 is required. Installation instruction for Mac users

After the bash upgrade, you may have to rename .bash_profile to .bashrc. And maybe add /usr/local/bin to $PATH. Also, check if you're actually running the new bash with echo $BASH_VERSION - your terminal might still be using the old one.

Developing SDK operations on local repository

To compile and test SDK operations on local repository, you can use seahorse-sdk-example submodule

git submodule init
git submodule update
./build/prepare_sdk_dependencies.sh

Now it will compile and test against the local Seahorse repository:

cd seahorse-sdk-example
sbt test

Enterprise options and support

Seahorse was originally created at deepsense.ai. Technical support and customization options are available upon contact.

More Repositories

1

roi-pooling

C++
460
star
2

carla-birdeye-view

Bird-eye's view for CARLA simulator
Python
174
star
3

db-ally

Efficient, consistent and secure library for querying structured data with natural language
Python
115
star
4

carla-real-traffic-scenarios

Python
73
star
5

Distributed-BA3C

Python
56
star
6

pydatawarsaw-notebooks

CSS
55
star
7

edge-slm

This project is a native implementation of a RAG pipeline for Small Language Models tested on Android devices. The main goal was to fit the whole RAG pipeline into a resource constrained device - ie. smartphone. By design the provided RAG library should be deployable on various platforms.
C++
50
star
8

Keras-PyTorch-AvP-transfer-learning

We pit Keras and PyTorch against each other, showing their strengths and weaknesses in action. We present a real problem, a matter of life-and-death: distinguishing Aliens from Predators!
Jupyter Notebook
48
star
9

ds-splat

Cuda
42
star
10

seahorse-workflow-executor

Scala
41
star
11

tensorflow_on_slurm

Python
40
star
12

unblackboxing_webinar

Jupyter Notebook
28
star
13

ds-template

Template for professional data science and python applications made by deepsense.ai
Python
27
star
14

intel-ai-webinar-neural-networks

Jupyter Notebook
15
star
15

deep_learning_art_webinar

Jupyter Notebook
10
star
16

BA3C-CPU

C++
8
star
17

ds-pycontain

Library to run python REPL in isolated docker container and helpful abstraction for docker containers/images. in python
Python
7
star
18

ragbits

Building blocks for rapid development of GenAI applications
Python
6
star
19

hands-on-deep-learning

Jupyter Notebook
6
star
20

seahorse-sdk-example

Examples of usage of Seahorse SDK
Scala
4
star
21

trelbert

Repository for TrelBERT: A pre-trained encoder for Polish Twitter
3
star
22

mrunner

Python
2
star
23

hackathon_gov_pl

Repository for hackathon.gov.pl.
Jupyter Notebook
2
star
24

neptune-r-client-library

R
1
star