• Stars
    star
    176
  • Rank 216,987 (Top 5 %)
  • Language
    JavaScript
  • License
    MIT License
  • Created over 8 years ago
  • Updated over 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Tool for exploring Word Vector models

Word2Vec Explorer

This tool helps you visualize, query and explore Word2Vec models. Word2Vec is a deep learning technique that feeds massive amounts of text into a shallow neural net which can then be used to solve a variety of NLP and ML problems.

Word2Vec Explorer uses Gensim to list and compare vectors and it uses t-SNE to visualize a dimensional reduction of the vector space. Scikit-Learn is used for K-Means clustering.

The UI is built using React, Babel, Browserify, StandardJS, D3 and Three.js.

TSNE 10K

TSNE Labels

Vector Comparisons

Setup

To install all Python depenencies:

pip install -r requirements.txt

Usage

Load the explorer with a Word2Vec model:

./explore GoogleNews-vectors-negative300.bin

Now point your browser at localhost:8080 to load the explorer!

Obtaining Pre-Trained Models

A classic example of Word2Vec is the Google News model trained on 600M sentences: GoogleNews-vectors-negative300.bin.gz

[More pre-trained models]](https://github.com/3Top/word2vec-api#where-to-get-a-pretrained-models)

Development

In order to make changes to the user interface you will need some NPM dependencies:

npm install
npm start

The command npm start will automatically transpile and bundle any code changes in the ui/ folder. All backend code can be found in explorer.py and ./explore.

Before submitting code changes make sure all code is compliant with StandardJS as well as Pep8:

standard
pep8 --max-line-length=100 *.py explore

Todo

  • 3D GPU/WebGL view (on branch 3d)
  • Make sure axes stay when zooming/panning scatterplot
  • Autocomplete in query interface
  • Look into supporting other high dimensional data models (go beyond word vectors)
  • Drill-down of vector that shows real distance between neighbors
  • Improved sample rated view that takes into account term counts and connectedness

More Repositories

1

node-facebook

Simple Facebook Integration for NodeJS (and Express)
JavaScript
124
star
2

python-bhtsne

Python module for Barnes-Hut implementation of t-SNE (Cython)
C++
110
star
3

deep-base

Deep learning base image for Docker (Tensorflow, Caffe, MXNet, Torch, Openface, etc.)
Makefile
79
star
4

kakuteru

Semantic-web enabled Lifestreamer built on RubyOnRails
Ruby
63
star
5

eth-indexer

High performance Ethereum smart contract event indexing for fast local retrieval
JavaScript
58
star
6

transferflow

Transfer Learning for Tensorflow
Python
34
star
7

app-skeleton

Simple JS App Skeleton (React, SemanticUI, Babel, Standard, ES6, Webpack, React-router)
JavaScript
32
star
8

node-api-skeleton

Node.js API Skeleton using frameworks du jour: ES6/Await, Express, Jest, Mongoose, Docker, JWT, etc.
JavaScript
28
star
9

ruby-v8

Run the V8 Javascript Engine from Ruby.
C++
22
star
10

xml-to-json-proxy

Tools that allow client-side XML calls in the browser. Using Flash crossdomain calls, the XML is converted to JSONML and delivered in JavaScript
13
star
11

iknow_developers

Example code on how to use the iKnow! API
PHP
11
star
12

shoal

A modern upstart replacement. Manage processes using centralized JSON configurations
JavaScript
10
star
13

relex-json-serve

Small server that allows you to access the Relex NLP toolkit using JSON RPC calls.
Ruby
7
star
14

moonfish-app

Open source platform for doing Token Sales and Initial Coin Offerings (ICOs)
JavaScript
6
star
15

crypto-toad

Crypto market intelligence via Telegram
Python
5
star
16

Pulsi

Virtual stock market simulator for pre-IPO companies.
JavaScript
4
star
17

content_focus

This is a little gem that allows you to input raw HTML and extract the most relevant piece of content.
Ruby
4
star
18

nnpack

Packaging and Data Portability for Neural Networks
Python
3
star
19

activity_mapper

A framework for aggregating (public) social activity into a single polymorphic persistent structure.
Ruby
3
star
20

python-api-skeleton

Simple Python-based JSON API skeleton for SPA SaaS apps.
Python
2
star
21

phpfbex

Simple JS/PHP Facebook integration Example
PHP
2
star
22

dominiek.github.com

Storing of static stuff
JavaScript
2
star
23

wqet-grader

Python
2
star
24

etherist

Real-time Ethereum currency data brokerage + Telegram Bot
Python
1
star
25

moonfish-api

Open source platform for doing Token Sales and Initial Coin Offerings (ICOs)
JavaScript
1
star