• Stars
    star
    5
  • Rank 2,861,937 (Top 57 %)
  • Language
    TeX
  • License
    GNU General Publi...
  • Created almost 9 years ago
  • Updated almost 7 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Script series for NLP: PMI, TF-IDF and Neural cooccurrence vectorization, vector (TF/IDF & PMI) data base distributed querying and population with Hadoop. Deep learning and kernel learning in sklearn.

More Repositories

1

dummy_fraud_detection

Fraud detection in credit card payments and auto insurance claims using PySpark
Python
11
star
2

sentence_embedding

A sentence embedding method based on weighted series
Python
9
star
3

define-semantic-annotation

Define is a semantic annotation software aimed at enhancing and constraining hand similarity annotation tasks.
Python
1
star
4

RPM_C1_phrases

RPM French corpus
1
star
5

summ_features

TeX
1
star
6

expconditions

Learning Machine trained for extraction of experimental conditions from scientific literature in the biomedical area
Python
1
star
7

multinomial-bayes-document-classifier

This is a matlab library which is implemented a multinomial Bayes classifier for text document classification. Send me a mail for using doubts. Any way, each function gives you a little help.
MATLAB
1
star
8

open-ncd-kbc

This repo contains software and results derived from the PRODEP project entitled "Reinforcement learning in the automatic acquisition of knowledge in noncommunicable diseases"
Python
1
star
9

describe_corpus

This is a dataset where each file is associated to a term. Each file in turn contains definitions for the associated term. All text snippets are embedded into doc2vec vector representations.
Python
1
star
10

seismic_embeddings

This project aims to represent seismic data samples in an embedding space to observe similarities among embeddings. Data samples were provided by the Mexican National Seismic service (Servicio Sismológico Nacional) including intensity measurements from 1900 to 2018.
Python
1
star
11

lamda-signal-clustering

Software system implementation for signal acquisition and pattern clustering via LAMDA method (Learning Algorithm for Multivariate Data Analysis, byJoseph Aguilar-Martín, CNRS). It is written in C/C++ and it requires licenced National Instruments software called LabWindows/CVI and a USB Digital I/O Device. This software was tested for the last time over windows 7 Professional OS. See associated MSc thesis for details.
C
1
star