• Stars
    star
    441
  • Rank 98,861 (Top 2 %)
  • Language
    Python
  • Created over 7 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

is_that_a_duplicate_quora_question

all the code for the article https://www.linkedin.com/pulse/duplicate-quora-question-abhishek-thakur will be available here..

How To

  1. Install Required Libraries
pip install pandas
pip install numpy
pip install scikit-learn
pip install nltk
pip install tqdm
pip install keras
pip install tensorflow
pip install pyemd
pip install fuzzywuzzy
pip install python-levenshtein
pip install --upgrade gensim
  1. Download Required Language libraries
mkdir data
cd data
wget http://www-nlp.stanford.edu/data/glove.840B.300d.zip
unzip glove.840B.300d.zip
rm glove.840B.300d.zip
wget http://qim.ec.quoracdn.net/quora_duplicate_questions.tsv
wget https://s3.amazonaws.com/dl4j-distribution/GoogleNews-vectors-negative300.bin.gz
sudo python -m nltk.downloader stopwords
cd ..
  1. Run
python feature_engineering.py
python deepnet.py

More Repositories

1

approachingalmost

Approaching (Almost) Any Machine Learning Problem
7,369
star
2

colabcode

Run VSCode (codeserver) on Google Colab or Kaggle Notebooks
Python
2,069
star
3

tez

Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep learning projects in PyTorch.
Python
1,160
star
4

autoxgb

XGBoost + Optuna
Python
676
star
5

diffuzers

a web ui & api for πŸ€— diffusers
Python
584
star
6

approaching_almost_nlp

Approaching (Almost) Any Natural Language Processing Problem
341
star
7

mlspace

MLSpace: Hassle-free machine learning & deep learning development
Python
303
star
8

wtfml

WTFML: Well That's Fantastic Machine Learning
Python
295
star
9

bert-sentiment

Python
269
star
10

how-to-become-a-ds-in-30-days

How to become a data scientist in 30 days
213
star
11

mlframework

Python
200
star
12

clickbaits_revisited

Deep learning models to identify clickbaits taking content into consideration
Python
171
star
13

long-text-token-classification

Python
162
star
14

greedyFeatureSelection

greedy feature selection based on ROC AUC
Python
125
star
15

bert-entity-extraction

Python
124
star
16

StableSAM

100
star
17

pysembler

An automatic ensembler of machine learning models in python
Python
67
star
18

captcha-recognition-pytorch

Python
61
star
19

sandesh

A simple app to send messages to Slack channels / members using webhook
Python
56
star
20

ml_dev_env

Machine Learning / Deep Learning Environment. Everywhere. Anywhere.
Dockerfile
50
star
21

commonlit-pairwise-model

Pairwise model for commonlit competition
Python
46
star
22

e01

Python
37
star
23

chaii-hindi-tamil-question-answering

chaii: hindi and tamil question answering
Python
36
star
24

melanoma-deep-learning

JavaScript
35
star
25

bert-tweet-sentiment

Python
31
star
26

automl_gpu

Python
26
star
27

walmart2015

Python
26
star
28

csv_test

26
star
29

AutoML

Python
24
star
30

imet-collection

Python
23
star
31

anime_hentai

Distinguishing between anime and hentai
Python
15
star
32

autonlp

AutoNLP: AutoML for NLP (WIP)
Python
13
star
33

abhishekkrthakur

9
star
34

ApproachingAlmostNLP

8
star
35

competitions-template

8
star
36

LCE

Local Collective Embeddings. Python translation of https://github.com/msaveski/LCE
Python
7
star
37

moa-kaggle

6
star
38

movie_recommender

6
star
39

av_minihack

Python
5
star
40

naivebees

Python
5
star
41

amazon_challenge

code for amazon employee access challenge
C
4
star
42

nuSVM

implementation of nusvm using cvxopt
Python
4
star
43

aaamlp_figures

4
star
44

ultramnist

3
star
45

testing

3
star
46

NDSB

national data science bowl @ kaggle
Python
2
star
47

finetuning_googlenet

Python
2
star
48

images

2
star
49

pyCoDi

implementation of CoDi saliency in python
Python
2
star
50

illumination-compensation

C++
2
star
51

fastFibonacci

fast fibonacci in cython
C
1
star
52

xformers

1
star
53

kaggle-afsis

Beating the Benchmark in Kaggle Afsis challenge
Python
1
star
54

EMC

em clustering
Python
1
star