• This repository has been archived on 30/Nov/2023
  • Stars
    star
    3
  • Rank 3,963,521 (Top 79 %)
  • Language SCSS
  • Created over 1 year ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Archived. My blog is now moved to https://github.com/thedataquarry

More Repositories

1

fine-grained-sentiment

A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
Python
165
star
2

tweet-stance-prediction

Applying NLP transfer learning techniques to predict Tweet stance toward a topic
Jupyter Notebook
107
star
3

db-hub-fastapi

Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients
Python
32
star
4

kuzudb-study

Benchmark study on KรนzuDB, an embedded OLAP graph database, on an artificial social network dataset
Python
25
star
5

duckdb-study

Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
Python
19
star
6

lancedb-study

Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search
Python
16
star
7

neo4j-python-fastapi

Bulk ingest data into Neo4j using sync or async Python, and expose the data via FastAPI
Python
12
star
8

fine-grained-sentiment-app

A Flask LIME explainer app for fine-grained sentiment classification.
Python
11
star
9

pydantic-benchmarks

Benchmarks testing the performance of various releases of Pydantic v2 ๐Ÿฆ€
Python
9
star
10

blog

Posts related to Data Science, engineering and machine learning.
Jupyter Notebook
7
star
11

topic-modelling

Comparing the scalability and quality of topic models in Gensim and PySpark
Python
5
star
12

patent-classification

Classify international patents into one of eight categories based on the text of their titles & abstracts using DistilBert & ONNX Runtime
Python
4
star
13

fine-grained-sentiment-app-streamlit

A LIME explainer app for fine-grained sentiment classification, written using Streamlit.
Python
3
star
14

graphdb-case-studies

Case studies showing the analysis of connected data using different graph databases and their Python client libraries
Python
3
star
15

rag-data-ops

Code for data ops when building RAG applications using LangChain and LlamaIndex
Python
2
star
16

mteb-validation

Compare different embedding models from MTEB leaderboard
Python
1
star
17

spectral-line-plots

Plot multiple lines with spectral colors to simultaneously compare similar datasets
Python
1
star