• Stars
    star
    4
  • Rank 3,304,323 (Top 66 %)
  • Language
    Python
  • License
    MIT License
  • Created about 3 years ago
  • Updated over 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Classify international patents into one of eight categories based on the text of their titles & abstracts using DistilBert & ONNX Runtime

More Repositories

1

fine-grained-sentiment

A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
Python
165
star
2

tweet-stance-prediction

Applying NLP transfer learning techniques to predict Tweet stance toward a topic
Jupyter Notebook
107
star
3

db-hub-fastapi

Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients
Python
32
star
4

kuzudb-study

Benchmark study on KΓΉzuDB, an embedded OLAP graph database, on an artificial social network dataset
Python
25
star
5

duckdb-study

Compare DuckDB, Polars and Pandas for generating an artificial dataset of persons and companies
Python
19
star
6

lancedb-study

Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search
Python
16
star
7

neo4j-python-fastapi

Bulk ingest data into Neo4j using sync or async Python, and expose the data via FastAPI
Python
12
star
8

fine-grained-sentiment-app

A Flask LIME explainer app for fine-grained sentiment classification.
Python
11
star
9

pydantic-benchmarks

Benchmarks testing the performance of various releases of Pydantic v2 πŸ¦€
Python
9
star
10

blog

Posts related to Data Science, engineering and machine learning.
Jupyter Notebook
7
star
11

topic-modelling

Comparing the scalability and quality of topic models in Gensim and PySpark
Python
5
star
12

fine-grained-sentiment-app-streamlit

A LIME explainer app for fine-grained sentiment classification, written using Streamlit.
Python
3
star
13

graphdb-case-studies

Case studies showing the analysis of connected data using different graph databases and their Python client libraries
Python
3
star
14

prrao87.github.io

Archived. My blog is now moved to https://github.com/thedataquarry
SCSS
3
star
15

rag-data-ops

Code for data ops when building RAG applications using LangChain and LlamaIndex
Python
2
star
16

mteb-validation

Compare different embedding models from MTEB leaderboard
Python
1
star
17

spectral-line-plots

Plot multiple lines with spectral colors to simultaneously compare similar datasets
Python
1
star