bangla-tts
Bangla text to speech: a multilingual (Bangla, English), real-time ([almost] on a GPU) speech synthesis library

eeg-rsenet
Motor Imagery EEG Signal Classification Using Random Subspace Ensemble Network

lesion-segmentation-melanoma-tl
Automatic Skin Lesion Segmentation and Melanoma Detection: Transfer Learning approach with U-Net and DCNN-SVM

keras-attn_aug_cnn
Extension of the `Attention Augmented Convolutional Networks` paper for 1-D convolution operation.

text-image-denoiser
Deep learning models trained to denoise/deblur text images (single frame, multi-frame) [pytorch]

bangla-news-rnn
Bangla news classification and generation

simple-gRPC
Simple gRPC with images in python.

covid19-few-shot-learning

imagebert-keras
Keras implementation of ImageBERT from Microsoft

awesome-multilingual-large-language-models
A comprehensive collection of multilingual datasets and large language models, meticulously curated for evaluating and enhancing the performance of large language models across diverse languages and tasks.

bangla-image-search
A dead-simple image search and image-text matching system for Bangla using CLIP

Fibro-CoSANet
Idiopathic pulmonary fibrosis (IPF) is a restrictive interstitial lung disease that causes lung function decline by lung tissue scarring. Although lung function decline is assessed by the forced vital capacity (FVC), determining the accurate progression of IPF remains a challenge. To address this challenge, we proposed Fibro-CoSANet, a novel end-to-end multi-modal learning-based approach, to predict the FVC decline. Fibro-CoSANet utilized CT images and demographic information in convolutional neural network frameworks with a stacked attention layer. Extensive experiments on the OSIC Pulmonary Fibrosis Progression Dataset demonstrated the superiority of our proposed Fibro-CoSANet by achieving the new state-of-the-art modified Laplace Log-Likelihood score of -6.68. This network may benefit research areas concerned with designing networks to improve the prognostic accuracy of IPF.

dot-res-lstm
Classification of ECG signals by dot Residual LSTM Network for anomaly detection

gcp-ml-certification
A study guide for preparing for the GCP Professional Machine Learning Certification in one week or less.

bangla-CLIP
CLIP (Contrastive Language–Image Pre-training) for Bangla.

autoocr
Python wrapper for cross platform tesseract OCR engine with multiple languages (e.g. Bangla)

keras-human-pose
A simple wrapper to localize human joints from images/video frames for multiple subjects.

qt-motion-analysis
A rugged Qt GUI application for processing webcam frames for ML applications (pose estimation)

sarcasm-detection-roberta
Sarcasm Detection using LSTM, GRU, and RoBERTa on SARC (reddit), sarcasm_v2, and iSARCASM (twitter) datasets

torch-speech-dataloader
A ready-to-use pytorch dataloader for audio classification, speech classification, speaker recognition, etc. with in-GPU augmentations

hybrid-dataset-eeg
DWT with coif_2, level 3 decomposition; subject A, subject C; 5 channels, 10 features (AAC, DASDV, IEMG, MAV, MMAV2, MYOP, RMS, SSC, SM2, LOG, WL), 50 features in total; 3 classes: left hand, right hand, leg

yolov3-anchor-clustering
Python script to run k-means clustering on any yolov3 format dataset to find appropriate anchors

paw-segmentation
🐾 Semantic segmentation of paws from cute pet images

activity-recognition-abc
SCAR-Net, Submission to the Cooking Activity Recognition Challenge, ABC: competition track

audioperm
A python library for generating different permutations of audible segments from audio files.

stock-price-lstm
Stock price prediction on Yahoo finance API data

autonomous-driving-system
'Autonomous driving system with Machine Learning' in Matlab

medical-image-TL
Medical Image Classification

awesome-speaker-recognition-verification
A curated list of awesome speaker recognition/verification papers, projects, datasets, and competitions.

darknet-multi-gpu-parallel
Running multiple darknet models in parallel in a multi-GPU setup

What-If-Explainability
Explaining Trees (LightGBM) with FastTreeShap (Shapley) and the What-If Tool

nlp-auto-essay-scoring
NLP Automated Essay Scoring with BERT

darknet-fastapi-modelserver
A simple fastapi model server for darknet (yolov1 to yolov4).

onnx-face-liveness
Face liveness with ONNX runtime

flask_restful
Templates for working with Flask-RESTful

picast
A lightweight fast data streaming library for raspberry pi in python.

VertexAI-notebook-demos
Notebook demos + tutorials for Vertex AI (training, deployment, explainability, monitoring, etc.)

EmotionsInTheWild-CNN-Benchmarks
Emotion (Context + Facial) recognition in the wild using ConvNets (EfficientNet, ResNet, ResNext)

spark-augmentation
A robust learning scheme built on a novel augmentation algorithm that scales to emulate a large set of image degradation and occlusion policies

Competitive-Programming
This repo contains solutions to some of the problems I solved in different 'OJ's.

pyemotivcortexv2
Simply written emotiv cortex API v2 in python to get data from emotiv headsets

weather-dashboard
Weather Search Dashboard

ml-cheatsheets
Machine Learning Cheatsheets for quick overview.

bangla-synthetic-license-plates
A Bangla license plates dataset (synthetic), generated with a mixture of deep learning and image processing. The labels are in darknet yolo format. [.txt, .data, .names]

crowdsource-voice-research
Crowdsourcing voice data for audio, speech research. Flask + Jinja2 website with waitress, nginx, and SSL (self-signed certs)

pikwizard-image-downloader
An image downloader for https://pikwizard.com using Selenium. Copyright-free image downloader.

pytorch-nlp
PyTorch NLP Text Classification (BBC NEWS, StackOverflow Tags)

fastapi-dash
Simple dashboard with interactive plots with FastAPI

ecg-arrythmia
ECG Anomaly Detection

whitebox-attack-malware-GAN
Generating Adversarial Malware Examples for White-Box Attacks Based on GAN

tf2-speaker-recognition
Speaker recognition in tensorflow 2

ImP-Mat
Practice code for Matlab & Image Processing

bangla-multilingual-llm-eval
Evaluation of Open and Closed-Source Multi-lingual LLMs for Low-Resource Bangla Language

speaker-verification-gmm
Speaker verification using Gaussian Mixture Model (GMM)

gpt-rl-human-feedback

30-days-gre-preparation
Expeditious GRE preparation in a week! https://www.youtube.com/watch?v=VTKowCh43co

deep-learning-goodfellow
Code implementations from the book Deep Learning by Goodfellow

dsp-matlab-cpp
Digital Signal Processing in Matlab & C++

python-bit-tricks
Fastest bit manipulations, bit tricks, optimization code snippets in python

ECAPA-TDNN-tensorflow
Unofficial tensorflow implementation of ECAPA-TDNN (Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification)

online-adaptive-s-norm
Adaptive S normalization suitable for online updates

paraphrase-tool

FEGPA
Face Emotion, Gaze estimation, Pose estimation Annotation tool.

tf-model-server4-yolov3
Simple code base and instructions to convert yolov3 darknet weights to tensorflow .pb to serve on a tensorflow model server

yolov4-docker-grpc
yolov4 environment, docker, grpc server

Cody-Challange
Matlab Cody

go-learn

ACPR
ACPR alpha, an autonomous carom playing robot which improves its performance by playing matches.

activity-recognition-pose
Human Activity Recognition from Skeletal data (Pose)

bangla-tts-hts
HTS based bangla text to speech

fab-lab-python-intro-workshop
Jupyter notebook containing all the code used in the workshop and other resources

face-recognition-ytube
Face recognition in python3.

MOOC
The list of online courses I'm pursuing (active) and relevant docs prepared on the way.

kaldi-speaker-recognition

leetcode-python
My leetcode solutions written in python 3

fast-wavenet-mel2wav
Dummy implementation; will update later

yolov4-cpp-grpc

drug-drug-interaction

Code-Templates
This repo contains code templates I use for programming contests.

sre-torch
Losses and metrics for speaker recognition evaluation in PyTorch

pyspeech-loader
Python Speech Data Loader on GPU

ytb-audio-dataset-prep
Download videos from YouTube channels, bulk-convert mp4 to wav, apply VAD, and store voice-only audio

remote-docker-spawn
Remotely spawning docker containers using REST API

llama-summarization
Parameter-Efficient Fine-Tuning (PEFT) of Llama for text summarization.

pyspid
Python Speaker Identification with Speed

bob-plda-docker
A docker container for bob (bob.learn.em) signal processing, machine learning, and biometrics toolkit in python 3.5

zoom-participants-report
Generating a report with the list of participants from a zoom recording file / screenshot

zabir-nabil.github.io
A blog on sport programming, electronics, signal processing, machine learning and some random stuff!

dockerfiles-deep-learning
Simple Dockerfiles for fast deep learning experiments

torch-audio-mobile
Support for a small set of pytorch audio operations for mobile

speaker-recognition-efn-torch
Speaker Recognition using EfficientNet backbone with PyTorch

recommender-rest-api

free-ml-journals
List of journals in the machine learning, computer vision, biomedical image processing domain without APC

algoml-smd
Design and Analysis of {Algorithms, Data Structures}, and {Machine Learning Models} (for sapienmeetsdeus community)

static-chatbot
A simple static randomized response based chatbot

Face-Dataset-Generator
Reads a video file, detects faces from each frame, saves the frames in some* folder and saves the cropped faces in another* folder.