• Stars
    star
    7,470
  • Rank 4,901 (Top 0.1 %)
  • Language
  • License
    MIT License
  • Created over 6 years ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas

Awesome Deep Learning Project Ideas

Awesome

A curated list of practical deep learning and machine learning project ideas

  • 30+ ideas
  • Relevant to both the academia and industry
  • Ranges from beginner friendly to research projects

Contents

  • Hackathon Ideas - Project ideas unlocked by use of Large Language Models, specially text to text -- note that a lot of the text to text ideas can also be buit a lot better with LLMs now!

  • Text - With some topics about Natural language processing

  • Forecasting - Most of the topics in this section is about Time Series and similar forecasting challenges

  • Recommendation Systems

  • Vision - With topics about image and video processing

  • Music and Audio - These topics are about combining ideas from language and audio to understand music

  • Conclusion


Hackathon Ideas

  • Developer Ideas

    • Text to cmd for terminal: Take user intent in terminal e.g.

      $ask "how to list all files with details"
      > Execute "ls -l"? [y/N] y 
      $ls -l
    • Build and edit YAMLs using natural language e.g. Kubernetes and other form of config files

      • Kor for ideas on how this is done for JSON
      • Can be use-case specific. Build pipelines? Kube?
    • Mobile Android/iOS SDK for Stable Diffusion inference

  • Voice powered Experiences

    • Audio Conversation with chatGPT, can combine with fast Text-to-Speech e.g. Eleven Labs to have a two-way conversation
    • Telegram/WhatsApp bot to get audio and save as text with metadata into mem.ai or Roam Research or Obsidian
  • Edit image by giving instructions of what you want to do: SeeChatGPT and playgroundai.com as examples

  • Semantic search over any media

  • Text to Music Generation

  • Knowledge Base QA aka Answer Engines

    • Take any plaintext dataset e.g. State of the Union address and build on top of that image
    • Can use this over Video Subtitles to search and QA over videos as well, by mapping back to source
  • Guided Summarisation/Rewriting

    • Take specific questions which the user might have about a large text dataset e.g. a novel or book and include that in your summary of the piece
    • Pay attention to specific entities and retell the events which happen in a story with attention to that character
  • ControlNet + Stable Diffusion for Aethetic Control

    • Build tooling using diffusers which takes in a set of photos, finetunes a model (LoRA) on a person, detects face and moves it to a new aesthetic e.g. futuristic neon punk, grunge rock, Studio Ghibli. Can also add InstructPix2Pix to give user more control.
  • Text to Code/SQL

    • Use code understanding to convert use query to SQL or another executable programming language, including Domain Specific Languages
    • Here is an example of the same: qabot

Text

Natural Language Understanding

  • Sentence to Sentence semantic similarity

    • Can you identify question pairs that have the same intent or meaning?
    • Dataset: Quora question pairs with similar questions marked
  • Fight online abuse

  • Open Domain question answering

  • Automatic text summarization

    • Can you create a summary with the major points of the original document?
    • Abstractive (write your own summary) and Extractive (select pieces of text from original) are two popular approaches
    • Dataset: CNN and DailyMail News Pieces by Google DeepMind
  • Copy-cat Bot

Check mlm/blog for some hints.

Forecasting

  • Univariate Time Series Forecasting

  • Multi-variate Time Series Forecasting

    • How polluted will your town's air be? Pollution Level Forecasting
    • Dataset: Air Quality dataset
  • Demand/load forecasting

  • Predict Blood Donation

    • We're interested in predicting if a blood donor will donate within a given time window.
    • More on the problem statement at Driven Data.
    • Dataset: UCI ML Datasets Repo

Recommendation systems

  • Movie Recommender

    • Can you predict the rating a user will give on a movie?
    • Do this using the movies that user has rated in the past, as well as the ratings similar users have given similar movies.
    • Dataset: Netflix Prize and MovieLens Datasets
  • Search + Recommendation System

    • Predict which Xbox game a visitor will be most interested in based on their search query
    • Dataset: BestBuy
  • Can you predict Influencers in the Social Network?

    • How can you predict social influencers?
    • Dataset: PeerIndex

Vision

  • Image classification

  • Bone X-Ray competition

    • Can you identify if a hand is broken from a X-ray radiographs automatically with better than human performance?
    • Stanford's Bone XRay Deep Learning Competition with MURA Dataset
  • Image Captioning

    • Can you caption/explain the photo a way human would?
    • Dataset: MS COCO
  • Image Segmentation/Object Detection

  • Large-Scale Video Understanding

    • Can you produce the best video tag predictions?
    • Dataset: YouTube 8M
  • Video Summarization

  • Style Transfer

    • Can you recompose images in the style of other images?
    • Dataset: fzliu on GitHub shared target and source images with results
  • Chest XRay

    • Can you detect if someone is sick from their chest XRay? Or guess their radiology report?
    • Dataset: MIMIC-CXR at Physionet
  • Clinical Diagnostics: Image Identification, classification & segmentation

    • Can you help build an open source software for lung cancer detection to help radiologists?
    • Link: Concept to clinic challenge on DrivenData
  • Satellite Imagery Processing for Socioeconomic Analysis

    • Can you estimate the standard of living or energy consumption of a place from night time satellite imagery?
    • Reference for Project details: Stanford Poverty Estimation Project
  • Satellite Imagery Processing for Automated Tagging

    • Can you automatically tag satellite images with human features such as buildings, roads, waterways and so on?
    • Help free the manual effort in tagging satellite imagery: Kaggle Dataset by DSTL, UK

Music

  • Music/Audio Recommendation Systems

  • Music Genre recognition using neural networks

    • Can you identify the musical genre using their spectrograms or other sound information?
    • Datasets: FMA or GTZAN on Keras
    • Get started with Librosa for feature extraction

FAQ

  • Can I use the ideas here for my thesis? Yes, totally! I'd love to know how it went.

  • Do you have any advice before I start my project? Advice for Short Term Machine Learning Projects by Tim R. is a pretty good starting point!

  • How can I add my ideas here? Just send a pull request and we'll discuss?

  • Hey, something is wrong here! Yikes, I am sorry. Please tell me by raising a GitHub issue.

    I'll fix it as soon as possible.

Acknowledgements

Problems are motivated by the ones shared at:

Credit

Built with lots of keyboard smashing and copy-pasta love by NirantK. Find me on Twitter!

License

This repository is licensed under the MIT License. Please see the LICENSE file for more details.

More Repositories

1

NLP_Quickbook

NLP in Python with Deep Learning
Jupyter Notebook
560
star
2

best-of-jupyter

Jupyter Tips, Tricks, Best Practices with Sample Code for Productivity Boost
420
star
3

hindi2vec

State-of-the-Art Language Modeling and Text Classification in Hindi Language
Jupyter Notebook
220
star
4

pytorch-web-deploy

Simple, fast web deployment for your PyTorch models
Python
70
star
5

agentai

Text to Python Objects via a LLM Function Call
Python
54
star
6

coronaIndia

Experiments & NLP Deployments for CoronaVirus Related Work
Jupyter Notebook
34
star
7

Hinglish

Hinglish Text Classification
Jupyter Notebook
30
star
8

breakoutlist-india

High potential opportunities for ambitious engineers, designers, data people and future founders. The best teams to join.
27
star
9

llama2demo

Python
14
star
10

Twitter-Geographical-Sentiment-Analysis

Finds the Happiest US and Indian State based on Sentimental Analysis of Twitter Data
Python
13
star
11

keras-practice

Notebooks covering Intro to CNN, Transfer Learning using VGG16
Jupyter Notebook
12
star
12

Genetic-Algorithm-Self-Study-Notes

Notes, Reading Sources and Bibliography on Genetic Algorithms
8
star
13

qdrant_tools

Python Tools to use with the Qdrant Python Client
Jupyter Notebook
7
star
14

nirantk.github.io

Jupyter Notebook
6
star
15

Text-Summarization

C
4
star
16

awesome-vectordb

Everything you need to decide and work with VectorDBs
Python
4
star
17

knee-xrays

Exploratory Repository
Jupyter Notebook
3
star
18

fitz-wrapper

CLI Utilities for PDF to Image Conversion, built with Py3
Python
3
star
19

OnDeckMLChallenge

Jupyter Notebook
3
star
20

fastvector

Python
3
star
21

DSA-BITS-Masti

Data Structures and Algorithms at BITS Pilani
C
3
star
22

experiments

Repository for Experimental Code
HTML
2
star
23

quickstart

Shell
2
star
24

comehomeandbuild

HTML
2
star
25

MITx-Analytics-Edge-Coursework

Code, Lecture Slides and Data from edx.org/course/analytics-edge-mitx-15-071x-0
HTML
2
star
26

cohere-learn

Utils which wrap around Cohere API: FewShotClassify and more coming soon
Python
1
star
27

Noor

Bringing Light to What We are Taught :)
HTML
1
star
28

interview_practice

Archive
C++
1
star
29

Aditi

1
star
30

latest-news-ncert

Link educational topics to latest NEWS
Python
1
star
31

julie

Julie is a blogging assistant and linter for AI Hackers wanting to make their work more accessible
Python
1
star
32

qdrant-course

Jupyter Notebook
1
star
33

CovidSeer

Complimentary Repo for Publishing Public facing Covid India work
Jupyter Notebook
1
star
34

go-demo

Demo code for the Golang lecture by @theonewolf
Go
1
star
35

bq

Binary Quantization in Numpy
1
star