Natural Language YouTube Search
Use OpenAI's CLIP neural network to search inside YouTube videos. You can try it by running the notebook on Google Colab.
New
- Integrated into Hugging Face Spaces with Gradio. See demo:
How it works
- Download the YouTube video
- Extract every N-th frame
- Encode all frames using CLIP
- Encode a natural language search query using CLIP
- Find the images that best match the search query
For more details see the notebook.
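The frame sampling and matching steps above can be sketched roughly as follows. This is a minimal illustration and not the notebook's actual code: the helper names (`sample_every_nth`, `normalize`, `best_matches`) are hypothetical, the toy vectors stand in for real CLIP embeddings, and it assumes frames are ranked by cosine similarity to the query.

```python
import math
from typing import List, Sequence


def sample_every_nth(frames: Sequence, n: int) -> list:
    """Keep every n-th frame, starting with the first one."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return list(frames[::n])


def normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def best_matches(
    frame_embeddings: Sequence[Sequence[float]],
    query_embedding: Sequence[float],
    top_k: int = 3,
) -> List[int]:
    """Return the indices of the top_k frames most similar to the query."""
    query = normalize(query_embedding)
    # Cosine similarity between each frame embedding and the query embedding
    scores = [
        sum(a * b for a, b in zip(normalize(emb), query))
        for emb in frame_embeddings
    ]
    # Sort frame indices by score, highest first
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:top_k]


if __name__ == "__main__":
    # Toy 3-dimensional vectors standing in for CLIP embeddings (assumption:
    # in practice these would come from CLIP's image and text encoders).
    frame_vectors = [
        [1.0, 0.0, 0.0],
        [0.0, 1.0, 0.0],
        [0.7, 0.7, 0.0],
        [0.0, 0.0, 1.0],
    ]
    query_vector = [1.0, 0.1, 0.0]
    print(best_matches(frame_vectors, query_vector, top_k=2))
```

In a real run, the image encoder would produce one embedding per sampled frame and the text encoder would produce the query embedding; the indices returned by `best_matches` then point back to the frames to display.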
Examples
Here are some example searches from this YouTube video of a car driving around San Francisco.
"A fire truck"
"Road works"
"People crossing the street"
"The Embarcadero"
"Waiting at the red light"
"Green bike lane"
"A street with tram tracks"
"The Transamerica Pyramid"
Natural language search on Unsplash
You can also try my other project, which searches 2M photos on Unsplash using natural language queries: