Repository Details
A system that processes visual input at timed frame intervals and produces a sensible audio aid that stays within human information-processing limits, using image captioning, semantic text comparison, and text-to-speech modules.
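The description above suggests a filtering step between captioning and speech: a new caption is spoken only if enough time has passed and it differs enough from the last spoken one. The following is a minimal sketch of that idea, not the repository's actual code. The function names, the `min_gap_s` and `max_similarity` parameters, and their default values are assumptions for illustration, and a simple word-overlap (Jaccard) score stands in for the semantic text comparison, which in practice would likely use sentence embeddings. Captions are assumed to arrive already generated by an image-captioning model, and the returned list is what would be passed to a text-to-speech module.

```python
import re
from typing import Iterable, List, Optional, Tuple


def token_similarity(a: str, b: str) -> float:
    """Jaccard overlap of lowercase word sets.

    A crude stand-in (assumption) for real semantic comparison.
    """
    ta = set(re.findall(r"[a-z0-9]+", a.lower()))
    tb = set(re.findall(r"[a-z0-9]+", b.lower()))
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def select_captions_to_speak(
    timed_captions: Iterable[Tuple[float, str]],
    min_gap_s: float = 3.0,       # hypothetical pacing limit between utterances
    max_similarity: float = 0.6,  # hypothetical redundancy threshold
) -> List[Tuple[float, str]]:
    """Return the (timestamp, caption) pairs that should be spoken aloud."""
    spoken: List[Tuple[float, str]] = []
    last_text: Optional[str] = None
    last_time: Optional[float] = None
    for t, caption in timed_captions:
        # Pacing: do not speak again until min_gap_s has elapsed.
        if last_time is not None and t - last_time < min_gap_s:
            continue
        # Redundancy: skip captions too similar to the last spoken one.
        if last_text is not None and token_similarity(caption, last_text) > max_similarity:
            continue
        spoken.append((t, caption))
        last_text, last_time = caption, t
    return spoken


if __name__ == "__main__":
    frames = [
        (0.0, "a man walking a dog"),
        (1.0, "a car approaching from the left"),
        (4.0, "a man walking a dog on a street"),
        (8.0, "a red traffic light ahead"),
    ]
    for t, text in select_captions_to_speak(frames):
        print(f"{t:.1f}s: {text}")
```

With these sample frames, the caption at 1.0 s is dropped by the pacing rule and the caption at 4.0 s is dropped as a near-repeat, so only the first and last captions are selected. One limitation of the pacing rule as sketched is that a caption arriving too soon is discarded rather than queued, so a brief but important scene change could be missed.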