There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Repository Details
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"