Open source audio annotation tool for humans

audino

audino is an open source audio annotation tool. It provides features such as transcription and labeling, which enable annotation for Voice Activity Detection (VAD), Diarization, Speaker Identification, Automatic Speech Recognition, Emotion Recognition and similar tasks.

Features

Current features of the tool include:

  1. Multi-language support
  2. Collaborative annotation
  3. JWT based authentication
  4. User-level project, role and data assignment
  5. Project-level API Key based datapoint creation
  6. Emoji support
  7. Flexibility in label creation

Usage

Note: Please see the getting started guide for configuration and concrete usage.

Please install the following dependencies to run audino on your system:

  1. git [tested on v2.23.0]
  2. docker [tested on v19.03.8, build afacb8b]
  3. docker-compose [tested on v1.25.5, build 8a1c60f6]
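
Before cloning, it can help to confirm that the three tools above are on your PATH. The following is a minimal sketch, not part of audino itself: the helper name `check_dep` is hypothetical, and it only prints the first line of each tool's `--version` output so you can compare it against the tested versions by eye.

```shell
#!/bin/sh
# Hypothetical helper: report whether a tool is on PATH and print its version line.
check_dep() {
    if command -v "$1" >/dev/null 2>&1; then
        # Keep only the first line of the version output.
        echo "$1: $("$1" --version 2>&1 | head -n 1)"
    else
        echo "$1: not found"
        return 1
    fi
}

# Check every dependency listed above; keep going even if one is missing.
for tool in git docker docker-compose; do
    check_dep "$tool" || true
done
```

Any tool reported as "not found" needs to be installed before you continue.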

Clone the repository

$ git clone https://github.com/midas-research/audino.git
$ cd audino

Note for Windows users: Configure git to handle line endings correctly; otherwise the services might throw an error and fail to start. You can do this by cloning the project this way:

$ git clone https://github.com/midas-research/audino.git --config core.autocrlf=input

Deploy

You can run the project with the default configuration or modify it to suit your needs. Note: Before proceeding, you might need to give docker sudo access or run the commands listed below with sudo.

To build the services, run:

$ docker-compose -f docker-compose.prod.yml build

To bring up the services, run:

$ docker-compose -f docker-compose.prod.yml up

Then, in a browser, go to http://0.0.0.0/ to view the application.

To bring down the services, run:

$ docker-compose -f docker-compose.prod.yml down

Contribute to the project

To contribute to the project, use the development configuration, which is intended for working on the project, fixing bugs and making contributions. Note: Before proceeding, you might need to give docker sudo access or run the commands listed below with sudo.

To build the services, run:

$ docker-compose -f docker-compose.dev.yml build

To bring up the services, run:

$ docker-compose -f docker-compose.dev.yml up

Then, in a browser, go to http://localhost:3000/ to view the application.

To bring down the services, run:

$ docker-compose -f docker-compose.dev.yml down

Tutorials

We provide a set of tutorials that walk users through specific tasks. If you feel something is missing and should be included, please open an issue.

Citation

Currently, the paper is under review. For now, please cite it as:

@misc{grover2020audino,
    title={audino: A Modern Annotation Tool for Audio and Speech},
    author={Manraj Singh Grover and Pakhi Bamdev and Yaman Kumar and Mika Hama and Rajiv Ratn Shah},
    year={2020},
    eprint={2006.05236},
    archivePrefix={arXiv},
    primaryClass={cs.SD}
}

License

MIT © MIDAS, IIIT Delhi
