• Stars
    star
    1
  • Language
  • License
    Creative Commons ...
  • Created 5 months ago
  • Updated 5 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Natural Language Understanding resources for Indian languages

More Repositories

1

indicnlp_catalog

A collaborative catalog of NLP resources for Indic languages
526
star
2

Indic-BERT-v1

Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
Python
271
star
3

IndicTrans2

Translation models for 22 scheduled languages of India
Python
181
star
4

indicnlp_corpus

Description Describes the IndicNLP corpus and associated datasets
Python
147
star
5

indicTrans

indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
Jupyter Notebook
110
star
6

Indic-TTS

Text-to-Speech for languages of India
Jupyter Notebook
91
star
7

OpenHands

👐OpenHands : Making Sign Language Recognition Accessible. | **NOTE:** No longer actively maintained. If you are interested to own this and take it forward, please raise an issue
Python
87
star
8

IndicLLMSuite

A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
Python
72
star
9

IndicWav2Vec

Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
Jupyter Notebook
71
star
10

IndicXlit

Transliteration models for 21 Indic languages
Python
64
star
11

NPTEL2020-Indian-English-Speech-Dataset

NPTEL2020: Speech2Text dataset for Indian-English Accent
Python
59
star
12

IndicBERT

Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME
Python
58
star
13

IndicNLP-Transliteration

Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/IndicXlit
Python
57
star
14

Shoonya

Shoonya - Platform to Annotate and label data at scale.
46
star
15

indic-bart

Pre-trained, multilingual sequence-to-sequence models for Indian languages
Python
42
star
16

vistaar

Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR
Python
34
star
17

Chitralekha

Chitralekha - A video transcreation platform for Indic languages, supporting transcription, translation and voice-over
32
star
18

Indic-Input-Tool-UI

Web Interface for Transliteration for Indic languages.
JavaScript
22
star
19

Chitralekha-Backend

Transcribe your videos and translate it into Indic languages.
Python
21
star
20

Shoonya-Backend

DRF-based API server for Shoonya platform
Python
17
star
21

Svarah

Swarah: Indian-English speech dataset collected across the country
Python
16
star
22

INCLUDE

Code for INCLUDE paper with pre-trained models
Python
12
star
23

indic-asr-api-backend

Indic-Conformer models for ASR
Python
11
star
24

DocSim

Synthetically generate random text document images with ground-truth
Python
11
star
25

Dhruva-Platform

Dhruva is an open-source platform for serving language AI models at scale.
TypeScript
11
star
26

Shoonya-Frontend

JavaScript
10
star
27

Fonts-for-Indian-Scripts

Font style transfer for Devanāgarī script using GANs
Python
10
star
28

aacl23-mnmt-tutorial

Additional resources from our AACL tutorial
10
star
29

adapter-efficiency

Python
10
star
30

speech-transcript-cleaning

Perform cleaning and normalization to standardize speech transcripts (train and test) across datasets.
Python
8
star
31

ezAnnotate

Annotation Platform for Machine Learning / Data Science, forked from DataTurks
JavaScript
7
star
32

transactional-voice-ai

The code for transactional voice AI
Python
6
star
33

workshop-nlg-nlu-2022

Material for AI Workshop on Natural Language Understanding and Generation
6
star
34

Indic-Glossary-Explorer

Glossary service for Indian languages
JavaScript
6
star
35

Anudesh-Frontend

JavaScript
6
star
36

setu

HTML
6
star
37

IndicLID

Language Identification for Indian languages
Python
5
star
38

indicnlp.ai4bharat.org

Archived old website for AI4Bhārat Indic-NLP
HTML
5
star
39

Chitralekha-Frontend-Lite

Lightweight version of Chitralekha
JavaScript
5
star
40

sign-language.ai4bharat.org

Website for Indian Sign Language Recognition
4
star
41

INCLUDE-MS-Teams-Integration

An experimental Microsoft Teams integration of Sign Language models for word-level sign recognition
C#
4
star
42

Chitralekha-Frontend

Frontend for Chitralekha platform
JavaScript
4
star
43

Indic-Glossaries

Collection of datasets for glossaries in Indian languages
3
star
44

IndicSUPERB

Python
3
star
45

transactional-voice-ai_serving

Deployment code for all the Transactional Voice AI modules.
C++
3
star
46

CTQScorer

Python
3
star
47

Anudesh-Backend

Python
3
star
48

IndicMT-Eval

IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation Metrics for Indian Languages, ACL 2023
HTML
3
star
49

indic-numtowords

Python
3
star
50

Indic-Swipe

IndicSwipe is a collection of datasets and neural model architectures for decoding swipe gesture inputs on touch-based Indic language keyboards across 7 languages.
Python
3
star
51

Indic-OCR

2
star
52

DMU-DataDaan

Codebase for NLTM DMU's Data Upload System
JavaScript
2
star
53

2022.ai4bharat.org

Old website of AI4Bhārat using TinaCMS
JavaScript
2
star
54

setu-translate

Python
2
star
55

Shoonya-Frontend-Old

Old version of Shoonya UI. Latest repo: https://github.com/AI4Bharat/Shoonya-Frontend
JavaScript
2
star
56

Varnam-Transliteration-UI

Transliteration Web Interface
JavaScript
1
star
57

models.ai4bharat.org

A website to showcase all the models built by the AI4Bharat team
JavaScript
1
star
58

Dhruva-Evaluation-Suite

A tool to perform functional testing and performance testing of the Dhruva Platform
Python
1
star
59

IndicVoices

1
star
60

Input-Tools-By-AI4bharat

Enhance your typing experience in Chrome with AI4Bharat's Input Tools Chrome extension. This extension provides real-time transliteration suggestions for Indian languages, offering seamless integration into your typing workflow.
JavaScript
1
star