@AI4Bharat

Top repositories

1

indicnlp_catalog

A collaborative catalog of NLP resources for Indic languages
523
star
2

Indic-BERT-v1

Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
Python
271
star
3

IndicTrans2

Translation models for 22 scheduled languages of India
Python
176
star
4

indicnlp_corpus

Description Describes the IndicNLP corpus and associated datasets
Python
141
star
5

indicTrans

indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
Jupyter Notebook
105
star
6

OpenHands

👐OpenHands : Making Sign Language Recognition Accessible. | **NOTE:** No longer actively maintained. If you are interested to own this and take it forward, please raise an issue
Python
87
star
7

Indic-TTS

Text-to-Speech for languages of India
Jupyter Notebook
75
star
8

IndicLLMSuite

A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
Python
69
star
9

IndicWav2Vec

Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2
Jupyter Notebook
65
star
10

IndicXlit

Transliteration models for 21 Indic languages
Python
58
star
11

IndicBERT

Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME
Python
57
star
12

NPTEL2020-Indian-English-Speech-Dataset

NPTEL2020: Speech2Text dataset for Indian-English Accent
Python
55
star
13

IndicNLP-Transliteration

Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/IndicXlit
Python
54
star
14

Shoonya

Shoonya - Platform to Annotate and label data at scale.
46
star
15

indic-bart

Pre-trained, multilingual sequence-to-sequence models for Indian languages
Python
41
star
16

Chitralekha

Chitralekha - A video transcreation platform for Indic languages, supporting transcription, translation and voice-over
31
star
17

Chitralekha-Backend

Transcribe your videos and translate it into Indic languages.
Python
21
star
18

Indic-Input-Tool-UI

Web Interface for Transliteration for Indic languages.
JavaScript
19
star
19

vistaar

Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR
Python
19
star
20

Shoonya-Backend

DRF-based API server for Shoonya platform
Python
17
star
21

Svarah

Swarah: Indian-English speech dataset collected across the country
Python
14
star
22

DocSim

Synthetically generate random text document images with ground-truth
Python
11
star
23

INCLUDE

Code for INCLUDE paper with pre-trained models
Python
11
star
24

Dhruva-Platform

Dhruva is an open-source platform for serving language AI models at scale.
TypeScript
11
star
25

Fonts-for-Indian-Scripts

Font style transfer for Devanāgarī script using GANs
Python
10
star
26

adapter-efficiency

Python
10
star
27

Shoonya-Frontend

JavaScript
9
star
28

aacl23-mnmt-tutorial

Additional resources from our AACL tutorial
9
star
29

indic-asr-api-backend

Indic-Conformer models for ASR
Python
8
star
30

speech-transcript-cleaning

Perform cleaning and normalization to standardize speech transcripts (train and test) across datasets.
Python
8
star
31

ezAnnotate

Annotation Platform for Machine Learning / Data Science, forked from DataTurks
JavaScript
7
star
32

workshop-nlg-nlu-2022

Material for AI Workshop on Natural Language Understanding and Generation
6
star
33

transactional-voice-ai

The code for transactional voice AI
Python
6
star
34

Indic-Glossary-Explorer

Glossary service for Indian languages
JavaScript
6
star
35

indicnlp.ai4bharat.org

Archived old website for AI4Bhārat Indic-NLP
HTML
5
star
36

Anudesh-Frontend

JavaScript
5
star
37

Chitralekha-Frontend-Lite

Lightweight version of Chitralekha
JavaScript
5
star
38

setu

HTML
5
star
39

IndicLID

Language Identification for Indian languages
Python
4
star
40

INCLUDE-MS-Teams-Integration

An experimental Microsoft Teams integration of Sign Language models for word-level sign recognition
C#
4
star
41

Chitralekha-Frontend

Frontend for Chitralekha platform
JavaScript
4
star
42

Indic-Glossaries

Collection of datasets for glossaries in Indian languages
3
star
43

sign-language.ai4bharat.org

Website for Indian Sign Language Recognition
3
star
44

CTQScorer

Python
3
star
45

Anudesh-Backend

Python
3
star
46

IndicMT-Eval

IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation Metrics for Indian Languages, ACL 2023
HTML
3
star
47

Indic-Swipe

IndicSwipe is a collection of datasets and neural model architectures for decoding swipe gesture inputs on touch-based Indic language keyboards across 7 languages.
Python
3
star
48

Indic-OCR

2
star
49

IndicSUPERB

Python
2
star
50

DMU-DataDaan

Codebase for NLTM DMU's Data Upload System
JavaScript
2
star
51

2022.ai4bharat.org

Old website of AI4Bhārat using TinaCMS
JavaScript
2
star
52

transactional-voice-ai_serving

Deployment code for all the Transactional Voice AI modules.
C++
2
star
53

setu-translate

Python
2
star
54

indic-numtowords

Python
2
star
55

Shoonya-Frontend-Old

Old version of Shoonya UI. Latest repo: https://github.com/AI4Bharat/Shoonya-Frontend
JavaScript
2
star
56

Varnam-Transliteration-UI

Transliteration Web Interface
JavaScript
1
star
57

models.ai4bharat.org

A website to showcase all the models built by the AI4Bharat team
JavaScript
1
star
58

Dhruva-Evaluation-Suite

A tool to perform functional testing and performance testing of the Dhruva Platform
Python
1
star
59

IndicVoices

1
star
60

indicnlp_suite

Natural Language Understanding resources for Indian languages
1
star
61

Input-Tools-By-AI4bharat

Enhance your typing experience in Chrome with AI4Bharat's Input Tools Chrome extension. This extension provides real-time transliteration suggestions for Indian languages, offering seamless integration into your typing workflow.
JavaScript
1
star