• Stars
    star
    27
  • Rank 878,376 (Top 18 %)
  • Language
    Jupyter Notebook
  • Created over 1 year ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

ExpansionNet v2 model trained on the COCO dataset with captions translated into Kazakh

More Repositories

1

Kazakh_TTS

An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.
Shell
108
star
2

SpeakingFaces

A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.
Python
74
star
3

TurkicASR

A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
Python
51
star
4

ISSAI_SAIDA_Kazakh_ASR

the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
Shell
43
star
5

TurkicTTS

A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.
Python
39
star
6

thermal-facial-landmarks-detection

SF-TL54: Thermal Facial Landmark Dataset with Visual Pairs.
Jupyter Notebook
34
star
7

KazNERD

An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.
Python
22
star
8

TFW

TFW: Annotated Thermal Faces in the Wild Dataset
Jupyter Notebook
20
star
9

KazEmoTTS

An open-source Kazakh Emotional Text-to-Speech Dataset
Python
17
star
10

telegram-bot-chatgpt

Telegram bot to interact with ChatGPT via voice messages
Python
16
star
11

Chest-X-ray-module

Leveraging the recent advances in machine learning and availability of public medical imaging datasets, we created a Free Online X-Ray Diagnostic Tool using deep learning that can determine the X-ray type and visualize the pathology.
Python
14
star
12

trimodal_person_verification

This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"
Python
11
star
13

Central-Asian-Food-Dataset

42 food classes from Kazakh National and Central Asian cuisine
Python
11
star
14

tutorial_indoor_localization_WiFine

In this tutorial, we will load, preprocess a simplified version of the WiFine dataset. The data will be used to train a location prediction model based (a random forest regressor and a multilayer perceptron)
Jupyter Notebook
11
star
15

MultilingualASR

Shell
10
star
16

Kazakh_ASR

Shell
10
star
17

Kazakh-Speech-Commands-Dataset

Kazakh Speech Commands Dataset
Jupyter Notebook
9
star
18

COVID-19-Simulator

Covid Epidemic Simulator
JavaScript
9
star
19

Uzbek_ASR

Shell
9
star
20

faces-in-event-streams

This repo contains code and instructions for the detection of faces in event streams
Python
8
star
21

IMUWiFine

Python
7
star
22

Soyle

Python
5
star
23

Shear-Design-Optimization-of-RC-Column

Deep Neural Network model for the automatic design of rectangular reinforced concrete columns under axial load, biaxial bending and shear forces.
Python
4
star
24

AnyFace

Input-Agnostic Face Detection
Jupyter Notebook
4
star
25

Particle-Based-COVID19-Simulator

Particle-based COVID-19 Simulator with Contact Tracing and Testing
MATLAB
3
star
26

tutorial_COVID-19_epidemic_simulator

The workshop materials for Epidemic simulator and indoor Wi-Fi localization projects.
Python
3
star
27

WiFine

A finer-level sequential dataset of WiFi received signal strengths (RSS) and corresponding (x, y, z) positions.
3
star
28

KazParC

An open-source parallel corpus for machine translation across Kazakh, English, Russian, and Turkish
Jupyter Notebook
3
star
29

CLTL_Turkic_ASR

Automatic Speech Recognition for Turkic Languages Using Cross-Lingual Transfer Learning from Kazakh
Shell
2
star
30

Column-Design-Optimization

Column design optimization
Python
2
star
31

ExoMem-AR-Memory

ExoMem: Augmented Reality based human memory enhancement system using AI
C#
2
star
32

KazSAnDRA

An open-source Kazakh Sentiment Analysis Dataset of Reviews and Attitudes (KazSAnDRA) and baseline sentiment classification models
Python
2
star
33

TatarTTS

TatarTTS: An Open-Source Text-to-Speech Synthesis Dataset for the Tatar Language
2
star
34

KazQAD

An open-source Kazakh Question Answering Dataset
1
star
35

AD_classifier

Jupyter Notebook
1
star
36

Vision-Language-Models-for-Activity-Recognition-and-Abnormality-Detection-for-Elderly

VLM PrismerZ model for recognition of emergency and non-emergneyc situations via vision and language transformers. PrismerZ is directed on understanding the contextual information and completing image captioning and visiom qiestion answering tasks.
1
star
37

city-identification

This repo contains dataset and models for city classification
Python
1
star
38

city-sustainability-indexes

This repo contains code and models for detecting city sustainability indexes
Python
1
star
39

cargoxray

It is a dataset of X-ray images of cargo transport. The dataset includes images of railcars and trucks with trailers.
Python
1
star