There are no reviews yet. Be the first to send feedback to the community and the maintainers!
Kazakh_TTS
An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers has risen from two to five (three females and two males), and the topic coverage has been diversified.SpeakingFaces
A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer interaction.TurkicASR
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.ISSAI_SAIDA_Kazakh_ASR
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.TurkicTTS
A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.thermal-facial-landmarks-detection
SF-TL54: Thermal Facial Landmark Dataset with Visual Pairs.kaz-image-captioning
ExpansionNet v2 model trained on the COCO dataset with captions translated into KazakhKazNERD
An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.TFW
TFW: Annotated Thermal Faces in the Wild DatasetKazEmoTTS
An open-source Kazakh Emotional Text-to-Speech Datasettelegram-bot-chatgpt
Telegram bot to interact with ChatGPT via voice messagesChest-X-ray-module
Leveraging the recent advances in machine learning and availability of public medical imaging datasets, we created a Free Online X-Ray Diagnostic Tool using deep learning that can determine the X-ray type and visualize the pathology.tutorial_indoor_localization_WiFine
In this tutorial, we will load, preprocess a simplified version of the WiFine dataset. The data will be used to train a location prediction model based (a random forest regressor and a multilayer perceptron)trimodal_person_verification
This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"Central-Asian-Food-Dataset
42 food classes from Kazakh National and Central Asian cuisineMultilingualASR
Kazakh_ASR
Kazakh-Speech-Commands-Dataset
Kazakh Speech Commands Datasetfaces-in-event-streams
This repo contains code and instructions for the detection of faces in event streamsCOVID-19-Simulator
Covid Epidemic SimulatorUzbek_ASR
IMUWiFine
Soyle
Shear-Design-Optimization-of-RC-Column
Deep Neural Network model for the automatic design of rectangular reinforced concrete columns under axial load, biaxial bending and shear forces.AnyFace
Input-Agnostic Face DetectionParticle-Based-COVID19-Simulator
Particle-based COVID-19 Simulator with Contact Tracing and Testingtutorial_COVID-19_epidemic_simulator
The workshop materials for Epidemic simulator and indoor Wi-Fi localization projects.KazParC
An open-source parallel corpus for machine translation across Kazakh, English, Russian, and TurkishWiFine
A finer-level sequential dataset of WiFi received signal strengths (RSS) and corresponding (x, y, z) positions.CLTL_Turkic_ASR
Automatic Speech Recognition for Turkic Languages Using Cross-Lingual Transfer Learning from KazakhColumn-Design-Optimization
Column design optimizationExoMem-AR-Memory
ExoMem: Augmented Reality based human memory enhancement system using AIKazSAnDRA
An open-source Kazakh Sentiment Analysis Dataset of Reviews and Attitudes (KazSAnDRA) and baseline sentiment classification modelscity-identification
This repo contains dataset and models for city classificationAD_classifier
Vision-Language-Models-for-Activity-Recognition-and-Abnormality-Detection-for-Elderly
VLM PrismerZ model for recognition of emergency and non-emergneyc situations via vision and language transformers. PrismerZ is directed on understanding the contextual information and completing image captioning and visiom qiestion answering tasks.city-sustainability-indexes
This repo contains code and models for detecting city sustainability indexesTatarTTS
TatarTTS: An Open-Source Text-to-Speech Synthesis Dataset for the Tatar Languagecargoxray
It is a dataset of X-ray images of cargo transport. The dataset includes images of railcars and trucks with trailers.RL_PTZ_Coverage
Reinforcement learning algorithms for PTZ (pan-tilt-zoom) system with surveillance cameraLove Open Source and this site? Check out how you can help us