Katsuya Iida (@kaiidams)
  • Stars
    star
    139
  • Global Rank 162,945 (Top 6 %)
  • Followers 22
  • Following 7
  • Registered almost 7 years ago
  • Most used languages
    C#
    42.9 %
    Python
    42.9 %
    C
    14.3 %
  • Location 🇯🇵 Japan
  • Country Total Rank 4,871
  • Country Ranking
    C#
    472
    Python
    603
    C
    921

Top repositories

1

Kokoro-Speech-Dataset

A public domain single speaker Japanese speech dataset
Python
30
star
2

soundstream-pytorch

Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint
Python
30
star
3

voice100

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
Python
25
star
4

FreeHand-Dataset

Synthesized hand pose images generated by Blender
Python
9
star
5

NeMoOnnxSharp

Text-to-speech and speech recognition, VAD with NVIDIA NeMo and ONNX Runtime for .NET Core.
C#
8
star
6

YamNetUnityDemo

This is prediction demo of TensorFlow YamNet model on Unity Barracuda.
C
6
star
7

Voice100AndroidApp

Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime and Voice100 neural TTS/ASR models on Xamarin. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
C#
6
star
8

Kokoro-Align

Kokoro-Align is a PyTorch speech-transcript alignment tool for LibriVox. It splits audio files in silent positions and find CTC best path to align transcript texts with the audio files.
Python
5
star
9

TransferLearningAudio

ML.NET porting of https://www.tensorflow.org/tutorials/audio/transfer_learning_audio
C#
5
star
10

Voice100Sharp

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
C
5
star
11

LanguageDetection

C# port of https://github.com/shuyo/language-detection
C#
4
star
12

voice100-runtime

Voice100 runtime. Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
Python
2
star
13

Voice100Godot

C#
2
star
14

NeMoOnnxGodot

Neural speech with NVIDIA NeMo and ONNX Runtime
C#
2
star