IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"hifi3dface
Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".hok_env
Honor of Kings AI Open Environment of Tencentpika
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldigrover
This is a Pytorch implementation of the paper: Self-Supervised Graph Transformer on Large-Scale Molecular Databddm
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech SynthesisFRA-RIR
PCDMs
Implementation code:Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion ModelsDrugOOD
OOD Dataset Curator and Benchmark for AI-aided Drug DiscoveryFrequency_Aug_VAE_MoESR
Latent-based SR using MoE and frequency augmented VAE decodertleague_projpage
3m-asr
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognitionTLeague
RLogist
RLogist = RL (reinforcement learning) + PathologistCogKernel
MDM
MDMUltraDualPathCompression
A Pytorch-based implementation of the compression and decompression module in "Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression".Lodoss
mini-hok
Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such as complexity and accessibility.TriNet
TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.ICML21_OAXE
season
[EMNLP 2022] Salience Allocation as Guidance for Abstractive Summarizationhokoff
Leopard
The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"hifi3dface_projpage
Project page for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".GrndPodcastSum
(ACL 2022) The source code for the paper "Towards Abstractive Grounded Summarization of Podcast Transcripts"OASum
EMNLP21_SemEq
This repo is the code release of EMNLP 2021 conference paper "Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories".learning_singing_from_speech
Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".valuationgame
Arena
MetaLogic
ZED
This is the repository for EMNLP 2022 paper "Efficient Zero-shot Event Extraction with Context-Definition Alignment"machine-translation
Open source on machine translationTPolicies
zebra-inference
Interformer
FOLNet
This repository includes the code for First-Order Logic Network (FOLNet).TLeagueAutoBuild
TImitate
siam
Love Open Source and this site? Check out how you can help us