bytedance/fc-clip

Stars: 109
Rank: 319,077 (Top 7%)
Language: Python
License: Apache License 2.0
Created over 1 year ago
Updated over 1 year ago

This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

FC-CLIP is a universal model for open-vocabulary image segmentation, consisting of a class-agnostic segmenter, an in-vocabulary classifier, and an out-of-vocabulary classifier. With everything built upon a single shared frozen convolutional CLIP model, FC-CLIP not only achieves state-of-the-art performance on various open-vocabulary segmentation benchmarks, but also has much lower training (3.2 days on 8 V100 GPUs) and testing costs than prior art.
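The three components described above can be illustrated with a minimal NumPy sketch. This is not the repository's code: the function names (`mask_pool`, `classify`, `fuse`), the random stand-in tensors, the temperature, and the fusion weight are all hypothetical, and the weighted geometric mean is just one plausible way to combine two classifiers. It only shows the general idea of pooling frozen backbone features inside each mask proposal and scoring them against class-name embeddings.

```python
import numpy as np

def mask_pool(features, masks):
    """Average a (C, H, W) feature map inside each of N soft masks -> (N, C)."""
    weights = masks.reshape(masks.shape[0], -1)      # (N, H*W)
    flat = features.reshape(features.shape[0], -1)   # (C, H*W)
    pooled = weights @ flat.T                        # (N, C)
    return pooled / (weights.sum(axis=1, keepdims=True) + 1e-6)

def classify(embeddings, text_embeddings, temperature=0.1):
    """Softmax over cosine similarities between mask and class-name embeddings."""
    a = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    b = text_embeddings / np.linalg.norm(text_embeddings, axis=1, keepdims=True)
    logits = a @ b.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    probs = np.exp(logits)
    return probs / probs.sum(axis=1, keepdims=True)

def fuse(p_in, p_out, weight=0.5):
    """Weighted geometric mean of two probability tables (hypothetical weight)."""
    fused = p_in ** (1.0 - weight) * p_out ** weight
    return fused / fused.sum(axis=1, keepdims=True)

rng = np.random.default_rng(0)
C, H, W, N, K = 8, 4, 4, 3, 5
frozen_features = rng.normal(size=(C, H, W))    # stand-in for frozen CLIP features
masks = rng.random(size=(N, H, W))              # stand-in for class-agnostic mask proposals
text_embeddings = rng.normal(size=(K, C))       # stand-in for class-name text embeddings
trained_embeddings = rng.normal(size=(N, C))    # stand-in for trained per-mask embeddings

p_in = classify(trained_embeddings, text_embeddings)                    # in-vocabulary branch
p_out = classify(mask_pool(frozen_features, masks), text_embeddings)    # out-of-vocabulary branch
p_final = fuse(p_in, p_out)
labels = p_final.argmax(axis=1)  # one predicted class index per mask
```

Because both branches read from the same frozen feature map, the backbone only needs to run once per image, which is consistent with the low testing cost mentioned above.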

Installation

See installation instructions.

Getting Started

See Preparing Datasets for FC-CLIP.

See Getting Started with FC-CLIP.

FC-CLIP is also available as a HuggingFace 🤗 Demo.

Model Zoo

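Dataset abbreviations: A-150 = ADE20K, A-847 = ADE20K-Full, PC-59 = Pascal Context 59, PC-459 = Pascal Context 459, PAS-21 = Pascal VOC 21, PAS-20 = Pascal VOC 20. COCO is the training dataset.

| Model | A-150 PQ | A-150 mAP | A-150 mIoU | Cityscapes PQ | Cityscapes mAP | Cityscapes mIoU | Mapillary Vistas PQ | Mapillary Vistas mIoU | A-847 mIoU | PC-59 mIoU | PC-459 mIoU | PAS-21 mIoU | PAS-20 mIoU | COCO PQ | COCO mAP | COCO mIoU | Download |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FC-CLIP (ResNet50) | 17.9 | 9.5 | 23.3 | 40.3 | 21.6 | 53.2 | 15.9 | 24.4 | 7.1 | 50.5 | 12.9 | 75.9 | 89.5 | 50.7 | 40.7 | 58.8 | checkpoint |
| FC-CLIP (ResNet101) | 19.1 | 10.2 | 24.0 | 40.9 | 24.1 | 53.9 | 16.7 | 23.2 | 7.7 | 48.9 | 12.3 | 77.6 | 91.3 | 51.4 | 41.6 | 58.9 | checkpoint |
| FC-CLIP (ResNet50x4) | 21.8 | 11.7 | 26.8 | 42.2 | 23.8 | 54.6 | 17.4 | 24.6 | 8.7 | 54.0 | 13.1 | 79.0 | 92.9 | 52.1 | 42.8 | 60.4 | checkpoint |
| FC-CLIP (ResNet50x16) | 22.5 | 13.6 | 29.4 | 42.0 | 25.6 | 56.0 | 17.8 | 26.1 | 10.3 | 56.4 | 15.7 | 80.7 | 94.5 | 54.4 | 45.0 | 63.3 | checkpoint |
| FC-CLIP (ResNet50x64) | 22.8 | 13.6 | 28.4 | 42.7 | 27.4 | 55.1 | 18.2 | 27.3 | 10.8 | 55.7 | 16.2 | 80.3 | 95.1 | 55.6 | 46.4 | 65.3 | checkpoint |
| FC-CLIP (ConvNeXt-Large) | 26.8 | 16.8 | 34.1 | 44.0 | 26.8 | 56.2 | 18.3 | 27.8 | 14.8 | 58.4 | 18.2 | 81.8 | 95.4 | 54.4 | 44.6 | 63.7 | checkpoint |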
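For readers unfamiliar with the mIoU columns, the sketch below shows how mean intersection-over-union is commonly computed from a confusion matrix. It is a generic illustration, not the evaluation code used to produce the numbers above: the function name `mean_iou` and the toy label maps are hypothetical, and it assumes there is no "ignore" label, which real benchmark evaluators usually handle separately. The table reports the value as a percentage.

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """Mean IoU over classes that appear in either the prediction or the target."""
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    # Confusion matrix: rows are ground-truth classes, columns are predicted classes.
    conf = np.bincount(target * num_classes + pred,
                       minlength=num_classes ** 2).reshape(num_classes, num_classes)
    intersection = np.diag(conf)
    union = conf.sum(axis=0) + conf.sum(axis=1) - intersection
    valid = union > 0  # skip classes absent from both maps
    return float((intersection[valid] / union[valid]).mean())

# Toy 2x3 label maps with three classes.
pred = np.array([[0, 0, 1],
                 [1, 1, 2]])
target = np.array([[0, 1, 1],
                   [1, 1, 2]])

score = mean_iou(pred, target, num_classes=3)
print(score)  # 0.75 (per-class IoU: 0.5, 0.75, 1.0)
```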

Citing FC-CLIP

If you use FC-CLIP in your research, please use the following BibTeX entry.

@inproceedings{yu2023fcclip,
  title={Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP},
  author={Qihang Yu and Ju He and Xueqing Deng and Xiaohui Shen and Liang-Chieh Chen},
  journal={arXiv: 2308.02487},
  year={2023}
}

Acknowledgement
