  • Stars: 107
  • Rank: 323,587 (Top 7%)
  • Language: Python
  • Created: over 1 year ago
  • Updated: over 1 year ago

Repository Details

Image Editing Anything

IEA: Image Editing Anything

Image editing with Stable Diffusion and Segment Anything models.

Generally, given a textual prompt or a clicked region, SAM generates candidate masked regions for the source image. We then use a CLIP model to select the target region, which is finally used to generate the edited image with Stable Diffusion.
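The three stages described above can be sketched as a small orchestration function. This is only an illustrative outline, not the code in service.py: the names select_mask and edit_image, and the three callables propose_masks, score_region, and inpaint, are hypothetical stand-ins for a SAM mask generator, a CLIP similarity scorer, and a Stable Diffusion inpainting call.

```python
from typing import Any, Callable, Sequence


def select_mask(
    image: Any,
    masks: Sequence[Any],
    query: str,
    score_region: Callable[[Any, Any, str], float],
) -> Any:
    """Return the candidate mask whose region best matches the text query."""
    if not masks:
        raise ValueError("the segmenter returned no candidate masks")
    # Assumption: score_region returns a higher value for a better match,
    # e.g. a CLIP image-text similarity computed on the masked crop.
    return max(masks, key=lambda mask: score_region(image, mask, query))


def edit_image(
    image: Any,
    query: str,
    edit_prompt: str,
    propose_masks: Callable[[Any], Sequence[Any]],
    score_region: Callable[[Any, Any, str], float],
    inpaint: Callable[[Any, Any, str], Any],
) -> Any:
    """Segment, select, then repaint: the pipeline described in the text."""
    masks = propose_masks(image)                            # stage 1: SAM-style proposals
    mask = select_mask(image, masks, query, score_region)   # stage 2: CLIP-style selection
    return inpaint(image, mask, edit_prompt)                # stage 3: diffusion inpainting
```

Passing the models in as callables keeps the control flow visible and lets each stage be swapped or stubbed independently, for example replacing the text query with a clicked point in the mask-proposal step.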

Use python service.py to initialize the service.

Moreover, we introduce a more advanced inpainting model from diffusers to support better image editing. Use python service_img2img.py to launch the service.
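For orientation, a minimal sketch of how an inpainting pipeline from diffusers is typically called is shown below. It is not taken from service_img2img.py: the helper names snap_to_multiple_of_8 and inpaint_region are hypothetical, and the checkpoint is left as a caller-supplied model_id because the README does not say which model is used.

```python
from typing import Tuple


def snap_to_multiple_of_8(width: int, height: int) -> Tuple[int, int]:
    """Round a size down to multiples of 8, which Stable Diffusion pipelines expect."""
    return max(8, width - width % 8), max(8, height - height % 8)


def inpaint_region(image, mask, prompt: str, model_id: str):
    """Repaint the white area of `mask` in `image` according to `prompt`.

    `image` and `mask` are PIL images of the same size. `model_id` names an
    inpainting checkpoint chosen by the caller (an assumption; the README
    does not specify one).
    """
    # Imported lazily so the size helper stays usable without diffusers installed.
    from diffusers import StableDiffusionInpaintPipeline

    width, height = snap_to_multiple_of_8(*image.size)
    image = image.convert("RGB").resize((width, height))
    mask = mask.convert("L").resize((width, height))

    # Loads on CPU by default; move the pipeline to a GPU for practical speed.
    pipe = StableDiffusionInpaintPipeline.from_pretrained(model_id)
    result = pipe(
        prompt=prompt,
        image=image,
        mask_image=mask,  # white pixels are repainted, black pixels are kept
        width=width,
        height=height,
    )
    return result.images[0]
```

In a long-running service the pipeline would normally be loaded once at startup rather than on every request; it is constructed inside the function here only to keep the sketch self-contained.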

Generated Cases

[Figure: generated cases]

Reference

[1] https://github.com/huggingface/diffusers

[2] https://github.com/facebookresearch/segment-anything

[3] https://github.com/maxi-w/CLIP-SAM

[4] https://github.com/IDEA-Research/Grounded-Segment-Anything/
