• Stars
    star
    299
  • Rank 138,407 (Top 3 %)
  • Language
    Python
  • Created over 1 year ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Multi-language Enhanced LLaMA

logo-pic

MLE-LLaMA: Multi-Language Enhanced LLaMA

This project aims to make LLaMa understand Chinese, and can generate fluency chinese. We are inspired that LLaMa have learned good English expression and a little alignment prompt can makes it capture Chinese.

  • Token vocabulary support for multi-language. We found that llama tokenizer naturally support for Chinese.

  • Fine-tuning llama script.

    (1) download original ckpt from huggingface, and put them into file path ckpt.

    (2) train.py original script must be run on 80G A100 and more techniques should be employed.

    (3) train_lora.py lora fine-tuning using pert.

    Argument Values
    batch size 128 * 8
    epochs 3
    cut length 256
    learning rate 2e-5
    speed 1.02s / it
  • Fine-grained english-chinese alignment dataset. We colleced the high-quality English-Chinese pairs and can be download in google drive.

    We also found that BELLE provide ckpts and chinese dataset, strongly recommended to refer it.

  • Instructing tuning. We use chinese alpaca and GuanacoDataset for instructing tunning.

  • Open source checkpoints, gradio scripts and cases. We found that LLaMA model tends to generate long sentences.

case11

case12

case13

Reference

[1] https://github.com/facebookresearch/llama

[2] https://github.com/tatsu-lab/stanford_alpaca

[3] https://github.com/huggingface/transformers/tree/main/examples/pytorch/language-modeling

[4] https://github.com/tloen/alpaca-lora

[5] https://github.com/LianjiaTech/BELLE

More Repositories

1

Visual-LLaMA

Open LLaMA Eyes to See the World
Python
165
star
2

IEA

Image Editing Anything
Python
107
star
3

DiS

Scalable Diffusion Models with State Space Backbone
Python
101
star
4

Video-Stable-Diffusion

Generate consistent videos with stable diffusion models
Python
45
star
5

Gradient-Free-Textual-Inversion

Gradient-Free Textual Inversion for Personalized Text-to-Image Generation
Python
33
star
6

Stable-Edit

Text-based real image editing with stable diffusion models
Python
25
star
7

Perceiver-Music-Generation

music generation with perceiver-ar model
Python
24
star
8

DeeCap

Dynamic Early Exit for Image Captioning
Python
16
star
9

Vespa

Video Diffusion State Space Models
Python
15
star
10

Visual-ChatGLM

Open ChatGLM Eyes to See the World
Python
13
star
11

PNAIC

Partially Non-Autoregressive Image Captioning
Python
10
star
12

AIO

All In One: General Multimodal Large Language Model
Python
9
star
13

Future-Caption

Efficient modeling of future context for image captioning
Python
8
star
14

Meta-Ensemble

Meta-Ensemble Parameter Learning
Python
8
star
15

Image-Caption-Pytorch

Pytorch implementation for image caption baseline model
Python
8
star
16

UAIC

Uncertainty-away image caption generation
Python
7
star
17

Dialogue-System

Multi-modal dialogue system
Python
5
star
18

Latent-Dynamics

Exploring latent dynamics for visual storytelling
Python
4
star
19

MaskGMT

Masked generative music transformer
Python
4
star
20

Matrix-Analysis-and-Application

References and coding homework in matrix analysis and application course in UCAS
Python
3
star
21

Cleaned-Webvid

Use strategy to achieve clean webvid-10m dataset
Python
3
star
22

Diverse-Image-Caption

Promoting Coherence and Diversity in Image Captioning
Python
3
star
23

Visual-MOSS

Makes MOSS model understand visual information
Python
3
star
24

LQMA

Language Quantized Masked AutoEncoders
Python
2
star
25

DSC

descriptive synthetic captions in dalle3
2
star
26

feizc

2
star
27

MAIC

Memory augmented image captioning
Python
2
star
28

SAIC

Semi-Autoregressive Image Captioning
2
star
29

arXiv-MM

Multimodal dataset for arXiv
Python
2
star
30

DiffuCap

Controllable Image Captioning with Diffusion Model
2
star
31

ACSG

Actor-Critic Sequence Generation for Relative Difference Captioning
2
star
32

Union

Unifying Language-Image Pre-training via Single-Tower Transformer
Python
2
star
33

AAT

Attention-Aligned Transformer for Image Captioning
Python
2
star
34

CLIP-MAE

When clip meet mae and beyond
Python
2
star
35

Chinese-Image-Caption

An image captioner with Chinese language
Python
2
star
36

ViD

Text-to-Image Diffusion Models as Refined Visual Learners
Python
1
star
37

Meta-ViT

Meta-ensemble parameter learning for Vision Transformer
Python
1
star
38

ClipCap

Incorporating CLIP features into Transformer-based image captioning
Python
1
star
39

CLKA

Cross Lingual Knowledge Alignment for Stable Diffusion Models
Python
1
star
40

Diffusion-Model

A tutorial of diffusion model for text-guide image generation
Python
1
star
41

LLaMA-XL

LLaMA model Beyond Length Limitation
1
star
42

GameTag

official implementation for GameTag algorithm
Python
1
star
43

MoE-MLLM

Mixture-of-Experts for Multimodal Large Language Models
Python
1
star