IrisRainbowNeko/pixiv_AI_crawler

Stars
622
Rank 72,195 (Top 2 %)
Language
Python
License
MIT License
Created over 2 years ago
Updated almost 2 years ago

IrisRainbowNeko

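A deep-learning-based AI crawler for high-quality suggestive images on Pixiv that can learn your personal preferences (XP)

AI crawler for high-quality suggestive images on Pixiv

An AI image crawler that learns your preferences

The crawler part is adapted from PixivCrawler. Image recognition and classification use a classification model with ConvNeXt as the backbone, which performs better than Transformer-based models.

Automatic filtering results

Environment setup

For environment setup, refer to ConvNeXt.

Requires pytorch==1.8 and timm==0.3.2.

Download Miniconda, then create and activate a new Python environment: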

conda create -n pixivai python=3.9
conda activate pixivai

Install PyTorch:

conda install pytorch torchvision torchaudio cudatoolkit=11.1 -c pytorch-lts -c conda-forge
# Use this instead if you do not have an NVIDIA GPU
conda install pytorch torchvision torchaudio cpuonly -c pytorch-lts

Install the other dependencies:

pip install -r requirements.txt

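Usage

Download the pretrained weights and place them in the ckpt/ folder:

Download the weights from Baidu Netdisk (extraction code: mmwi), or use the alternative weights download link.

Configure the crawler following the PixivCrawler instructions: set your account and cookie, and choose what to crawl.

Basic crawler parameters are configured in pixiv_crawler/config.py.

Run this command to start the AI crawler:

# Without a keyword, the daily ranking is crawled by default
python AIcrawler.py --ckpt <model weights> --n_images <total number of images> [--keyword <keyword>]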
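The filtering step behind --n_images can be pictured roughly as follows. This is a minimal sketch, not the code in AIcrawler.py: the function name collect_images, the score_fn callback, and the 0.5 threshold are all assumptions made for illustration. It only shows the idea of scoring crawled candidates with a classifier and keeping the ones that pass until the requested count is reached.

```python
def collect_images(candidates, score_fn, n_images, threshold=0.5):
    """Keep candidates whose score passes the threshold, up to n_images.

    candidates: any iterable of crawled items (hypothetical; e.g. file paths).
    score_fn:   hypothetical callback returning a preference score in [0, 1],
                standing in for the trained classifier.
    """
    kept = []
    for item in candidates:
        if len(kept) >= n_images:
            break  # stop crawling once enough images were accepted
        if score_fn(item) >= threshold:
            kept.append(item)
    return kept


if __name__ == "__main__":
    # Stub scores stand in for real model outputs.
    fake_scores = {"a.png": 0.9, "b.png": 0.2, "c.png": 0.7, "d.png": 0.8}
    print(collect_images(fake_scores, fake_scores.get, n_images=2))
```

Because the loop stops as soon as enough images are accepted, a lazy candidate source (such as a generator over ranking pages) is only consumed as far as needed.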

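Train a model on your own preferences

Data processing

Prepare at least 5,000 images. Label them with labeler.py; the dataset labels are stored in JSON format.

Or

Put each class in its own folder and label everything in one step with labeler_folder.py.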

images
|--0
|  |--1.png
|  |--2.png
|
|--1
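The folder-per-class layout above can be turned into a JSON label file along these lines. This is a hedged sketch, not the contents of labeler_folder.py: the function names, the accepted file extensions, and the JSON layout (relative image path mapped to an integer class id) are assumptions, since the source does not describe the actual format.

```python
import json
from pathlib import Path

# Assumed set of image extensions; the real script may accept others.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def label_by_folder(root):
    """Map each image path (relative to root) to the integer name of its class folder."""
    root = Path(root)
    labels = {}
    for class_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        if not class_dir.name.isdigit():
            continue  # only folders named 0, 1, ... are treated as classes
        class_id = int(class_dir.name)
        for img in sorted(class_dir.iterdir()):
            if img.suffix.lower() in IMAGE_SUFFIXES:
                labels[img.relative_to(root).as_posix()] = class_id
    return labels


def save_labels(labels, out_path):
    """Write the label mapping as UTF-8 JSON."""
    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(labels, f, ensure_ascii=False, indent=2)
```

For the tree shown above, label_by_folder("images") would map "0/1.png" and "0/2.png" to class 0.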

Use data_proc.py to split the data into training and test sets and to preprocess the images.

Adjust the parameters, then run the training script:
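A train/test split of a label mapping could look like the sketch below. It is illustrative only and does not reproduce data_proc.py: the function name split_dataset, the 20% test ratio, and the fixed seed are assumptions, and the image preprocessing step is left out entirely.

```python
import random


def split_dataset(labels, test_ratio=0.2, seed=0):
    """Shuffle a {path: class_id} mapping reproducibly and split it into train and test dicts."""
    items = sorted(labels.items())      # sort first so the result depends only on the seed
    random.Random(seed).shuffle(items)  # local RNG, leaves the global random state alone
    n_test = int(len(items) * test_ratio)
    test = dict(items[:n_test])
    train = dict(items[n_test:])
    return train, test


if __name__ == "__main__":
    demo = {f"0/{i}.png": 0 for i in range(5)}
    demo.update({f"1/{i}.png": 1 for i in range(5)})
    train, test = split_dataset(demo)
    print(len(train), len(test))
```

A plain random split like this does not preserve class proportions; with strongly imbalanced preference labels, splitting each class separately would be a reasonable variation.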

bash train.sh

For training parameter settings, refer to ConvNeXt.

genshin_auto_fish

Automatic fishing AI for Genshin Impact based on deep reinforcement learning

HCP-Diffusion

A universal Stable-Diffusion toolbox

DreamArtist-stable-diffusion

stable diffusion webui with contrastive prompt tuning

DreamArtist-sd-webui-extension

DreamArtist for Stable-Diffusion-webui extension

genshin_autoplay_domain

AI that fully automatically farms domains in Genshin Impact

genshin_voice_play

Play Genshin Impact with voice control

HCP-Diffusion-webui

webui for HCP-Diffusion

ML-Danbooru

Anime image tags detector

RobustDet

The official PyTorch implementation of "Adversarially-Aware Robust Object Detector"

ML-Danbooru-webui

webui extension of ML-Danbooru

GenshinMidi

Automatically generate Genshin Impact rhythm-game charts from MIDI files

yuanshen_auto_music

Script for automatic instrument performance in Genshin Impact

TeyvatOCR

Recognize and translate the Teyvat common script in Genshin Impact

anime-ai-detect-fucker

Adversarial attack against AI models that detect AI-generated art

synthesis_watermelon

Android version of the watermelon-merging game (合成大西瓜), built on the Box2D physics engine

yuanshen_draw

Draw pictures with fences in Genshin Impact

genshin_maze

AI that automatically generates and places mazes in Genshin Impact

torch-analyzer

A torch model analyzer

open_cumputer

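Remote power-on for a computer using an ESP8266 and a servo; includes server-side code for NAT traversal and the code for the Android power-on app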

rl3

Reinforcement learning coursework, multi-agent

anime_resource_title_analyzer

Parse title information from anime resource sites (fansub group + series title + resolution + episode number)

WeiChatJump

FunctionWave

A small program that composes music from mathematical functions

edge_charimg

Convert the edge features of an image into character art

rl2

ProgramCalculator

Multi-function programmable scientific calculator; an electronics course design project

BlindWaterMarkKiller

Remove Zhihu blind watermarks

ys_solve

Automatic solver for mechanism puzzles in Genshin Impact

Arduion_3Dcube

Display a 3D cube with Arduino

BluetoothMosue

Android-side source code for a Bluetooth mouse

4D-Draw

A 4D drawing engine

auto_helthy_report

Automatic daily check-in for Central South University

card_QR_door

Dormitory door opened by card swipe or QR code scan

huaji_video

Software that turns videos into "huaji" (滑稽) meme-style versions

Auto_Hand_Font

Automatically generate articles in handwriting style

my-gitpage

MatrixCalculator

A lightweight matrix calculator; a low-end MATLAB

arduino-badapple

Play Bad Apple on an Arduino with an OLED display

sysu_report

Automatic daily health check-in for Sun Yat-sen University

RainbowNekoEngine

Neural network training and inference framework

NekoFormer

All in one basic anime CV model. tagger+siglip+natural language