• Stars
    star
    117
  • Rank 301,828 (Top 6 %)
  • Language
    Python
  • License
    MIT License
  • Created about 6 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Python framework to scrape Pastebin pastes and analyze them

Logo

pastepwn - Paste-Scraping Python Framework

Build Status PyPI version Coverage Status Codacy Badge

Pastebin is a very helpful tool to store or rather share ascii encoded data online. In the world of OSINT, pastebin is being used by researchers all around the world to retrieve e.g. leaked account data, in order to find indicators about security breaches.

Pastepwn is a framework to scrape pastes and scan them for certain indicators. There are several analyzers and actions to be used out-of-the-box, but it is also easily extensible - you can create your own analyzers and actions on the fly.

Please note: This framework is not to be used for illegal actions. It can be used for querying public Pastebin pastes for e.g. your username or email address in order to increase your own security.

⚠️ Important note

In April 2020 Pastebin disabled access to their scraping API for a short period of time. At first people weren't able to access the scraping API in any way, but later on they re-enabled access to the API setup page. But since then it isn't possible to scrape "text" pastes. Only pastes with any kind of syntax set. That reduces the amount of pastes to a minimum, which reduced the usefulness of this tool.

Setting up pastepwn

To use the pastepwn framework you need to follow these simple steps:

  1. Make sure to have a Pastebin premium account!
  2. Install pastepwn via pip (pip3 install pastepwn
  3. Create a file (e.g. main.py) in your project root, where you put your code in²
  4. Fill that file with content - add analyzers and actions. Check the example implementation.

¹ Note that pastepwn only works with python3.6 or above
² (If you want to store all pastes, make sure to set up a mongodb, mysql or sqlite instance)

Behind a proxy

There are 2 ways to use this tool behind a proxy:

  • Define the following environment variables: HTTP_PROXY, HTTPS_PROXY, NO_PROXY.
  • When initializing the PastePwn object, use the proxies argument. proxies is a dict as defined in requests' documentation.

Troubleshooting

If you are having troubles, check out the wiki pages first. If your question/issue is not resolved there, feel free to create an issue or contact me on Telegram.

Roadmap and ToDos

Check the bug tracker on GitHub to get an up-to-date status about features and ToDos.

  • REST API for querying paste data (will be another project)
  • Add a helpful wiki with instructions and examples

More Repositories

1

Python-BlackJackBot

A Telegram bot written in Python to play the game BlackJack alone or with your friends
Python
76
star
2

certstream-server-go

This project aims to be a drop-in replacement for the certstream server by Calidog. This tool aggregates, parses, and streams certificate data from multiple certificate transparency logs via websocket connections to the clients.
Go
57
star
3

pyBrematic

Python library for controlling Brematic remote power outlets and potentially other stuff
Python
12
star
4

Python-GeizhalsBot

A bot to get notified about changes of the price of a Geizhals.de wishlist or product
Python
12
star
5

certleak

Python framework for collecting and analyzing TLS certificate data via the Certificate Transparency Network
Python
9
star
6

backmeup

A lightweight, local backup utility for the CLI written in Go
Go
7
star
7

motd-files

The motd files I use on my servers
Shell
7
star
8

GoGeizhalsBot

This repository holds a rewrite of my Python-GeizhalsBot Telegram bot in Go
Go
5
star
9

TelegramBot

This is the deprecated code for my Telegram Bot. Please check the new project here: https://github.com/d-Rickyy-b/Python-BlackJackBot
Python
4
star
10

Monzo-Meter

A little code snippet & instructions of how to create a monzo-meter
Python
4
star
11

webstress

Websocket stress tool developed in Go
Go
3
star
12

GolangTour-Dark

A dark theme for the "Tour of Go" introduction for the Go programming language
CSS
2
star
13

Python-GuardianBot

A bot to keep your groups free of ads and spam
Python
2
star
14

webogram-x

Dark theme (css) for Telegram web based on colors from other Telegram X apps
CSS
2
star
15

mockBot

A Telegram bot to mock your friends
Go
1
star
16

node-red-contrib-ffpass

Node-RED module that enables you to control your Ford car via the FordPass API.
JavaScript
1
star
17

TSF-Parser

Java application to parse and download the TSF leaderboards to a local database
Java
1
star
18

MGGVertretungsplan

A simple app to display the current school timetable for the Markgrafen Gymnasium
Java
1
star
19

Java-BlackJackBot

Java
1
star
20

myShrugBot

A Telegram bot to express your indifference by sending a shrug!
Go
1
star
21

TelegramBotAPI-Dark

A dark theme for the Telegram bot API docs
CSS
1
star