• Stars
    star
    1,104
  • Rank 41,736 (Top 0.9 %)
  • Language
    Python
  • License
    MIT License
  • Created over 7 years ago
  • Updated about 2 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

advertools - online marketing productivity and analysis tools
Documentation Status http://pepy.tech/badge/advertools

🎉 New: crawl_headers Function for crawling a known list of URLs with the HEAD method only
🎊 New: SEO crawler has new options for following links, include/exclude URL params and/or URL regex.
🎉 New: reverse_dns_lookup Function for getting host information on a list of IP addresses

advertools: productivity & analysis tools to scale your online marketing

A digital marketer is a data scientist.
Your job is to manage, manipulate, visualize, communicate, understand, and make decisions based on data.

You might be doing basic stuff, like copying and pasting text on spread sheets, you might be running large scale automated platforms with sophisticated algorithms, or somewhere in between. In any case your job is all about working with data.

As a data scientist you don't spend most of your time producing cool visualizations or finding great insights. The majority of your time is spent wrangling with URLs, figuring out how to stitch together two tables, hoping that the dates, won't break, without you knowing, or trying to generate the next 124,538 keywords for an upcoming campaign, by the end of the week!

advertools is a Python package that can hopefully make that part of your job a little easier.

Installation

pip install advertools
# OR:
pip3 install advertools

SEM Campaigns

The most important thing to achieve in SEM is a proper mapping between the three main elements of a search campaign

Keywords (the intention) -> Ads (your promise) -> Landing Pages (your delivery of the promise) Once you have this done, you can focus on management and analysis. More importantly, once you know that you can set this up in an easy way, you know you can focus on more strategic issues. In practical terms you need two main tables to get started:

SEO

Probably the most comprehensive online marketing area that is both technical (crawling, indexing, rendering, redirects, etc.) and non-technical (content creation, link building, outreach, etc.). Here are some tools that can help with your SEO

  • SEO crawler: A generic SEO crawler that can be customized, built with Scrapy, & with several features:
    • Standard SEO elements extracted by default (title, header tags, body text, status code, reponse and request headers, etc.)
    • CSS and XPath selectors: You probably have more specific needs in mind, so you can easily pass any selectors to be extracted in addition to the standard elements being extracted
    • Custom settings: full access to Scrapy's settings, allowing you to better control the crawling behavior (set custom headers, user agent, stop spider after x pages, seconds, megabytes, save crawl logs, run jobs at intervals where you can stop and resume your crawls, which is ideal for large crawls or for continuous monitoring, and many more options)
    • Following links: option to only crawl a set of specified pages or to follow and discover all pages through links
  • robots.txt downloader A simple downloader of robots.txt files in a DataFrame format, so you can keep track of changes across crawls if any, and check the rules, sitemaps, etc.
  • XML Sitemaps downloader / parser An essential part of any SEO analysis is to check XML sitemaps. This is a simple function with which you can download one or more sitemaps (by providing the URL for a robots.txt file, a sitemap file, or a sitemap index
  • SERP importer and parser for Google & YouTube Connect to Google's API and get the search data you want. Multiple search parameters supported, all in one function call, and all results returned in a DataFrame
  • Tutorials and additional resources

Text & Content Analysis (for SEO & Social Media)

URLs, page titles, tweets, video descriptions, comments, hashtags are some exmaples of the types of text we deal with. advertools provides a few options for text analysis

  • Word frequency Counting words in a text list is one of the most basic and important tasks in text mining. What is also important is counting those words by taking in consideration their relative weights in the dataset. word_frequency does just that.
  • URL Analysis We all have to handle many thousands of URLs in reports, crawls, social media extracts, XML sitemaps and so on. url_to_df converts your URLs into easily readable DataFrames.
  • Emoji Produced with one click, extremely expressive, highly diverse (3k+ emoji), and very popular, it's important to capture what people are trying to communicate with emoji. Extracting emoji, get their names, groups, and sub-groups is possible. The full emoji database is also available for convenience, as well as an emoji_search function in case you want some ideas for your next social media or any kind of communication
  • extract_ functions The text that we deal with contains many elements and entities that have their own special meaning and usage. There is a group of convenience functions to help in extracting and getting basic statistics about structured entities in text; emoji, hashtags, mentions, currency, numbers, URLs, questions and more. You can also provide a special regex for your own needs.
  • Stopwords A list of stopwords in forty different languages to help in text analysis.
  • Tutorial on DataCamp for creating the word_frequency function and explaining the importance of the difference between absolute and weighted word frequency
  • Text Analysis for Online Marketers An introductory article on SEMrush

Social Media

In addition to the text analysis techniques provided, you can also connect to the Twitter and YouTube data APIs. The main benefits of using advertools for this:

  • Handles pagination and request limits: typically every API has a limited number of results that it returns. You have to handle pagination when you need more than the limit per request, which you typically do. This is handled by default
  • DataFrame results: APIs send you back data in a formats that need to be parsed and cleaned so you can more easily start your analysis. This is also handled automatically
  • Multiple requests: in YouTube's case you might want to request data for the same query across several countries, languages, channels, etc. You can specify them all in one request and get the product of all the requests in one response
  • Tutorials and additional resources
  • A visual tool to check what is trending on Twitter for all available locations
  • A Twitter data analysis dashboard with many options
  • How to use the Twitter data API with Python
  • Extracting entities from social media posts tutorial on Kaggle
  • Analyzing 131k tweets by European Football clubs tutorial on Kaggle
  • An overview of the YouTube data API with Python

Conventions

Function names mostly start with the object you are working on, so you can use autocomplete to discover other options:

kw_: for keywords-related functions
ad_: for ad-related functions
url_: URL tracking and generation
extract_: for extracting entities from social media posts (mentions, hashtags, emoji, etc.)
emoji_: emoji related functions and objects
twitter: a module for querying the Twitter API and getting results in a DataFrame
youtube: a module for querying the YouTube Data API and getting results in a DataFrame
serp_: get search engine results pages in a DataFrame, currently available: Google and YouTube
crawl: a function you will probably use a lot if you do SEO
*_to_df: a set of convenience functions for converting to DataFrames (log files, XML sitemaps, robots.txt files, and lists of URLs)

More Repositories

1

dashboardom

Portfolio of Dash Interactive Dashboards / Mini Apps
Python
41
star
2

google-serp

Google Search Results Pages Dashboard
Python
36
star
3

seo-audit-and-analysis

SEO audit and analysis template with Python and advertools
Julia
27
star
4

advertools_crawler_ui

advertools crawler UI
Jupyter Notebook
27
star
5

trending-twitter

Simple dashboard for getting currently trending hashtags and topics on Twitter
Python
25
star
6

SEMRush_serp_tutorial

SEMRush SERP Tutorial. Using advertools to Extract and Analyze Search Engine Results Pages Data
Jupyter Notebook
14
star
7

dash-file-browser

Dash File Browser
Python
14
star
8

adviz

advertools visualizations
Jupyter Notebook
13
star
9

datacamp_sem

Creating Search Engine Marketing Campaigns for Datacamp With Python
Jupyter Notebook
11
star
10

seo_crawler

Bare-bones Basic SEO Crawler using Python Scrapy | check out the new version -->
Python
10
star
11

radvertools

Productivity and analysis tools for online marketing
R
10
star
12

openai_entity_extraction

Entity extraction using ChatGPT
Julia
9
star
13

dashseo

Making Plotly's Dash SEO-friendly
Python
9
star
14

gsc-audit-analysis

Google Search Console API Data Analysis
Jupyter Notebook
8
star
15

terrorism

Global Terrorism Database Interactive Dashboard
Python
7
star
16

advertools_app

Python
7
star
17

word_frequency

Absolute and Weighted Word Frequency
Jupyter Notebook
7
star
18

semrush_crawling_tutorial

SEMrush Tutorial: Crawling and Scraping for Creating an Influencer Database
Julia
7
star
19

content_similarity

Dash App for finding related content in a list of documents
Jupyter Notebook
6
star
20

life_expectancy

Life Expectancy Interactive Dashboard
Python
5
star
21

robotstxt_app

Visual App for Testing URLs and User-agents blocked by robots.txt Files
Python
4
star
22

sem_on_a_large_scale

Setting Up Search Engine Marketing Campaigns on a Large Scale
Jupyter Notebook
4
star
23

semrush_tutorial_sitemap_analysis

Content Analysis with XML Sitemaps
Jupyter Notebook
4
star
24

custom-crawlers

How to create custom crawlers with advertools
Jupyter Notebook
4
star
25

openai-api-bulk-emails

Creating custom email messages in bulk using OpenAI's API
Julia
4
star
26

wikipedia_top_25

Horizontal bar chart of Wikipedia's weekly top 25 articles
Jupyter Notebook
4
star
27

SERP-analysis-python-advertools

SERP analysis and visualization workflow with advertools, pandas, and plotly
Jupyter Notebook
3
star
28

text_analysis_for_online_marketers

Text Analysis for Online Marketers
Jupyter Notebook
3
star
29

seo_crawl_analysis_template

A Template for Analyzing the Output of the `advertools` `crawl` Function
Jupyter Notebook
3
star
30

migration_dashboard

Net Migration by Country - 2017. Interactive Visualization
Python
3
star
31

slidemodel_programmable_presentations

Code and data for the article on programmable presentations for SlideModel.com
HTML
3
star
32

adv-office-hours-crawling

Companion code for the session on crawling options
Julia
2
star
33

multilevel_interactive_treemap

Interactive Treemap Powered by Plotly and Dash
Jupyter Notebook
2
star
34

flaskboard

Flask(Dash)board
Python
2
star
35

baby_names

Python
2
star
36

nyt-api-analysis

NYTimes.com API Data Analysis
Jupyter Notebook
2
star
37

twitterdash

Twitter Dashboard
Python
1
star
38

boxoffice

Movie box office data for US movies
R
1
star
39

eliasdabbas.github.io

Elias Dabbas Blog
1
star
40

llm-content-evaluation

Evaluate content with LLMs using Google helpful content guidelines
Julia
1
star
41

crawlytics_demo

A demonstration of how to use the advertools.crawlytics module using live examples
Jupyter Notebook
1
star
42

acled

ACLED Data analyses and dashboards
Jupyter Notebook
1
star
43

migration-population

Migration and population dashboard - WorldBank data
Python
1
star
44

gold-reserves

Gold Reserves Dashboard
Python
1
star
45

wikipedia_list_of_massacres

HTML
1
star
46

news_sitemaps_visualization

Visualize the most used words in headlines of a selection of news outlets (Google News sitemaps)
Jupyter Notebook
1
star
47

gsc_country_racing_chart

Jupyter Notebook
1
star