• Stars
    star
    128
  • Rank 279,436 (Top 6 %)
  • Language
  • Created about 10 years ago
  • Updated over 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A dataset of the battles in the War of the Five Kings from George R.R. Martin's A Song Of Ice And Fire series.

The War Of The Five Kings, A Dataset

This repo contains a dataset of the battles in the War of the Five Kings from George R.R. Martin's A Song Of Ice And Fire series.

This small side project came about because I outside of my regular projects I have been looking for a small-n dataset to use as a data science teaching tool. While "non-fiction" datasets are, frankly, more interesting, they also come with the mess, complications, and grey-areas associated with the real world. On the other end of the spectrum, randomly generated datasets can be difficult to use as a teaching tool because their outputs (e.g. visualizations or analyses) are meaningless. So, instead of generating a dataset from random-values, the George R.R. Martin series provides ample source material to create a dataset that is both fictional and with (for anyone that has read the series) built-in context. Also, I had a few hours to kill in an airport.

Here is an exploratory analysis of the dataset on nbviewer

Codebook

This readme file acts as the codebook for the dataset.

Level Of Observation:

  • The battles of the War of the Five Kings

Variables:

  • name: String variable. The name of the battle.
  • year: Numeric variable. The year of the battle.
  • battle_number: Numeric variable. A unique ID number for the battle.
  • attacker_king: Categorical. The attacker's king. A slash indicators that the king charges over the course of the war. For example, "Joffrey/Tommen Baratheon" is coded as such because one king follows the other in the Iron Throne.
  • defender_king: Categorical variable. The defender's king.
  • attacker_1: String variable. Major house attacking.
  • attacker_2: String variable. Major house attacking.
  • attacker_3: String variable. Major house attacking.
  • attacker_4: String variable. Major house attacking.
  • defender_1: String variable. Major house defending.
  • defender_2: String variable. Major house defending.
  • defender_3: String variable. Major house defending.
  • defender_4: String variable. Major house defending.
  • attacker_outcome: Categorical variable. The outcome from the perspective of the attacker. Categories: win, loss, draw.
  • battle_type: Categorical variable. A classification of the battle's primary type. Categories:
    • pitched_battle: Armies meet in a location and fight. This is also the baseline category.
    • ambush: A battle where stealth or subterfuge was the primary means of attack.
    • siege: A prolonged of a fortied position.
    • razing: An attack against an undefended position
  • major_death: Binary variable. If there was a death of a major figure during the battle.
  • major_capture: Binary variable. If there was the capture of the major figure during the battle.
  • attacker_size: Numeric variable. The size of the attacker's force. No distinction is made between the types of soldiers such as cavalry and footmen.
  • defender_size: Numeric variable. The size of the defenders's force. No distinction is made between the types of soldiers such as cavalry and footmen.
  • attacker_commander: String variable. Major commanders of the attackers. Commander's names are included without honoric titles and commandders are seperated by commas.
  • defender_commander: String variable. Major commanders of the defener. Commander's names are included without honoric titles and commandders are seperated by commas.
  • summer: Binary variable. Was it summer?
  • location: String variable. The location of the battle.
  • region: Categorical variable. The region where the battle takes place. Categories: Beyond the Wall, The North, The Iron Islands, The Riverlands, The Vale of Arryn, The Westerlands, The Crownlands, The Reach, The Stormlands, Dorne
  • note: String variable. Coding notes regarding individual observations.

Source:

Are you a huge ASOIAF fan?

Not really. I just love data - all data.

This data is mirrored and can be queried via API here

More Repositories

1

code_py

A collection of well-commented code snippets for data science
HTML
676
star
2

mlai

Notes on machine learning and artificial intelligence.
HTML
548
star
3

sklearn-flask-docker

An example of deploying a sklearn model using Flask using a Docker container.
Python
115
star
4

code_r

Collection of R Scripts for Reuse
R
106
star
5

election_day_2016_twitter

Tweets from election day 2016
Jupyter Notebook
64
star
6

MachineLearningFlashcards.com

The website for the machine learning flashcards project.
CSS
23
star
7

twitter_miner

A generic miner for twitter
Python
18
star
8

third_2016_presidential_debate_twitter

Tweets from the third presidential debate in 2016
Python
16
star
9

techstars_timesheet

Time tracking data from 84 days at Techstars Cloud 2016
Jupyter Notebook
10
star
10

second_2016_presidential_debate_twitter

150,000 tweets from 2016's second presdential debate between Hillary Clinton and Donald Trump
Python
10
star
11

COVID-19-amazon-tracker

Tracks avaliability and prices of common COVID-19-related products on Amazon
10
star
12

istheelectionrigged.com

Single serve site answering a basic question about American democracy.
JavaScript
5
star
13

laptop_setup

4
star
14

rooster_teeth_letsplay

Data On 990 Roosterteeth Let's Play Videos
HTML
2
star
15

helpful_fountains

Python
2
star
16

redesigned-giggle

Python
2
star
17

conflict_health

A complete archive of ConflictHealth.com (2008 to 2012)
PHP
2
star
18

Oreilly_Notes_To_Markdown

A personal script that converts O'Reilly Media highlight exports into markdown for use on ChrisAlbon.com
Python
1
star
19

wikipedia-factcheck-chatgpt

A ChatGPT plugin that factchecks a response from ChatGPT
1
star
20

muppet_deaths

A dataset on the causes of muppet deaths
1
star
21

wikipedia-chatgpt-plugin

Python
1
star
22

thehill_50_beautiful_people

A dataset on The Hill's listing of the 50 most beautiful people in DC
1
star