• Stars
    star
    2
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created over 1 year ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Optimal Binning (Quantization)

More Repositories

1

data-science

Lecture Slides for Introduction to Data Science
TeX
24
star
2

image-to-text

Images of Text to Text: Call Tesseract from Python and OCR a directory of pdfs
Python
14
star
3

meta_ps

Production and Consumption of APSR, BJPS, Perspectives, PS, and World Politics Articles
R
13
star
4

face_of_crime

Race and Gender of Criminals and Victims in Law and Order
TeX
13
star
5

tldr

Distilling key points, reorganizing, and modestly augmenting the points from books and lectures.
11
star
6

tv_schedules

70+ years of data on Network television. Attributes of shows, race & gender of cast members, directors, producers, presenters, etc.
Python
10
star
7

on-writing

Writing Tips, Tricks, and Tools
HTML
9
star
8

text-as-data

Pipeline for Analyzing Text Data: Acquire, Preprocess, Analyze
Python
8
star
9

optimal_classification_cutoffs

Script for calculating the optimal cut-off for max. F1-score or accuracy
Jupyter Notebook
6
star
10

wait

Waiting Times at CA DMV
Jupyter Notebook
6
star
11

search-and-replace

Edit Distance Based Search and Replace
Python
6
star
12

python-workshop

Introduction to Python, Data structures, scraping, APIs, pre-processing text data
Python
4
star
13

likes-followers-views

Track Facebook Likes, Twitter Followers, YouTube Views
R
4
star
14

daughters

R
4
star
15

biocong

Biographical data on members of congress (105th --- 115th).
Jupyter Notebook
4
star
16

adult

Consumption of Pornography Online Using Passively Observed Browsing Data
Jupyter Notebook
3
star
17

sonny_side

Son Bias in US: Evidence from Business Names
Jupyter Notebook
3
star
18

guess

Adjust naive estimates of learning for guessing
R
3
star
19

working_women_on_tv

Employment Status of Female Characters on Indian Television Soaps
R
3
star
20

journal_price

Price of Academic Journals
3
star
21

digital-tv-coverage-in-uk

Digital TV Coverage By Postcode in the U.K.
Python
3
star
22

mixed_signals

Using 48,613 average movie ratings from 12 platforms for which we have ratings for 100 or more movies, we estimate the correlation between ratings across platforms. The median correlation between average ratings of two platforms was .37.
R
3
star
23

partisan-gaps

How do (biased) guessing encouraging features and guessing agnostic coding techniques affect the partisan gap?
Jupyter Notebook
3
star
24

distortions

Replication Data and Scripts for Deliberative Distortions
R
2
star
25

ds

Learning From Data
HTML
2
star
26

java_ocr_parser_for_factbook

Java OCR and Parser for Warren's TV and Cable Factbook (From 2013)
Java
2
star
27

extreme_recall

Extreme Recall: Which Politicians Come to Mind?
Stata
2
star
28

military-experience

Military Experience of US Presidents and UK Prime Ministers.
R
2
star
29

epic_children

Jupyter Notebook
2
star
30

lta

Group zipcodes to Build Local Television Areas
Jupyter Notebook
2
star
31

optimal_data_collection

TeX
2
star
32

pcomp

Stata
2
star
33

ookla_netindex

Estimates of Internet speed by city, country, region by Ookla (NetIndex)
Python
2
star
34

nga

Scraping National Governor's Association (from 2012)
R
2
star
35

social_proof_stars

Effect of Social Proof on Downloads
Jupyter Notebook
2
star
36

nireland

Replication Data And Scripts for How Can You Think That?: Deliberation and the Learning of Opposing Arguments
R
1
star
37

partisan_gap

Stata
1
star
38

partisan_vision

Partisan Bias in Simple Visual Evaluations
TeX
1
star
39

kirkuk

Data and scripts behind the paper "What Future For Kirkuk?"
R
1
star
40

prop_male

How does stopping rule matter for sex ratio?
1
star
41

quant-discipline

By the Numbers: Toward Precise Numerical Summaries
TeX
1
star
42

total_error

total error
TeX
1
star
43

scaling

Scaling ML Products At Startups: A Practitioner's Guide
TeX
1
star
44

speech-learn

Modeling Relationship Between Congressional Speech and Ideology
Python
1
star
45

goji

R Package With Functions for Generalized Variance, Formatting Strings, Cleaning Text etc.
R
1
star
46

g2

Jupyter Notebook
1
star
47

uncertainty

Is an Uncertain Prospect Less Preferred Than Its Worst Possible Outcome? New Evidence on the Uncertainty Effect
TeX
1
star
48

recognize

Assess OCR quality: Compare OCR to human transcription
R
1
star
49

pollbias

The House Always Wins: House Effects in Polling
R
1
star
50

superdf

superdf: extending the dataframe classes in R and Python to save metadata with the data
Jupyter Notebook
1
star
51

misinformation

Portugol
1
star
52

birthday_voter

Jupyter Notebook
1
star
53

not_to_code

Static Code Analysis of Replication Files
R
1
star
54

selexp

Discretionary Exposure to Political Information
R
1
star
55

unclear_gap

Survey Research. How Vague Response Options Produce Partisan Knowledge Gaps
TeX
1
star
56

typecast

Replication Materials for Typecast
TeX
1
star
57

party_time

Replication Data and Scripts for Affect, Not Ideology: A Social Identity Perspective on Polarization
Scheme
1
star
58

pareto_partisan

Pareto Partisan: Are Partisans Willing to Bite Their Purse To Spite The Main Opposing Party?
R
1
star
59

late_iv

check distributional implications of LATE
1
star
60

partisan_head

TeX
1
star
61

hidden

R
1
star