• Stars
    star
    6
  • Rank 2,539,965 (Top 51 %)
  • Language Stata
  • Created over 1 year ago
  • Updated 4 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A Stata package with tools related to computational reproducibility

More Repositories

1

stata

Stata Commands for Data Management and Analysis
258
star
2

ietoolkit

Stata commands designed for Impact Evaluations in particular, but also data work in general
Stata
214
star
3

REaLTabFormer

A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
Jupyter Notebook
210
star
4

dime-r-training

Dime Analytics R Training
HTML
110
star
5

sdgatlas2018

Replication code for the World Bank Atlas of Sustainable Development Goals 2018
R
104
star
6

stata-visual-library

Inspiration and code for data visualizatio in Stata, created and maintained by DIME Analytics.
Stata
78
star
7

stata-tables

Code and writing for blogpost about Stata tables
TeX
70
star
8

dime-data-handbook

Development Research in Practice: The DIME Analytics Data Handbook. By Kristoffer Bjärkefur, Luíza Cardoso de Andrade, Benjamin Daniels, and Maria Jones
TeX
63
star
9

DIME-Resources

Repo for all the DIME Analytics/DIME resources like trainings and all.
55
star
10

ML-classification-algorithms-poverty

A comparative assessment of machine learning classification algorithms applied to poverty prediction
Jupyter Notebook
51
star
11

ml4dev

Machine Learning for Development: A method to Learn and Identify Earth Features from Satellite Images
Python
50
star
12

Stata-IE-Visual-Library

This is a repository maintained by DIME Analytics and containing example graphs on how to explore data sets and display results of Impact Evaluations using Stata. For information on how to contribute to the library and download codes and data sets, click on the link to GitHub below.
Jupyter Notebook
49
star
13

Python-for-Data-Science

Jupyter Notebook
48
star
14

llm4data

LLM4Data is a Python library designed to facilitate the application of large language models (LLMs) and artificial intelligence for development data and knowledge discovery.
Python
46
star
15

r-econ-visual-library

This is a repository maintained by DIME Analytics and containing example graphs on how to create graphs for data analysis of Impact Evaluations using R.
HTML
45
star
16

dime-standards

Repository with resources for DIME's research standards and coding standards
TeX
41
star
17

GOST_PublicGoods

Jupyter Notebook
40
star
18

DIME-LaTeX-Templates

DIME's LaTeX templates and LaTeX exercises teaching anyone new to LaTeX how to use LaTeX and how to use DIME's templates
TeX
40
star
19

iefieldkit

Stata commands designed for Impact Evaluations field work. These are tools that are used during/after a survey in the field for data quality monitoring.
Stata
39
star
20

covid19-agent-based-model

This repository contains the Python implementation of the agent-based model used to model the spread of COVID-19.
Jupyter Notebook
38
star
21

SPI

Repository containing raw data, code, and final output for the Statistical Performance Indicators project of the World bank
HTML
35
star
22

GISTEmbed

GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings
Python
34
star
23

OpenNightLights

Collection of tools and training materials for exploring the open Nighttime Lights repository
Jupyter Notebook
32
star
24

LearningPoverty

Learning Poverty: an indicator with global coverage that combines schooling and learning.
Stata
31
star
25

wbgviz

Several R packages for World Bank-standard visualisations, building on ggplot2
R
30
star
26

stata-linter

Python
29
star
27

blackmarblepy

Georeferenced Rasters and Statistics of Nightlights from NASA Black Marble
Jupyter Notebook
28
star
28

econometrics-sandbox

This repository contains the code that creates the dashboards references in the “Econometrics Sandbox” blogpost series publish in the Development Impact blog (https://blogs.worldbank.org/impactevaluations)
R
28
star
29

debt-data

Projects related to the World Bank's Debt Statistics
HTML
25
star
30

covid-mobile-data

The COVID19 Mobility Task Force will use data from Mobile Network Operators (MNOs) to support data-poor countries with analytics on mobility to inform mitigation policies for preventing the spread of COVID-19
Python
24
star
31

template

🎩 Project Template
Jupyter Notebook
21
star
32

GEE_Zonal

Collection of python tools for running zonal stats on Google Earth Engine layers
Jupyter Notebook
19
star
33

dime-github-trainings

Training materials and other GitHub related information developed by DIME Analytics
TeX
19
star
34

DIMEwiki

Sample code for impact evaluation and survey
JavaScript
19
star
35

GOST_SAR

Collection of tools developed by GOST team for extracting information from SAR data
Jupyter Notebook
19
star
36

GOSTnets

Convenience wrapper for networkx analysis using geospatial information, focusing on OSM
Jupyter Notebook
17
star
37

blackmarbler

Georeferenced Rasters and Statistics of Nighttime Lights from NASA Black Marble
HTML
16
star
38

dime-python-training

DIME's Python Training for advanced R/Stata users
Jupyter Notebook
16
star
39

iQual

iQual is a package that leverages natural language processing to scale up interpretative qualitative analysis. It also provides methods to assess the bias, interpretability and efficiency of the machine-enhanced codes. iQual has been applied to analyse interviews on parents' aspirations for their children in Cox's Bazaar, Bangladesh.
Jupyter Notebook
15
star
40

dec-python-course

Jupyter Notebook
14
star
41

gld

This is the repository for the Global Labor Database (GLD). It aims to contain all necessary information to understand what the GLD is and how it functions. It does not, however, contain any microdata. For any questions please contact the Focal Point ([email protected]).
Stata
14
star
42

GOST_AIS

Process automatic identification system (AIS) shipping data for various development purposes
Jupyter Notebook
13
star
43

wb-nlp-apps

This repository contains the NLP modeling components and web application implementations of a project for knowledge and data discovery funded by the Knowledge for Change Program (KCP) and the Joint Data Center on Forced Displacement (JDC).
Jupyter Notebook
13
star
44

pipr

R client to the PIP API
R
12
star
45

rio-safe-space

This repository contains the supplemental material and replication package for the 2019 Working Paper "Demand for 'Safe Spaces': Avoiding Harassment and Stigma" by Florence Kondylis, Arianna Legovini, Kate Vyborny, Astrid Zwager, and Luiza Andrade.
Stata
11
star
46

DIA-toolkit

This repository contains all the program codes developed in the "Distributional Impact Analysis: Toolkit and Illustrations of Impacts Beyond the Average Treatment Effect" by Guadalupe Bedoya (World Bank), Luca Bittarello (Northwestern University), Jonathan Davis (University of Chicago), and Nikolas Mittag (CERGE-EI).
Stata
11
star
47

GLAD

Global Learning Assessment Database: a collection of harmonized learning assessments datasets at the student and country level.
Stata
11
star
48

school-survey

Joint UNESCO, UNICEF, WBG survey on national education responses to COVID-19.
Stata
10
star
49

pip

Stata module to access World Bank’s Global Poverty and Inequality data
Stata
10
star
50

cv4ag

Computer vision application over satellite RGB tiles for agricultural land detection
Python
10
star
51

GOSTurban

GOST's combined tools for urban analysis
Jupyter Notebook
9
star
52

povcalnetR

R client to the Povcalnet API
R
9
star
53

wb-nlp-tools

Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP analysis and modeling.
Python
9
star
54

python-101

A hour lighting introduction to Python for WBG staff delivered on Data Day on Feb 13
Jupyter Notebook
9
star
55

GEPD

Global Education Policy Dashboard
HTML
8
star
56

CityScan

Collection of data processing scripts to generate the baseline data for the CityScan project
Jupyter Notebook
8
star
57

sdg-metadata

SDG Metadata Translation Pilot
JavaScript
8
star
58

rissk

Identify at-risk interviews directly from your Survey Solutions export files.
Python
8
star
59

SDI-Health

Dissemination of harmonization code and data for SDI Health surveys
Stata
8
star
60

EPM

Electricity Planning Model
GAMS
7
star
61

BDA-with-Python

Jupyter Notebook
7
star
62

TwitterEconomicMonitoring

Collection of training materials to download and draw insights from Twitter data.
Jupyter Notebook
7
star
63

dkanr

General purpose R client to the DKAN Open Data platform
R
7
star
64

wb-reproducible-research-repository

This repository supports the World Bank's Reproducible Research Repository
Stata
6
star
65

GOSTnetsraster

Calculating market access using raster surfaces of friction or travel time
Jupyter Notebook
6
star
66

ethiopia-rsdp-ie

Replication Package for: The Impact of Ethiopia's Road Sector Development Program: Evidence from Satellite Data
R
6
star
67

shiny-trainings-performance-ex

HTML
6
star
68

EduAnalyticsToolkit

EduAnalytics Team Toolkit for Data Management, Documentation and Analytics
Stata
6
star
69

povcalnet

Stata client to the Povcalnet API
Stata
6
star
70

DIME-MSIE-Workshop

To version control and share all lab presentation, code examples etc. for DIME’s Manage Successful Impact Evaluation (MSIE) Workshop (also know as DIME’s Field Coordinator Training)
TeX
6
star
71

HNP

World Bank's Geospatial Team (GOST) support to the Global Practice for Health, Nutrition, and Population.
6
star
72

rsocialwatcher

A Social Data Collector for Facebook Marketing API
R
6
star
73

econberta-econie

Repository hosting the large language model EconBERTa and the annotated dataset EconIE
Python
6
star
74

datalibweb

datalibweb - datalibweb is the Stata frontend for the microdata API created by Poverty Global Practice in collaboration with ITS and DECDG to enable users to access data and documentation available in different global, regional and country microdata catalogs at the World Bank.
Stata
6
star
75

qcheck

Stata
5
star
76

institutional-assessment-dashboard

This repository contains the code to create the “global institutional assessment dashboard”
R
5
star
77

intro-to-python

Introduction to Python for Data Science.
Jupyter Notebook
5
star
78

health-equity-diagnostics

Jupyter Notebook
5
star
79

INFRA_SAP

Compilation of national level infrastructure analysis as part of the World Bank's Global Infrastructure Map
Jupyter Notebook
5
star
80

Water-When-It-Counts

Replication files for Water When It Counts: Reducing Scarcity through Irrigation Monitoring in Central Mozambique by Paul Christian, Florence Kondylis, Valerie Mueller, Astrid Zwager and Tobias Siegfried
Stata
5
star
81

geometatool

Geospatial Metadata Toolkit
HTML
4
star
82

GSS_Census_Tools

Collected tools for improving the EA demarcation workflow
Jupyter Notebook
4
star
83

SARMD_guidelines

Technical guidelines for the SAR microdata base
Stata
4
star
84

fin2ddh

R
4
star
85

NTL_Harmonizer

Jupyter Notebook
4
star
86

dime-stata-training

HTML
4
star
87

primus

Stata package to manage PRIMUS system
Stata
4
star
88

Firms-Web-Scraping

The aim of this project is to scrape metadata of business firms given only their name and country where they are operating.
Python
4
star
89

GEEST

Gender Enabling Environments Spatial Tool (GEEST)
QML
4
star
90

Worldwide-Bureaucracy-Indicators

Do files used to create Worldwide Bureaucracy Indicators
Stata
3
star
91

geolocation-twitter-urban-planning

This repository contains the code for the analysis and the reproducibility package for the paper "Applying machine learning and geolocation techniques to social media data (Twitter) to develop a resource for urban planning"
R
3
star
92

GeospatialFCVcollateral

Links to additional resources related to the Geospatial and ICT in FCV course on the Open Learning Campus (under construction)
HTML
3
star
93

climateknowledgeportal

Climate Change Knowledge Portal Documentation
Jupyter Notebook
3
star
94

RAG-Based-ChatBot-Example

Python
3
star
95

worldex

WorldEx Application for subnational data indexing and discovery.
Jupyter Notebook
3
star
96

CoVID_density_hotspot_mapping

Identify potential hotspots for CoVID spread due to population density, building heights, and access to services
Jupyter Notebook
2
star
97

terridev_GSG

Stata
2
star
98

SDG-big-data

HTML
2
star
99

LSMS

Jupyter Notebook
2
star
100

PIP-Methodology

Methodology page for the Poverty and Inequality Platform.
TeX
2
star