  • Stars: 378
  • Rank: 113,272 (Top 3%)
  • Language: Go
  • License: MIT License
  • Created over 6 years ago
  • Updated 7 months ago

Repository Details

⚖️ Fast weighted random selection for Go

weightedrand ⚖️

Fast weighted random selection for Go.

Randomly selects an element from a list in which each element's chance of being selected is not equal, but is instead defined by a relative "weight" (or probability). This is called weighted random selection.

Usage

import (
    /* ...snip... */
    "github.com/mroth/weightedrand/v2"
)

func main() {
    chooser, err := weightedrand.NewChooser(
        weightedrand.NewChoice('🍒', 0),
        weightedrand.NewChoice('🍋', 1),
        weightedrand.NewChoice('🍊', 1),
        weightedrand.NewChoice('🍉', 3),
        weightedrand.NewChoice('🥑', 5),
    )
    if err != nil {
        panic(err) // handle the error instead of discarding it
    }
    // The following will print 🍋 and 🍊 with 0.1 probability each, 🍉 with 0.3
    // probability, and 🥑 with 0.5 probability. 🍒 will never be printed. (Note
    // the weights don't have to add up to 10, that was just done here to make
    // the example easier to read.)
    result := chooser.Pick()
    // The choices are runes, so format with %c; fmt.Println would print the
    // numeric code point rather than the emoji.
    fmt.Printf("%c\n", result)
}
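The probabilities quoted in the comment above follow from dividing each weight by the sum of all weights. The sketch below works through that arithmetic using only the standard library; the `probabilities` helper is a hypothetical name introduced for illustration and is not part of weightedrand.

```go
package main

import "fmt"

// probabilities converts relative integer weights into selection
// probabilities by dividing each weight by the sum of all weights.
// (Hypothetical helper for illustration; not part of weightedrand.)
func probabilities(weights []int) []float64 {
	total := 0
	for _, w := range weights {
		total += w
	}
	probs := make([]float64, len(weights))
	if total == 0 {
		return probs // nothing is selectable; every probability stays zero
	}
	for i, w := range weights {
		probs[i] = float64(w) / float64(total)
	}
	return probs
}

func main() {
	items := []string{"🍒", "🍋", "🍊", "🍉", "🥑"}
	weights := []int{0, 1, 1, 3, 5}
	for i, p := range probabilities(weights) {
		fmt.Printf("%s %.1f\n", items[i], p)
	}
}
```

Because only the ratios matter, weights of 0, 2, 2, 6, 10 would give the same probabilities.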

Performance

The existing Go library with a comparable implementation is github.com/jmcvetta/randutil, which optimizes for the single-operation case. In contrast, this library creates a presorted cache optimized for binary search, allowing repeated selections from the same set to be significantly faster, especially for large data sets.
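One common way to build such a cache is to store running totals of the weights, which are sorted by construction, and binary-search them on each pick. The following is a minimal sketch of that idea under stated assumptions, not the library's actual internals: `sketchChooser`, `newSketchChooser`, `pickAt`, and `pick` are hypothetical names, and at least one weight is assumed to be positive.

```go
package main

import (
	"fmt"
	"math/rand"
	"sort"
)

// sketchChooser is a hypothetical type for illustration only.
type sketchChooser struct {
	items  []string
	totals []int // totals[i] is the sum of weights[0..i]
	sum    int   // sum of all weights
}

// newSketchChooser does the one-time O(n) setup: it builds the
// running-total slice that later picks will binary-search.
func newSketchChooser(items []string, weights []int) *sketchChooser {
	totals := make([]int, len(weights))
	running := 0
	for i, w := range weights {
		running += w
		totals[i] = running
	}
	return &sketchChooser{items: items, totals: totals, sum: running}
}

// pickAt maps r in [1, sum] to an item. sort.SearchInts returns the
// smallest index whose running total is >= r, so an item with weight
// zero can never be the first index to reach r.
func (c *sketchChooser) pickAt(r int) string {
	return c.items[sort.SearchInts(c.totals, r)]
}

// pick draws a uniform r in [1, sum] and resolves it in O(log n).
func (c *sketchChooser) pick() string {
	return c.pickAt(rand.Intn(c.sum) + 1)
}

func main() {
	c := newSketchChooser(
		[]string{"🍒", "🍋", "🍊", "🍉", "🥑"},
		[]int{0, 1, 1, 3, 5},
	)
	for i := 0; i < 5; i++ {
		fmt.Println(c.pick())
	}
}
```

The setup cost is paid once, so the approach only pays off when the same set of choices is sampled repeatedly.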

The table below compares this library with randutil.ChooseWeighted on my workstation. For repeated samplings from large collections, weightedrand will be much quicker:

Num choices    randutil           weightedrand    weightedrand -cpu=8*
10             201 ns/op          38 ns/op        2.9 ns/op
100            267 ns/op          51 ns/op        4.1 ns/op
1,000          1012 ns/op         67 ns/op        5.4 ns/op
10,000         8683 ns/op         83 ns/op        6.9 ns/op
100,000        123500 ns/op       105 ns/op       12.0 ns/op
1,000,000      2399614 ns/op      218 ns/op       17.2 ns/op
10,000,000     26804440 ns/op     432 ns/op       35.1 ns/op

*: Since v0.3.0 weightedrand can efficiently utilize a single Chooser across multiple CPU cores in parallel, making it even faster in overall throughput. See PR#2 for details. Informal benchmarks conducted on an Intel Xeon W-2140B CPU (8 core @ 3.2GHz, hyperthreading enabled).
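To illustrate why a prebuilt lookup structure lends itself to parallel use, the sketch below has several goroutines sample from one shared, read-only slice of running totals. It is an assumption-laden illustration using only the standard library, not the approach taken in the library or in the PR mentioned above; `countParallel` is a hypothetical name, and the final running total is assumed to be positive.

```go
package main

import (
	"fmt"
	"math/rand"
	"sort"
	"sync"
)

// countParallel lets `workers` goroutines sample concurrently from one
// shared running-total slice and returns how often each index was hit.
// The slice is only read after construction, so no locking is needed
// around it; each goroutine writes to its own counter slice.
func countParallel(totals []int, workers, picksPerWorker int) []int {
	sum := totals[len(totals)-1]
	perWorker := make([][]int, workers)
	var wg sync.WaitGroup
	for w := 0; w < workers; w++ {
		perWorker[w] = make([]int, len(totals))
		wg.Add(1)
		go func(counts []int) {
			defer wg.Done()
			for i := 0; i < picksPerWorker; i++ {
				// Top-level math/rand functions are safe for concurrent use.
				r := rand.Intn(sum) + 1
				counts[sort.SearchInts(totals, r)]++
			}
		}(perWorker[w])
	}
	wg.Wait()

	// Merge the per-goroutine counters once all workers are done.
	merged := make([]int, len(totals))
	for _, counts := range perWorker {
		for i, c := range counts {
			merged[i] += c
		}
	}
	return merged
}

func main() {
	// Running totals for the weights 0, 1, 1, 3, 5.
	totals := []int{0, 1, 2, 5, 10}
	fmt.Println(countParallel(totals, 8, 100000))
}
```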

Don't be misled by these numbers into thinking weightedrand is always the right choice! If you are only picking from the same distribution once, randutil will be faster. weightedrand optimizes for repeated calls at the expense of some initialization time and memory storage.
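For comparison, a one-off pick needs no cache at all: a single linear pass over the weights is enough. The sketch below shows that general technique under stated assumptions; it is not taken from randutil's source, the names `scanAt` and `pickOnce` are hypothetical, and at least one weight is assumed to be positive.

```go
package main

import (
	"fmt"
	"math/rand"
)

// scanAt walks the weights once, subtracting each from r until r falls
// inside an item's share. It expects 0 <= r < sum(weights).
func scanAt(items []string, weights []int, r int) string {
	for i, w := range weights {
		if r < w {
			return items[i]
		}
		r -= w
	}
	return "" // not reached when r is within range
}

// pickOnce sums the weights and scans once: O(n) per pick, but with no
// setup cost and no extra memory.
func pickOnce(items []string, weights []int) string {
	total := 0
	for _, w := range weights {
		total += w
	}
	return scanAt(items, weights, rand.Intn(total))
}

func main() {
	items := []string{"🍒", "🍋", "🍊", "🍉", "🥑"}
	weights := []int{0, 1, 1, 3, 5}
	fmt.Println(pickOnce(items, weights))
}
```

Repeating this O(n) scan for every pick is what becomes expensive on large collections, which is the case a cached, binary-searchable structure is meant to address.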

Requirements

weightedrand >= v2 requires Go 1.18 or greater. For support on earlier versions of Go, use weightedrand v1.

Credits

To better understand the algorithm used in this library (as well as the one used in randutil) check out this great blog post: Weighted random generation in Python.

More Repositories

1. unindexed (JavaScript, 1,315 stars): 🔎❔ website that irrevocably deletes itself once indexed
2. emojitracker (743 stars): 💫 track ALL the emoji
3. phoenix-showdown (Elixir, 704 stars): 🏇 benchmark Sinatra-like web frameworks
4. scmpuff (Go, 373 stars): 🔢 Numeric file shortcuts for common git commands
5. slacknimate (Go, 286 stars): 👯 Realtime text animation for Slack chatops
6. evalcache (Shell, 173 stars): 🐣 zsh plugin to cache eval loads to improve shell startup time
7. bootslap (Shell, 120 stars): ☕ bootstraps macOS to be usable
8. sseserver (Go, 107 stars): 🏄 High-performance Server-Sent Events endpoint for Go
9. emojistatic (CSS, 98 stars): 💩 CDN static asset generation for emoji
10. exmoji (Elixir, 95 stars): 😎 Emoji encoding swiss army knife for Elixir/Erlang
11. emoji_data.rb (Ruby, 87 stars): 😎 Emoji encoding swiss army knife for Ruby
12. emoji-data-js (CoffeeScript, 64 stars): 😎 Emoji encoding swiss army knife for NodeJS
13. ramdisk (Go, 47 stars): 🐏 Convenience wrapper for managing RAM disks
14. benchwarmer (Elixir, 47 stars): ⌛ Elixir micro-benchmarking library
15. bump (Go, 46 stars): 🌻 CLI tool to draft a GitHub Release for the next semantic version
16. git-muzak (Shell, 32 stars): 🎶 Background music for your git commits
17. cameraform (ActionScript, 24 stars): 📹 Simple Flash+JS library for webcam capture and submission.
18. recyclebin (Go, 21 stars): ♻️ measures usage of a particular term on twitter
19. grayratio (Shell, 16 stars): 💬 gray:blue ratio for iMessage conversations
20. twitter-streaming-showdown (Ruby, 15 stars): 🚣 benchmark Twitter Streaming API libraries
21. cssquirt (Ruby, 14 stars): 💧 Embeds images (or directories of images) directly into CSS via the Data URI scheme.
22. tinygeoip (Go, 13 stars): 🐉 tiny geoip microservice
23. momocode (Solidity, 12 stars): 🍑 Visual fingerprinting for 20-byte Ethereum addresses via emoji
24. xkcdpass (Swift, 12 stars): 🔏 xkcd style password generator for iOS
25. pullcrusher (Ruby, 10 stars): 🎩 optimizes all images in a GitHub repo & sends a pull request with the changes.
26. emojidoll (Ruby, 10 stars): 🎎 twitter bot to generate Emoji dolls
27. pigstream (Ruby, 10 stars): 🐷🐤 Twitter bot that insta-taunts frustrated people who can't beat a level on Angry Birds, using the streaming API.
28. consider (Shell, 9 stars): 🤔 quickly check code comments for subtle (racism|sexism|ableism)
29. golang-challenge-1 (Go, 9 stars): Solution for Go Challenge #1 (in Go and Elixir)
30. deepclean (Go, 9 stars): 🗑️ scan and remove junk files from your source code directories
31. scalafmt-native (Dockerfile, 8 stars): Statically-linked GraalVM "native image" binaries of scalafmt
32. my-boxen (Ruby, 8 stars): 👔 MacOSX system provisioning via Boxen [DEPRECATED, see mroth/bootslap]
33. git-prompt-useremail (Shell, 8 stars): 💂‍♂️ zsh plugin adds prompt reminders for git user.email
34. hurricanecamp (8 stars): 🌀 Hurricane Dev Camp is a stay-at-home hackathon to coincide with Hurricane Irene.
35. subtleist (Go, 8 stars): 🎏 Anonymously remind of subtle-isms and other Recurse Center social rules in Slack
36. nanogeoip (Rust, 8 stars): 🐉 tiny and blazing fast experimental geoip microservice
37. stardotws (HTML, 7 stars): 🌟 source for a fun EMOJI DOMAIN
38. upcoming-cloud-warrior (Shell, 7 stars): ☁️ Quick script/instructions to get the Upcoming.org Archive Team Project running on multiple Heroku instances in the cloud.
39. hubhumans (Ruby, 6 stars): 👪 Automatically create a `humans.txt` file based upon public members of a GitHub organization.
40. bogan-martin-award (HTML, 6 stars): 🐲 old-skool Flickr staff award website
41. goodvsevil (Ruby, 5 stars): 🐱🐶 A quick hack to compare keyword counts in the Twitter Streaming API, using puppies and kittens.
42. portfolio (HTML, 5 stars): 🎨 my portfolio website
43. personalappeals (JavaScript, 5 stars): 🏩 Hot or Not of the Wikipedia contributors.
44. semverdesc (Go, 4 stars): 🎯 git describe with semantic version compatible names
45. dotfiles (Shell, 4 stars): ⭕ My dotfiles repository. There are many like it, but this one is mine.
46. howami (Ruby, 4 stars): 💉 command line tool to show a summary of Fitbit health data
47. pdftotextcloud (JavaScript, 4 stars): 📚 pdftotext as a web service
48. pybaztag (Python, 4 stars): 🐰 Python convenience wrapper for the Nabaztag API.
49. readtime (Go, 4 stars): ⌚ estimate how long it will take to read text
50. shopmon (Go, 4 stars): 🏪 Monitor for in-stock products from Shopify powered stores
51. forafriendbot (CoffeeScript, 4 stars): 🙋 twitter bot that poses questions.... for a friend.
52. flickr-nearby-webos (JavaScript, 3 stars): 📱 Quick and dirty location services test for WebOS, source code to support blog post.
53. github-cli-xref (Ruby, 3 stars): ♻️ Tool to make it easy to cross-reference GitHub issues.
54. fashionhack (Ruby, 3 stars): 👠 fashion hackday project with @kellan
55. flickr-wholovesyou (Ruby, 3 stars): 💞 see which Flickr members most frequently favorite photos of you (or someone else you know!)
56. tamanegi (Go, 3 stars): 🌰 generates tor .onion hashes
57. hubfavor (Ruby, 3 stars): 🙇 determine who in a GitHub org is most likely to do you a favor
58. meatballtracker (Ruby, 3 stars): 🍝 monitor and alert for delicious meatballs at a local restaurant.
59. isgregdead (Ruby, 3 stars): 💀 Check whether a coworker (who occasionally keeps odd hours) is alive.
60. foursquare-token-echo (2 stars): 4️⃣💭 simple webpage to receive and echo the client_token for a Foursquare OAuth2 request.
61. flickr-socialvenn (2 stars): 👬 Generates a weighted contact intersection venn diagram for a Flickr member.
62. poidh (Ruby, 2 stars): 👀 pics or it didn't happen!
63. sse-bench (CoffeeScript, 2 stars): 📊 benchmarks Server-Sent Events endpoints
64. dees-colors (2 stars): 🌈 preview text as seen by a friend with grapheme-color synesthesia.
65. jitter (Go, 2 stars): 👯 Go timers with random jitter
66. emojicompare (2 stars): 😄⁉️😄 A comparison of the Apple and Twitter emoji glyphs
67. go2go-docker (Dockerfile, 2 stars): 🐳 Docker image for the dev.go2go experimental Go branch
68. loremfile (Go, 1 star): 📜 Generate lorem ipsum text of a specific size
69. .github (1 star): 😈 Default community health files
70. dinnermint (Ruby, 1 star): 🍬 automated metadata processing of personal photos on Flickr.
71. tordesc (Rust, 1 star): Parser for Tor network data descriptors
72. scalafmt-docker (Dockerfile, 1 star): Simple scalafmt packaging as a Docker Hub automated build
73. rando-slackrisian (JavaScript, 1 star): 🎲 super quick CLI hack to get a random member of your Slack Team
74. base100-go (Go, 1 star): 💯 Go implementation of Base100 emoji encoding
75. mta2json (JavaScript, 1 star): 🚋 Proxies reqs to the MTA for Realtime Transit Feed data, converts results to JSON.
76. howlong (HTML, 1 star): Quick self reference to answer a question I frequently get on calls.
77. sunnyinphilly (1 star): ⛅ is it sunny in Philadelphia?
78. timeduration (Go, 1 star): 🕐 simple CLI tool to convert humanized time durations
79. xsort (Go, 1 star): 80% faster versions of Go sort.Search* wrappers
80. lolcapture (Swift, 1 star): 📹 experimental capture tool for lolcommits
81. fitdump (Go, 1 star): 📉 Parse Fitbit data export files
82. khan-bootstrap (Makefile, 1 star): 🎓 A minimal setup for hacking the Khan Academy codebase