• Stars
    star
    2
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created 7 months ago
  • Updated 6 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A python library for online learning in RL loops, specialized for Contextual Bandit scenarios. Choose actions from multiple options, evaluate decisions, and integrate feedback for improved future outcomes. Features versatile scoring, advanced featurization, and configurable learning policies.

More Repositories

1

vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
C++
8,409
star
2

reinforcement_learning

Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update
C++
72
star
3

coba

Contextual bandit benchmarking
Python
43
star
4

jupyter-notebooks

Vowpal Wabbit examples and tutorials
Jupyter Notebook
19
star
5

estimators

Estimators to perform off-policy evaluation
Python
12
star
6

py-vowpal-wabbit-next

Experimental new Python bindings for the VowpalWabbit library
Python
10
star
7

docker-images

Docker images used for continuous integration
Dockerfile
8
star
8

workshop

Jupyter Notebook
6
star
9

data-science

Python
4
star
10

rl_chain

Python
4
star
11

neurips2019

4
star
12

slates-experiments

Jupyter Notebook
4
star
13

docs

Genenerated documentation
HTML
3
star
14

slope-experiments

Python
3
star
15

vowpalwabbit.github.io

Official website of Vowpal Wabbit
HTML
3
star
16

icml2019

3
star
17

ccb-experiments

C++
1
star
18

feature-broker

Mature software applications are highly componentized and have well-defined API boundaries among components. This structure leads to constraints making in-process inference challenging due to the difficulty of access to features. The FeatureBroker library presents a solution to this problem. This helped with adoption of VW in inference. We believe that this is a pervasive problem. The goal of open sourcing this library is to reduce the developer friction in adopting VW in software stacks that are built with these constraints.
1
star