• Stars
    star
    122
  • Rank 292,031 (Top 6 %)
  • Language
    R
  • Created over 11 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

An convenient R tool for manipulating tables in PostgreSQL type databases and a wrapper of Apache MADlib.

PivotalR

PivotalR is a package that enables users of R, the most popular open source statistical programming language and environment, to interact with Greenplum Database and the PostgreSQL for big data analytics. It does so by providing an interface to the operations on tables/views in the database. These operations are almost the same as those of data.frame. Minimal amount of data is transfered between R and the database. Thus the users of R do not need to learn SQL when they operate on the objects in the database. PivotalR also lets the user to run the functions of the open source machine learning package Apache MADlib directly from R.

  1. An Introduction to PivotalR

     vignette("pivotalr") # execute in R console to view the PDF file
    
  2. To install PivotalR:

    • Get the latest stable version from CRAN by running install.packages("PivotalR")

    • Or try out the latest development version from github by running the following code (need R >= 3.0.2):

      ## install.packages("devtools") # 'devtools' package is only available for R >= 3.0.2
      devtools::install_github("PivotalR", "greenplum-db")
      
    • Or download the source tarball directly from here, and then install the tarball

      install.packages("greenplum-db-PivotalR-xxxx.tar.gz", repos = NULL, type = "source")
      

    where "greenplum-db-PivotalR-xxxx.tar.gz" is the name of the package that you have downloaded.

  3. To get started:

More Repositories

1

gpdb

Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
C
6,009
star
2

gporca

A modular query optimizer for big data
C++
588
star
3

postgres

86
star
4

pxf

Platform Extension Framework: Federated Query Engine
Java
69
star
5

gpbackup

GPDB Backup Utility
Go
52
star
6

diskquota

PostgreSQL disk quota extension
C
48
star
7

GreenplumPython-archive

Python
43
star
8

plcontainer

PL/Container - GPDB execution sandboxing for Python and R
C
43
star
9

go-gpdb

Pivotal Greenplum Database downloader and installer based in golang
Go
27
star
10

geospatial

PostGIS for Greenplum
PLpgSQL
26
star
11

gpupgrade

GPDB major version upgrade utility
Go
22
star
12

greenplum-for-kubernetes

Deploy Greenplum cluster on Kubernetes
Go
22
star
13

gp-xerces

C++
18
star
14

TPC-DS

Greenplum TPC-DS benchmark
C
15
star
15

gpbackup-s3-plugin

S3 plugin for use with GPDB backup utility
Go
10
star
16

pljava

PL/Java GPDB Package
Java
10
star
17

gp-common-go-libs

Go
9
star
18

greenplum-database-release

A repository for code related to creating packages of Greenplum Database
Ruby
9
star
19

gpdb-postgres-merge

Scratch repository for merging upstream Postgres into GPDB
C
9
star
20

pgbouncer

pgbouncer repo
C
8
star
21

filedump

Greenplum Database Filedump is a utility to format Greenplum heap/index/control files into a human-readable form. You can format/dump the files several ways, as listed in the Invocation section, as well as dumping straight binary.
C
5
star
22

plr

plr
C
4
star
23

GreenplumR

R
3
star
24

gssapi

Go
3
star
25

libusual

libusual
C
2
star