• Stars
    star
    200
  • Rank 191,360 (Top 4 %)
  • Language
    Java
  • License
    MIT License
  • Created over 8 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Manages streaming of data to AWS S3 without knowing the size beforehand and without keeping it all in memory or writing to disk.

S3 Stream Upload

This library allows you to efficiently stream large amounts of data to AWS S3 in Java without having to store the whole object in memory or use files. The S3 API requires that a content length be set before starting uploading, which is a problem when you want to calculate a large amount of data on the fly. The standard Java AWS SDK will simply buffer all the data in memory so that it can calculate the length, which consumes RAM and delays the upload. You can write the data to a temporary file but disk IO is slow (if your data is already in a file, using this library is pointless). This library provides an OutputStream that packages data written to it into chunks which are sent in a multipart upload. You can also use several streams and upload the data in parallel.

The entrypoint is the class StreamTransferManager. Read more in the javadoc, including a usage example.

This is available from maven central.

Changelog

2.2.4

2.2.3

2.2.2

2.2.1

  • Exceptions during upload lead to an exception in writers instead of them getting stuck trying to put on the queue.
  • Bump aws-java-sdk and slf4j-api dependency versions.

2.2.0

2.1.0

  • Support uploading empty objects. In this case the user may want to override customisePutEmptyObjectRequest.
  • Bump aws-java-sdk and slf4j-api dependency versions.

2.0.3

  • Bump aws-java-sdk dependency version

2.0.2

  • Avoid race condition in big uploads causing some parts to be missing from final completed upload.

2.0.1

  • Allow thrown Errors (e.g. OOM) to bubble up unchanged.

2.0.0

  • The checkSize() method is now private as the user no longer needs to call it. You can remove all calls to it.
  • The constructor from version 1 is now deprecated. Instead you should call the 3 parameter constructor which has the essentials, and optionally chain the desired builder style setters to configure.

More Repositories

1

heartrate

Simple real time visualisation of the execution of a Python program.
Python
1,624
star
2

birdseye

Graphical Python debugger which lets you easily view the values of all evaluated expressions
JavaScript
1,570
star
3

futurecoder

100% free and interactive Python course for beginners
Python
1,245
star
4

snoop

A powerful set of Python debugging tools, based on PySnooper
Python
927
star
5

sorcery

Dark magic delights in Python
Python
353
star
6

executing

Get information about what a Python frame is currently doing, particularly the AST node being executed
Python
312
star
7

funcfinder

A tool for automatically solving problems of the form "I need a python function that does X."
Python
165
star
8

instant_api

Instantly create an HTTP API with automatic type conversions, JSON RPC, and a Swagger UI. Just add methods!
Python
126
star
9

stack_data

Python
32
star
10

birdseye-pycharm

IntelliJ IDE plugin for the Python debugger birdseye
Java
31
star
11

pure_eval

Safely evaluate AST nodes without side effects
Python
26
star
12

outdated

Check if a version of a PyPI package is outdated
Python
22
star
13

cheap_repr

Better version of repr/reprlib for short, cheap string representations in Python
Python
21
star
14

friendly_states

Declarative, explicit, tool-friendly finite state machines in Python
Python
19
star
15

nameof

Python function to get the name of a variable or attribute, as in C#
Python
13
star
16

boxes

A library that adds object oriented power to fields, letting you do better than traditional getters and setters.
Java
12
star
17

sunhours

Sketchup plugin for analysing the amount of sunlight hitting points on a surface over the year:
HTML
10
star
18

pyodide-worker-runner

TypeScript
9
star
19

instant_client

Type safe JSON RPC client with automatic (de)serialization. Best paired with instant_api.
Python
7
star
20

jsonfinder

Python library to easily handle JSON contained within strings.
Python
7
star
21

oeis-explorer

Explore related sequences in the OEIS
Python
6
star
22

sync-message

TypeScript
5
star
23

comsync

TypeScript
4
star
24

python_runner

Helper for running python code indirectly
Python
4
star
25

dryenv

Simple DRY configuration with environment variables and pydantic
Python
4
star
26

littleutils

Small personal collection of python utility functions, partly just for fun.
Python
3
star
27

askso

AskSO - StackOverflow Python Question Assistant
Python
2
star
28

datafunctions

Automatic (de)serialization of arguments and return values for Python functions
Python
2
star
29

quiggles

Android app for drawing symmetrical patterns
Kotlin
2
star
30

alexmojaki

2
star
31

dependent_types

Python
1
star
32

case-classes

A framework to refactor computing a result from an aggregate object
Java
1
star
33

trace_augmentation

Python
1
star
34

papyros

📜 Scratchpad for Python and JS, running in the browser.
TypeScript
1
star