The Amazon Kinesis Producer Library (KPL) performs many tasks common to creating efficient and reliable producers for Amazon Kinesis. By using the KPL, customers do not need to develop the same logic every time they create a new application for data ingestion.
For detailed information and installation instructions, see the article Developing Producer Applications for Amazon Kinesis Using the Amazon Kinesis Producer Library in the Amazon Kinesis Developer Guide.
-
The native process will no longer attempt to use the default system CA bundle
- The KPL should now run on versions of Linux that don't place the default CA bundle in /etc/pki
- This fixes issue #66
-
Added automatic BJS endpoint selection
- The KPL will now select the BJS endpoint when configured for BJS.
- This fixes issue #36
-
Maven Artifact Signing Change
- Artifacts are now signed by the identity
Amazon Kinesis Tools <[email protected]>
- Artifacts are now signed by the identity
-
Windows Support is not Available for this Version.
This version of the Kinesis Producer Library doesn't currently support windows. Windows support will be added back at a later date.
- Log output from the kinesis_producer is now captured, and re-emitted by the LogInputStreamReader
- The daemon is now more aggressive about restarting the native kinesis_producer process.
- Updated AWS SDK dependency.
- The native process now uses version 1.0.5 of the AWS C++ SDK.
- The native process doesn't currently support any of the AWS C++ SDK credentials providers. Support for these providers will be added a later date.
- The native process now attempts to produce stack traces for various fatal signals.
Misc bug fixes and improvements.
Important: Becuase the slf4j-simple dependency has been made optional, you will now need to have a logging implementation in your dependencies before the Java logs will show up. For details about slf4j, see the manual. For a quick walkthrough on how to get basic logging, see this page.
- The default value of the maxConnections setting has been increased from 4 to 24.
- slf4j-simple dependency is now optional.
aws-java-sdk-core
version increased to1.10.34
. Please ensure your AWS SDK components all have the same major version.- Record completion callbacks are now executed in a threadpool rather than on the IPC thread.
- Fixed bug that produced invalidly signed requests on the latest version of OSX.
Bug fixes and improved temp file management in Java wrapper.
- The wrapper no longer creates unique a copy of the native binary on disk per instance of KinesisProducer. Multiple instances can now share the same file. Clobbering between versions is prevented by adding the hash of the contents to the file name.
- Idle CPU usage has been reduced (Issue 15)
- The native process should now terminate when the wrapper process is killed (Issues 14, 16)
Significant platform compatibility improvements and easier credentials configuration in the Java wrapper.
- The KPL now works on Windows (Server 2008 and later)
- The lower bound on the
RecordMaxBufferedTime
config has been removed. You can now set it to 0, although this is discouraged
- The java packages have been renamed to be consistent with the package names of the KCL (it's now com.amazonaws.services.kinesis.producer).
- The
Configuration
class has been renamedKinesisProducerConfiguration
. KinesisProducerConfiguration
now accepts the AWS Java SDK'sAWSCredentialsProvider
instances for configuring credentials.- In addition, a different set of credentials can now be provided for uploading metrics.
- Glibc version requirement has been reduced to 2.5 (from 2.17).
- The binary is now mostly statically linked, such that configuring
(DY)LD_LIBRARY_PATH
should no longer be necessary. - No longer uses
std::shared_timed_mutex
, so updating libc++ on OS X is no longer necessary - Removed dependencies on glog, libunwind and gperftools.
- First release
The KPL is written in C++ and runs as a child process to the main user process. Precompiled native binaries are bundled with the Java release and are managed by the Java wrapper.
The Java package should run without the need to install any additional native libraries on the following operating systems:
- Linux distributions with kernel 2.6.18 (September 2006) and later
- Apple OS X 10.9 and later
- Windows Server 2008 and later
Note the release is 64-bit only.
A sample java project is available in java/amazon-kinesis-sample
.
Rather than compiling from source, Java developers are encouraged to use the KPL release in Maven, which includes pre-compiled native binaries for Linux, OSX and Windows.
An upgrade to the latest xcode command line tools is recommended.
Once you have the repo checked out, run bootstrap.sh
. This will download and compile all the dependencies. They will be installed into {project_root}/third_party
.
Run ./b2 -j 8
to build in debug mode.
Run ./b2 release -j 8
to build release.
Unit tests:
./b2 release -j 8 && ./bin/darwin-4.2.1/release/tests --report_level=detailed
Currently only GCC is officially supported for building on Linux. Using other compilers may require modifying the build script (the project uses boost build).
You will need gcc 4.9.x (or above).
After that, it's the same as above, i.e.
./bootstrap.sh
./b2 release -j 8
Unit tests:
./b2 release -j 8 && ./bin/gcc-4.9.2/release/tests --report_level=detailed
Note that if you build on a newer distribution and then deploy to an older one, the program may fail to run because it was linked against a newer version of glibc that the old OS doesn't have. To avoid this problem, build on the same (or a similar) distribution as the one you'll be using in production. We build the official release with RHEL 5 (kernel 2.6.18, glibc 2.5).
At the time of writing MSVC (14.0) is not able to compile the KPL native code. As such, we have to use MinGW.
We have successfully built and tested the KPL using the nuwen.net MinGW release 13.0. You can get it here. Get the version that includes git.
Once you have extracted MinGW, go to the MinGW root folder and run open_distro_window.bat
. This opens a command prompt with the current directory set to the MinGW root folder.
From that command prompt, run git\git-bash
. This will turn the command prompt into a bash shell.
From there on it's the same procedure as Linux and OSX, i.e.:
# You may wish to cd to another dir first
git clone https://github.com/awslabs/amazon-kinesis-producer amazon-kinesis-producer
cd amazon-kinesis-producer
./bootstrap.sh
# We need to specify the toolset, otherwise boost will use MSVC by default, which won't work
./b2 release -j 8 toolset=gcc
If you encounter an error that says File or path name too long
, try moving the project folder to /c/
to shorten the paths.
Unit tests:
./b2 release -j 8 toolset=gcc && bin/gcc-mingw-5.1.0/release/tests.exe --report_level=detailed
You may need to increase the command prompt's line and/or buffer size to see the whole test report.
There are two options. You can either pack the binaries into the jar like we did for the official release, or you can deploy the native binaries separately and point the java code at it.
You will need JDK 1.7+, Apache Maven and Python 2.7 installed.
If you're on Windows, do the following in the git bash shell we used for building. You will need to add java
and python
to the PATH
, as well as set JAVA_HOME
for maven to work.
Run python pack.py
Then
pushd java/amazon-kinesis-producer
mvn clean package source:jar javadoc:jar install
popd
This installs the jar into your local maven repo. The jar itself is available in java/amazon-kinesis-producer/targets/
The java wrapper contains logic that will extract and run the binaries during initialization.
The KinesisProducerConfiguration
class provides an option setNativeExecutable(String val)
. You can use this to provide a path to the kinesis_producer[.exe]
executable you have built. You have to use backslashes to delimit paths on Windows if giving a string literal.