Skip to content

fortran01/s3psync

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

S3 Parallel Sync

Dependency Review Python package PyPI

This script is used to sync files and folders to an S3 bucket in parallel, leveraging the aws s3 sync command. The aws s3 sync command supports multipart uploads and can utilize up to 10 threads, making it particularly useful when you have a large number of large files to upload. This script allows you to specify the number of parallel instances of aws s3 sync to use.

Requirements

  • Python 3.10 or higher (includes pip)
  • AWS CLI
  • AWS profile with the necessary permissions to perform S3 uploads (see here for more information on how to set up AWS profiles)

Usage

To install the tool, run the following command:

python -m pip install --upgrade pip wheel
python -m pip install --upgrade s3psync

To sync files and folders to an S3 bucket in parallel, run the following command:

s3psync --help
Usage: s3psync [OPTIONS]

Options:
  --version
  --aws-profile TEXT            AWS profile name  [required]
  --parent-dir TEXT             Parent directory to sync  [required]
  --s3-bucket TEXT              S3 bucket to sync to  [required]
  --parallel-instances INTEGER  Number of parallel instances for aws s3 sync,
                                default is 1
  --help                        Show this message and exit.

Here is an example of how to use the tool:

s3psync \
  --aws-profile tmpuser_for_data_upload \
  --s3-bucket premtests1232 \
  --parent-dir "/path/to/testdir" \
  --parallel-instances 30

Note that you need to have the necessary AWS permissions to perform S3 uploads. The tool will use the AWS CLI to perform the upload, so you need to have the AWS CLI installed and configured with the appropriate credentials. You can use the --aws-profile option to specify the AWS profile to use.

Development

Dev tools

  • Make utility (Make for Windows, Make for Linux and Mac is usually pre-installed)

Set up the environment using the provided Makefile. Follow these steps:

  1. Ensure you have make installed on your system. You can check this by running make --version in your terminal. Install or update make if needed.
  2. Install the necessary dependencies by running make install or make all.
  3. Create a Python virtual environment by running python3 -m venv --prompt s3psync venv. Activate it by running source venv/bin/activate.
  4. Verify the installation by running s3psync --version. If the tool is installed correctly, it should display the version number.
  5. Run the tool for example by running python -m s3psync --help.
  6. Exit the virtual environment by running deactivate.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

No packages published