diff --git a/README.md b/README.md index 573969c..74b59d7 100644 --- a/README.md +++ b/README.md @@ -2,7 +2,7 @@ Powered by GTXLab of Genetalks. -technique preview download URL:https://github.com/Genetalks/gtz/archive/0.2.2h_tech_preview.tar.gz +technique preview download URL:https://github.com/Genetalks/gtz/archive/0.2.2i_tech_preview.tar.gz [中文说明](https://github.com/Genetalks/gtz/blob/master/README_chs.md "Markdown"). @@ -12,7 +12,7 @@ GTX Compressor is a fastq compressor and also can be used as a generic data comp GTX Compressor compresses the 33 qualities of FASTQ files (NA12878_1.fastq), with the size of approximately 200GB, to 19% of the original size, in less than 13 minutes, over the AWS R4.8xlarge machine (or the same configuration server) at a speed of more than 256MB/s. As the FASTQ data which is producted by X10 with only **7 qualities, GTX Compressor can gains 5.5% compression.** -**GTX Compressor provides "Directly compress to the cloud" function**. Out of commercial consideration, users not only need to store the massive data generated by gene sequencing locally, but also need to quickly and steadily transfer the data to the cloud. GTX Compressor system can compress the fastq files and concurrently transfer the compressed data to the Amazon AWS S3 platform , Ali cloud OSS platform or Tecent cloud COS platform, by supplying the same compression speed and compression rate with local compression. With ordinary 100Mbits Intenet line, GTX Compressor can directly compress 200GB Fastq file to the cloud in just 30 minutes. +**GTX Compressor provides "to the cloud" function**. Out of commercial consideration, users not only need to store the massive data generated by gene sequencing locally, but also need to quickly and steadily transfer the data to the cloud. GTX Lab has developed gtransfer, a tool that provides long-distance and ultra-long-distance data transmission services under normal bandwidth conditions, enabling easy end-to-end or end-to-cloud data transmission. ## System highlights @@ -22,7 +22,7 @@ GTX Compressor system features: - **High performance:** GTX compressor fully exploits the concurrency of the CPU, the new Haswell CPU architecture, and the computing power of the new instructions such as AVX2, BMI2, which makes GTX compressor gain high compression speed even on a common server, with the throughout of 114MB/s for the whole process of compression and transmission. -- **high-speed direct compression to the cloud:** GTX compressor support direct compression to the cloud and direct decompression from the cloud. Over a common 20-core server with 100Mbits Intenet line, GTX Compressor can derectly compress 200GB Fastq file to the cloud in only 30 minutes. +- **high-speed to the cloud:** GTX gtransfer support long-distance and ultra-long-distance data transmission services under normal bandwidth conditions, enabling easy end-to-end or end-to-cloud data transmission. ## System environment requirements - 64-bit Linux system (CentOS 6.5 or above, or Ubuntu 12.04 or more, and with Ububtu 14.04 and above 64-bit operating system recommended) @@ -31,7 +31,7 @@ GTX Compressor system features: ## Installation Instruction GTX compressor system can be directly used by unpacking, and does not rely on any other library. -The download package contains two tar.gz packages for the ubuntu version and the centos version respectly. Choose the corresponding tar.gz package, extract, and use gtz command for the extraction gtz_0.1_ubuntu_tech_preview directory or gtz_0.1_centos_tech_preview directory. +The download package contains two tar.gz packages for the ubuntu version and the centos version respectly. Choose the corresponding tar.gz package, extract, and use gtz command for the extraction gtz_0.2.2i_ubuntu_tech_preview directory or gtz_0.2.2i_centos_tech_preview directory. ## Command line instructions @@ -39,10 +39,8 @@ The download package contains two tar.gz packages for the ubuntu version and the ``` USAGE: -./gtz [--list] [-e ] [-f] [--endpoint ] [--appid ] [--timeout ] - [--secret-access-key ] [--access-key-id ] [-b - ] [-s ] [-c] [-n ] [-l ] [-i] - [-d] [--delete] [-a] [-g ] [-o ] [--] [--version] +./gtz [--gz] [--rbin-path] [--outdir] [--list] [-e ] [-c] + [-d] [-a] [-o ] [--] [--version] [-h] ... ``` @@ -50,190 +48,43 @@ USAGE: General Options Instruciton: - -h: Outputs the above command line help information - \-\- version: Outputs the version number of the gt_compress program -- \-\- access-key-id: Specifies the cloud platform user ID -- \-\- secret-access-key: Specifies the cloud platform user key -- \-\- endpoint: Specifies the access domain name and data center of the Ali cloud OSS platform -- \-\- appid: Specifies the access domain name and data center of the Tecent cloud COS platform Compression Option Description: -- -f, \-\- force -- \-\-timeout: Specifies the upload timeout threshold -- -i: Increases the index during the compression, which mainly used in the compressed file to quickly retrieve a section of the fastq file, and might reduce the compression speed - -a: append mode, the original file will be appended to the compressed data -- -g: the speed-up compression in group, the more groups, the more need for cpu and memory, and the faster compression. If you do not specify this value, the program will automatically select the optimal value based on cpu and memory. - -o: Specifies the compressed file name. When not specified, the default is out.gtz - file_name: the file or directory need to be compressed. If not specified, the system will read data from the standard input. Decompression Option Description: - -d, \-\-decode: decompression mode, required +- \-\-gz: extract to .gz format.If not specified,the default output is .fastq format +- \-\-rbin-path: Specify the path of the rbin file.The system default path is ~/.config/gtz +- \-\-outdir: Specify the output directory of the extracted file - \-\-list: List all compressed file names in the archive, used together with the -d parameter - -e, \-\-extract: decompresses and extract the target files specified (The file names are separated by ":") in the compressed file. Must used together with the -d parameter -- -f, \-\-force: Forcely delete the object within the container -- \-\-timeout: Specifies the download timeout value - -c, \-\-stdout: output to console(standard output). It can only be use for decompression. - file_name: the file to be decompressed ### Examples: -Configure environment variables: - -export access_key_id=xxxxxx - -export secret_access_key=xxxxxx - -export endpoint=xxxxxx (Only set when transfering to OSS or COS) - -export appid=xxxxxx (Only set when transfering to COS) - ### Compression examples -Direct compression to Ali OSS: - - ./gtz -o oss://gtz/out.gtz   source.fastq (or source.fastq.gz , gtz supports recompress fastq.gz file) - - or - - zcat source.fastq.gz | ./gtz -o oss://gt-compress/out.gtz - - -Direct compression to Tecent COS: - - ./gtz -o cos://gtz/out.gtz   source.fastq (or source.fastq.gz , gtz supports recompress fastq.gz file) - - or - - zcat source.fastq.gz | ./gtz -o cos://gt-compress/out.gtz + ./gtz -o output.gtz source.fastq Compress the original file source.fastq to output.gtz + ./gtz -o output.gtz source.fastq.gz Compress the original file source.fastq.gz to output.gtz + ./gtz -o output.gtz source1.fastq source2.fastq Compress multiple original files source1.fastq,source2.fastq into output.gtz (Support multiple file compression to a single compressed package) + ./gtz -o output.gtz source_diretory Compress the directory source_directory to output.gtz (Supports compression of the directory to a compressed package) + ./gtz -a -o output.gtz source3.fastq Compress the file source3.fastq in an append manner to the existing output.gtz -Direct compression to AWS S3 - - ./gtz -o s3://gtz/out.gtz source.fastq (or source.fastq.gz , gtz supports recompress fastq.gz file) - - or: - - zcat source.fastq.gz | ./gtz -o s3://gt-compress/out.gtz - -Direct compression locally - - ./gtz -o gtz/out.gtz source.fastq - - or: - - zcat source.fastq.gz | ./gtz -o gtz/out.gtz - - -Massive small files (<500MB each) compression: - - -- To compress a large number of small files (500MB or less) to an package or transfer to the cloud, GTZ can work with tar by using pipe. It is very useful to compress and transfer amounts of small data at very fast speed. (Note: In this way, GTZ uses binary data compression algorithm, the fastq or fastq.gz files inside the tar package will not be treated specially ). - - tar -cf - ./you_dir_or_file | gtz -o /dest.gtz - -- Direct compression to AWS S3 , Aliyun OSS or Tecent COS: - - tar -cf - ./you_dir_or_file | gtz -o s3://bucket/dest.gtz - - tar -cf - ./you_dir_or_file | gtz -o oss://bucket/dest.gtz - - tar -cf - ./you_dir_or_file | gtz -o cos://bucket/dest.gtz - - -- Direct decompression: - - gtz -c -d /dest.gtz | tar -xf - - - gtz -c -d s3://bucket/dest.gtz | tar -xf - - - gtz -c -d oss://bucket/dest.gtz | tar -xf - - - gtz -c -d cos://bucket/dest.gtz | tar -xf - - - -Notice: Large size files (500MB or more) or the directory full of Large size files, especially fastq or fastq.gz file or its directory, we suggest to use GTZ to directly compress and package, it will be more faster. - - -### Add files to the compressed package - - ./gtz -a -o oss://gtz/out.gtz /A/source2.fastq # -a denotes it is the additional mode - - ./gtz -a -o cos://gtz/out.gtz /A/source2.fastq # -a denotes it is the additional mode - - ./gtz -a -o s3://gtz/out.gtz /A/source2.fastq # -a denotes it is the additional mode - - ./gtz -a -o gtz /out.gtz /A/source2.fastq # -a denotes it is the additional mode - - -### View the files contained in the compressed gtz file - - ./gtz_0.2.0_ubuntu_release/gtz --list -d oss://gtz/out.gtz - - ./gtz_0.2.0_ubuntu_release/gtz --list -d cos://gtz/out.gtz - - ./gtz_0.2.0_ubuntu_release/gtz --list -d s3://gtz/out.gtz - - ./gtz_0.2.0_ubuntu_release/gtz --list -d gtz/out.gtz - ### Decompression examples + ./gtz -d output.gtz Decompress output.gtz,the default output is fastq format + ./gtz -d output.gtz --gz Decompress output.gtz ,decompress it to .gz format + ./gtz -d output.gtz --outdir output_path Decompress output.gtz into the output_path directory + ./gtz -d output.gtz --rbin-path rbin_path Decompress the enterprise version of high-compression package output.gtz,and specify the path to the corresponding rbin file + ./gtz -d output.gtz --list View the list of files in the compressed package output.gtz -Direct decompression from Ali OSS - - ./gtz -d oss://gtz/out.gtz - - Decompress several files separately: - - # -e denotes the target decompression files, seperated by ":" - ./gtz -e source.fastq:/A/source2.fastq -d oss://gtz/out.gtz - - Decompress the target firles to the tube: - - # -c denotes output files to the console; -e denotes the target decompression file. - ./gtz -c -e source.fastq -d oss://gtz/out.gtz > myfile.txt - - or - - ./gtz -c -e source.fastq -d oss://gtz/out.gtz | gzip -c > source.gz - - -Direct decompression from Tecent COS - - ./gtz -d cos://gtz/out.gtz - - Decompress several files separately: - - # -e denotes the target decompression files, seperated by ":" - ./gtz -e source.fastq:/A/source2.fastq -d cos://gtz/out.gtz - - Decompress the target firles to the tube: - - # -c denotes output files to the console; -e denotes the target decompression file. - ./gtz -c -e source.fastq -d cos://gtz/out.gtz > myfile.txt - - or - - ./gtz -c -e source.fastq -d cos://gtz/out.gtz | gzip -c > source.gz - -Direct decompression from AWS S3 - - ./gtz -d s3://gtz/out.gtz - - Decompress several files separately: - - # -e denotes the target decompression files, seperated by ":" - ./gtz -e source.fastq:/A/source2.fastq -d s3://gtz/out.gtz - - Decompress the target firles to the tube: - - # -c denote output files to the console; -e denotes the target decompression file. - ./gtz -c -e source.fastq -d s3://gtz/out.gtz > myfile.txt - or - ./gtz -c -e source.fastq -d s3://gtz/out.gtz | gzip -c > source.gz - -Direct decompression locally - - ./gtz -d ./gtz/out.gtz Decompress several files separately: diff --git a/README_chs.md b/README_chs.md index 4976dda..2e8cffc 100644 --- a/README_chs.md +++ b/README_chs.md @@ -1,8 +1,8 @@ -# GTX Compressor (直压上云技术预览版) +# GTX Compressor (技术预览版) Powered by GTXLab of Genetalks. -技术预览版本下载地址: https://github.com/Genetalks/gtz/archive/0.2.2h_tech_preview.tar.gz +技术预览版本下载地址: https://github.com/Genetalks/gtz/archive/0.2.2i_tech_preview.tar.gz [English Manual](https://github.com/Genetalks/gtz/blob/master/README.md "Markdown"). @@ -13,7 +13,7 @@ GTX Compressor是Genetalks公司GTX Lab实验室开发的面向大型数据( GTX Compressor可以在AWS C4.8xlarge机器(或同配置服务器),**以超过114MB/s的速度,将接近200GB大小的33个质量数的FASTQ文件(NA12878_1.fastq),在13分钟内压缩到原大小的19%**,而对于X10等只有 **7个质量数的FASTQ数据,其压缩率更可以达到5.5%** 。 -**GTX Compressor提供“直压上云”功能**。考虑商业使用时,用户不仅需要将测序产生的海量数据存储于本地,更迫切地寻求将数据快速稳定传输至云端的能力。 GTX Compressor的数据压缩引擎允许用户直接将fastq文件压缩存储到亚马逊AWS平台,阿里云OSS平台或者腾讯云COS平台,并保持与本地压缩相同的压缩速度与压缩效率。普通100Mbits Intenet线路,可以在短短30分钟内稳定地将200GB Fastq文件的直压上云。 +**GTX Compressor提供“上云”功能**。考虑商业使用时,用户不仅需要将测序产生的海量数据存储于本地,更迫切地寻求将数据快速稳定分发传输的能力。GTX Lab实验室已经研发出在普通带宽条件下,提供远距离和超远距离数据传输服务的工具gtransfer,可以便捷地进行端到端或端到云平台的数据传输。 ## 系统亮点 @@ -23,7 +23,7 @@ GTX Compressor可以在AWS C4.8xlarge机器(或同配置服务器),**以 - **高性能:** GTX compressor充分发挥了CPU的并发性以及新型Haswell CPU体系结构与AVX2、BMI2等指令集的计算能力,使得在普通服务器上的压缩速度,最高能够以接近114MB/s的输入流量输入数据并压缩完毕。 -- **高速直压上云:** GTX compressor支持直压上云和从云端直接解压下载功能。普通的20核服务器,通过百兆Intenet线路,可以在短短30分钟内稳定地将200GB Fastq文件的直压上云。  +- **高速上云:** GTX gtransfer支持在普通带宽条件下,提供远距离和超远距离数据传输服务,可以便捷地进行端到端或端到云平台的数据传输。 @@ -36,7 +36,7 @@ GTX Compressor可以在AWS C4.8xlarge机器(或同配置服务器),**以 ## 安装说明 本系统采用开包即用的打包原则,不依赖当前系统其他任何库。 -下载包内包含ubuntu版本和centos版本的两个tar.gz的包。选择对应tar.gz的包,解压后,gtz命令就在当前解压的gtz_0.2.2b_ubuntu_tech_preview目录或gtz_0.2.2b_centos_tech_preview目录中,直接使用即可。 +下载包内包含ubuntu版本和centos版本的两个tar.gz的包。选择对应tar.gz的包,解压后,gtz命令就在当前解压的gtz_0.2.2i_ubuntu_tech_preview目录或gtz_0.2.2i_centos_tech_preview目录中,直接使用即可。 ## 命令行说明 @@ -46,10 +46,8 @@ GTX Compressor可以在AWS C4.8xlarge机器(或同配置服务器),**以 ``` USAGE: -./gtz [--list] [-e ] [-f] [--endpoint ] [--appid ] [--timeout ] - [--secret-access-key ] [--access-key-id ] [-b - ] [-s ] [-c] [-n ] [-l ] [-i] - [-d] [--delete] [-a] [-g ] [-o ] [--] [--version] +./gtz [--gz] [--rbin-path] [--outdir] [--list] [-e ] [-c] + [-d] [-a] [-o ] [--] [--version] [-h] ... @@ -59,178 +57,52 @@ USAGE: - -h:输出以上命令行帮助信息 - \-\-version:输出gt_compress程序的版本号 -- \-\-access-key-id : 指定云平台用户ID -- \-\-secret-access-key: 指定云平台用户密钥 -- \-\-endpoint : 指定阿里云OSS平台或者腾讯云COS平台的访问域名和数据中心 -- \-\-appid : 指定腾讯云COS平台的用户ID 压缩选项说明: - -- -f, \-\-force : 强制删除容器内的object -- \-\-timeout : 指定上传超时阀值 -- -i:压缩时增加索引,主要用于在压缩文件中快速检索fastq文件的某段内容,该选项会降低压缩速度 - -a:追加模式,本次压缩的内容会追加到压缩文件中 -- -g:分组加速压缩,分组越多,需要的cpu和内存越多,压缩速度越快。不指定该值时,程序会根据cpu和内存自动选择最优值 - -o:指定压缩文件名,不指定时,默认为out.gtz - file_name:需要压缩的文件或目录, 若不指定,则从标准输入中读入数据 解压选项说明: - - -d,\-\-decode : 解压模式 +--gz : 解压为gz格式。如果不指定,默认输出为.fastq格式 +--rbin-path : 指定本地解压所需要的rbin文件的路径 +--outdir : 指定解压输出目录 --list : 列出压缩包中所有的压缩文件名,与-d参数一起使用 -e, --extract : 解压压缩包中指定的压缩文件,文件名之间用冒号:分割,与-d参数一起使用 -- \-\-timeout : 指定下载超时阀值 - -c,\-\-stdout           :   解压数据输出至标准输出, 只能与 -d 参数一起使用 -- -o:指定输出文件名,使用-n或-l时需要指定该选项,否则不需要该选项 - file_name:需要压缩的文件, 若不指定,则从标准输入中读入数据 ### 示例: -配置环境变量: - -export access_key_id=xxxxxx - -export secret_access_key=xxxxxx - -export endpoint=xxxxxx (该环境变量只有上传至OSS或者COS时才需设置) - -export appid=xxxxxx (该环境变量只有上传至COS时才需设置) - ### 压缩举例 + ./gtz -o output.gtz source.fastq 将原文件source.fastq压缩为output.gtz + ./gtz -o output.gtz source.fastq.gz 将原文件source.fastq.gz压缩为output.gtz + ./gtz -o output.gtz source1.fastq source2.fastq 将多个原文件source1.fastq,source2.fastq压缩为output.gtz (支持多个文件压缩到一个压缩包) + ./gtz -o output.gtz source_diretory 将目录source_diretory压缩为output.gtz (支持目录压缩到一个压缩包) + ./gtz -a -o output.gtz source3.fastq 将文件source3.fastq以追加的方式,压缩到已有的output.gtz中 -直压阿里OSS: - - ./gtz -o oss://gtz/out.gtz   source.fastq (or source.fastq.gz, gtz支持对fastq.gz的重新压缩) - - 或者 - # zcat 通过管道将fastq的数据送入gtz加压,zcat解压出来的fastq数据流在 out.gtz 中将以stdin这个文件名存在 - zcat source.fastq.gz | ./gtz -o oss://gt-compress/out.gtz - -直压腾讯COS: - - ./gtz -o cos://gtz/out.gtz   source.fastq (or source.fastq.gz, gtz支持对fastq.gz的重新压缩) - - 或者 - # zcat 通过管道将fastq的数据送入gtz加压,zcat解压出来的fastq数据流在 out.gtz 中将以stdin这个文件名存在 - zcat source.fastq.gz | ./gtz -o cos://gt-compress/out.gtz - -直压AWS S3: - - ./gtz -o s3://gtz/out.gtz   source.fastq (or source.fastq.gz, gtz支持对fastq.gz的重新压缩) - - 或者: - # zcat 通过管道将fastq的数据送入gtz加压,zcat解压出来的fastq数据流在 out.gtz 中将以stdin这个文件名存在 - zcat source.fastq.gz | ./gtz -o s3://gt-compress/out.gtz - -压缩到本地: - - ./gtz -o gtz/out.gtz source.fastq (or source.fastq.gz, gtz支持对fastq.gz的重新压缩) - - 或者 - # zcat 通过管道将fastq的数据送入gtz加压,zcat解压出来的fastq数据流在 out.gtz 中将以stdin这个文件名存在 - zcat source.fastq.gz | ./gtz -o gtz/out.gtz - -### 海量小文件与tar配合进行快速压缩和上传: - - 为了大量小型文件(500MB以下)快速压缩或传输上云,支持使用管道方式与 tar 配合进行打包压缩, 对于海量小型数据,压缩和传输速度将非常好。(注意,这种方式下,GTZ使用的针对普通二进制数据的压缩算法,不会对tar包里面的fastq进行特殊处理)。 - tar -cf - ./you_dir_or_file | gtz -o /dest.gtz - - 或者直接打包上云: - - tar -cf - ./you_dir_or_file | gtz -o s3://bucket/dest.gtz - - tar -cf - ./you_dir_or_file | gtz -o oss://bucket/dest.gtz - - tar -cf - ./you_dir_or_file | gtz -o cos://bucket/dest.gtz - - 直接传输回来解包: - - gtz -c -d s3://bucket/dest.gtz | tar -xf - - - 当然,大型文件(500MB以上)或大型文件所在的目录,尤其是 fastq或fastq.gz文件或其所在的目录,还是建议直接使用 GTZ 进行打包压缩直传,速度更快。 - -### 追加文件进压缩包 - - ./gtz -a -o oss://gtz/out.gtz /A/source2.fastq # -a 指当前是追加模式 - - ./gtz -a -o cos://gtz/out.gtz /A/source2.fastq # -a 指当前是追加模式 - - ./gtz -a -o s3://gtz/out.gtz /A/source2.fastq # -a 指当前是追加模式 - - ./gtz -a -o gtz/out.gtz /A/source2.fastq # -a 指当前是追加模式 - -### 查看压缩包里包含的文件 - - ./gtz_0.2.0_ubuntu_release/gtz --list -d oss://gtz/out.gtz - - ./gtz_0.2.0_ubuntu_release/gtz --list -d cos://gtz/out.gtz - - ./gtz_0.2.0_ubuntu_release/gtz --list -d s3://gtz/out.gtz - - ./gtz_0.2.0_ubuntu_release/gtz --list -d gtz/out.gtz ### 解压举例 - -从阿里 OSS 解压: - - ./gtz -d oss://gtz/out.gtz - - 或者 单独抽取几个文件: - # -e 代表抽取文件,后面要抽取的文件名称间,用 ":" 隔开 - ./gtz -e source.fastq:/A/source2.fastq -d oss://gtz/out.gtz - - 或者某个文件到管道: - # -c 代表输出到console, -e 代表抽取其中的某个文件 - ./gtz -c -e source.fastq -d oss://gtz/out.gtz > myfile.txt - 或者 - ./gtz -c -e source.fastq -d oss://gtz/out.gtz | gzip -c > source.gz - -从腾讯 COS 解压: - - ./gtz -d cos://gtz/out.gtz - - 或者 单独抽取几个文件: - # -e 代表抽取文件,后面要抽取的文件名称间,用 ":" 隔开 - ./gtz -e source.fastq:/A/source2.fastq -d cos://gtz/out.gtz - - 或者某个文件到管道: - # -c 代表输出到console, -e 代表抽取其中的某个文件 - ./gtz -c -e source.fastq -d cos://gtz/out.gtz > myfile.txt - 或者 - ./gtz -c -e source.fastq -d cos://gtz/out.gtz | gzip -c > source.gz - -从AWS S3 解压: - - ./gtz -d s3://gtz/out.gtz - - 或者 单独抽取几个文件: + ./gtz -d output.gtz 解压output.gtz,默认输出为fastq格式 + ./gtz -d output.gtz --gz 解压output.gtz,输出为fastq.gz格式 + ./gtz -d output.gtz --outdir output_path 解压output.gtz到output_path目录中 + ./gtz -d output.gtz --rbin-path rbin_path 解压企业版高倍率压缩包output.gtz,指定对应的rbin文件的路径 + ./gtz -d output.gtz --list 查看压缩包output.gtz里的文件列表 + # -e 代表抽取文件,后面要抽取的文件名称间,用 ":" 隔开 - ./gtz -e source.fastq:/A/source2.fastq -d s3://gtz/out.gtz - - 或者某个文件到管道: - # -c 代表输出到console, -e 代表抽取其中的某个文件 - ./gtz -c -e source.fastq -d s3://gtz/out.gtz > myfile.txt - 或者 - ./gtz -c -e source.fastq -d s3://gtz/out.gtz | gzip -c > source.gz - -从本地文件: + ./gtz -e source.fastq:/A/source2.fastq -d gtz/out.gtz + + 或者某个文件到管道: + # -c 代表输出到console, -e 代表抽取其中的某个文件 + ./gtz -c -e source.fastq -d gtz/out.gtz > myfile.txt + 或者 + ./gtz -c -e source.fastq -d gtz/out.gtz | gzip -c > myfastq.gz - ./gtz -d ./gtz/out.gtz - - 或者 单独抽取几个文件: - # -e 代表抽取文件,后面要抽取的文件名称间,用 ":" 隔开 - ./gtz -e source.fastq:/A/source2.fastq -d gtz/out.gtz - - 或者某个文件到管道: - # -c 代表输出到console, -e 代表抽取其中的某个文件 - ./gtz -c -e source.fastq -d gtz/out.gtz > myfile.txt - 或者 - ./gtz -c -e source.fastq -d gtz/out.gtz | gzip -c > myfastq.gz - diff --git a/gtz_public_0.2.2h_ubuntu_pre_release.tgz b/gtz_public_0.2.2h_ubuntu_pre_release.tgz deleted file mode 100644 index 44e1980..0000000 Binary files a/gtz_public_0.2.2h_ubuntu_pre_release.tgz and /dev/null differ diff --git a/gtz_public_0.2.2i_centos_pre_release.tgz b/gtz_public_0.2.2i_centos_pre_release.tgz new file mode 100644 index 0000000..1a170da Binary files /dev/null and b/gtz_public_0.2.2i_centos_pre_release.tgz differ diff --git a/gtz_public_0.2.2h_centos_pre_release.tgz b/gtz_public_0.2.2i_ubuntu_pre_release.tgz similarity index 58% rename from gtz_public_0.2.2h_centos_pre_release.tgz rename to gtz_public_0.2.2i_ubuntu_pre_release.tgz index 977223e..c7f9ddf 100644 Binary files a/gtz_public_0.2.2h_centos_pre_release.tgz and b/gtz_public_0.2.2i_ubuntu_pre_release.tgz differ