Elasticsearch Dumper

EXAMPLE:

elasticsearch-dumper -s http://source:9200 -d http://destination:9200 -i index1,index2

INSTALL:

go get github.com/hoffoo/elasticsearch-dump
or download a prebuilt binary here: https://github.com/hoffoo/elasticsearch-dump/releases/

Application Options:
  -s, --source=     source elasticsearch instance
  -d, --dest=       destination elasticsearch instance
  -c, --count=      number of documents at a time: ie "size" in the scroll request (100)
  -t, --time=       scroll time (1m)
  -f, --force       delete destination index before copying (false)
      --shards=     set a number of shards on newly created indexes
      --docs-only   load documents only, do not try to recreate indexes (false)
      --index-only  only create indexes, do not load documents (false)
      --replicate   enable replication while indexing into the new indexes (false)
  -i, --indexes=    list of indexes to copy, comma separated (_all)
  -a, --all         copy indexes starting with . and _ (false)
  -w, --workers=    concurrency (1)
      --settings    copy sharding settings from source (true)
      --green       wait for both hosts cluster status to be green before dump. otherwise yellow is okay (false)

NOTES:

Has been tested getting data from 0.9 onto a 1.4 box. For other scenaries YMMV. (look out for this bug: elastic/elasticsearch#5165)
Copies using the _source field in elasticsearch. If you have made modifications to it (excluding fields, etc) they will not be indexed on the destination host.
--force will delete indexes on the destination host. Otherwise an error will be returned if the index exists
--time is the scroll time passed to the source host, default is 1m. This is a string in es's format.
--count is the number of documents that will be request and bulk indexed at a time. Note that this depends on the number of shards (ie: size of 10 on 5 shards is 50 documents)
--indexes is a comma separated list of indexes to copy
--all indexes starting with . and _ are ignored by default, --all overrides this behavior
--workers concurrency when we post to the bulk api. Only one post happens at a time, but higher concurrency should give you more throughput when using larger scroll sizes.
Ports are required, otherwise 80 is the assumed port (what)

BUGS:

It will not do anything special when copying the _id (copies _id from source host). If _id is remapped it may not do what you want.
Should assume a default port of 9200

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
README.md		README.md
main.go		main.go
release.sh		release.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Elasticsearch Dumper

EXAMPLE:

INSTALL:

NOTES:

BUGS:

About

Releases 3

Packages

Languages

hoffoo/elasticsearch-dump

Folders and files

Latest commit

History

Repository files navigation

Elasticsearch Dumper

EXAMPLE:

INSTALL:

NOTES:

BUGS:

About

Resources

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages