Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
Fix typos

Co-authored-by: Madhavan <[email protected]>
  • Loading branch information
pravinbhat and msmygit authored Oct 22, 2024
1 parent 45998e9 commit 1ed5b0a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -157,7 +157,7 @@ spark-submit --properties-file cdm.properties \
- Performance bottleneck are usually the result of
- Low resource availability on `Origin` OR `Target` cluster
- Low resource availability on CDM VMs, [see recommendations here](https://docs.datastax.com/en/data-migration/deployment-infrastructure.html#_machines)
- Bad schema design which could be cause by Out of balance `Origin` cluster, large partitions (> 100 MB), large rows (> 10MB) and/or high column count
- Bad schema design which could be caused by out of balance `Origin` cluster, large partitions (> 100 MB), large rows (> 10MB) and/or high column count.
- Incorrect configuration of below properties
- `numParts`: Default is 5K, but ideal value is usually around table-size/10MB.
- `batchSize`: Default is 5, but this should be set to 1 for tables where primary-key=partition-key OR where average row-size is > 20 KB. Similarly, this should be set to a value > 5, if row-size is small (< 1KB) and most partitions have several rows (100+).
Expand Down

0 comments on commit 1ed5b0a

Please sign in to comment.