Merge pull request #1176 from AI-Hypercomputer:add_flag
PiperOrigin-RevId: 716807970
maxtext authors committed Jan 17, 2025
2 parents 0b79a29 + 94d1e7b commit 21706fc
Showing 1 changed file with 5 additions and 0 deletions.
5 changes: 5 additions & 0 deletions benchmarks/xla_flags_library.py
@@ -131,6 +131,11 @@
" --xla_latency_hiding_scheduler_rerun=2"
)

# Flags to optimize pipeline parallelism over DCN with large host offloading.
PIPELINING_FLAGS = (
" --xla_tpu_iova_dma_chunk_size_bytes=16777216"  # Splits host DMA transfers into 16 MiB chunks.
)

# Disable bundle-aware CostModel which was causing worse perf b/357103386.
# Some fusions in the backward pass of the model were 3x slower without this.
DISABLE_BUNDLE_AWARE_COST_MODEL = (
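For readers who want to sanity-check the new constant, here is a minimal sketch of how a flag string in this style might be parsed and combined with other groups. Only `PIPELINING_FLAGS` and its value come from the diff; the helpers `flag_value` and `combine_flags` are hypothetical names introduced for illustration and are not part of `xla_flags_library.py`. How the combined string reaches the TPU runtime is not shown in this commit, so that step is left out.

```python
# Copied from the diff: the only name here that exists in the commit.
PIPELINING_FLAGS = (
    " --xla_tpu_iova_dma_chunk_size_bytes=16777216"
)


def flag_value(flags: str, name: str) -> int:
    """Hypothetical helper: return the integer value of --name=value."""
    for token in flags.split():
        key, sep, value = token.partition("=")
        if sep and key == f"--{name}":
            return int(value)
    raise KeyError(name)


def combine_flags(*groups: str) -> str:
    """Hypothetical helper: join flag groups into one space-separated string."""
    return " ".join(g.strip() for g in groups if g.strip())


chunk = flag_value(PIPELINING_FLAGS, "xla_tpu_iova_dma_chunk_size_bytes")
# 16777216 bytes is exactly 16 MiB (16 * 1024 * 1024).
assert chunk == 16 * 1024 * 1024

# Combining with another group; the second string is a placeholder, not a real flag.
combined = combine_flags(PIPELINING_FLAGS, " --example_placeholder_flag=1")
```

Each group in the library starts with a leading space, which is why the sketch strips the groups before joining them.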
