Skip to content

CPU/CUDA: Gemma 2 FlashAttention support (#8542) #2

CPU/CUDA: Gemma 2 FlashAttention support (#8542)

CPU/CUDA: Gemma 2 FlashAttention support (#8542) #2

Annotations

1 warning

Push Docker image to Docker Hub (light, .devops/llama-cli.Dockerfile, linux/amd64,linux/arm64)

succeeded Aug 24, 2024 in 13m 40s