Skip to content

CPU/CUDA: Gemma 2 FlashAttention support (#8542) #2

CPU/CUDA: Gemma 2 FlashAttention support (#8542)

CPU/CUDA: Gemma 2 FlashAttention support (#8542) #2

Annotations

1 warning

Push Docker image to Docker Hub (server-cuda, .devops/llama-server-cuda.Dockerfile, linux/amd64)

succeeded Aug 25, 2024 in 2h 23m 42s