PaliGemma Inference Pipeline

Replication and efficient CPU-compatible inference for the PaliGemma model, a state-of-the-art vision-language model combining a SigLIP vision encoder with a Gemma language decoder.

Features

  • Efficient CPU-based inference for PaliGemma
  • Dynamic quantization for optimized performance
  • Support for image and text inputs
  • Customizable inference parameters

Installation

  1. Clone the repository:

    git clone https://github.com/codingwithsurya/PaliGemma-CPU-Inference.git
    cd PaliGemma-CPU-Inference
    
  2. Install the required packages:

    pip install -r requirements.txt
    

Usage

Run inference using the following command:

python inference.py --prompt "Describe this image" --image_file_path "path/to/your/image.jpg" --only_cpu

Parameters

  • --prompt: The text prompt for the model
  • --image_file_path: Path to the input image
  • --only_cpu: Flag to ensure CPU-only inference
  • --max_tokens_to_generate: Maximum number of tokens to generate (default: 50)
  • --temperature: Sampling temperature (default: 0.7); a full example invocation follows this list
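
For example, to cap generation at 100 tokens and sample with a lower temperature (the prompt and values here are only illustrative), run:

    python inference.py \
        --prompt "What is shown in this image?" \
        --image_file_path "path/to/your/image.jpg" \
        --only_cpu \
        --max_tokens_to_generate 100 \
        --temperature 0.5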

Technical Details

This project implements several advanced deep learning techniques:

  • Transformer architecture with multi-head attention and feed-forward layers
  • Vision Transformer (ViT) for image processing
  • Contrastive learning techniques inspired by CLIP and SigLIP
  • Rotary positional embeddings and grouped query attention
  • KV-cache for efficient token generation
  • Dynamic quantization for optimized CPU inference (see the sketch following this list)
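
The dynamic quantization step can be reproduced with PyTorch's built-in torch.quantization.quantize_dynamic utility. The snippet below is a minimal, self-contained sketch: the small Sequential module stands in for the actual PaliGemma weights, which this repository loads from the downloaded checkpoint before quantization.

    import torch
    import torch.nn as nn

    # Stand-in for the language-model decoder; in the real pipeline the model
    # is assembled from the PaliGemma checkpoint before this step.
    model = nn.Sequential(
        nn.Linear(2048, 2048),
        nn.GELU(),
        nn.Linear(2048, 2048),
    )

    # Convert nn.Linear weights to int8; activation scales are computed
    # dynamically at run time, which shrinks the model and speeds up
    # matrix multiplications on CPU.
    quantized = torch.quantization.quantize_dynamic(
        model, {nn.Linear}, dtype=torch.qint8
    )

    x = torch.randn(1, 2048)
    print(quantized(x).shape)  # torch.Size([1, 2048])

Only the nn.Linear layers are converted, so the quantized model remains a drop-in replacement for the float model during generation.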

Acknowledgements
