`embed_batch` does not use parallel computation #1

EricLBuehler · 2024-07-06T15:46:40Z

I was taking a look at this great crate, and I noticed that embed_batch does not use batching.

candle_embed/src/lib.rs

Lines 261 to 273 in 3036c6f

pub fn embed_batch(&self, texts: &[&str]) -> Result<Vec<Vec<f32>>> {

if texts.is_empty() {

return Err(Error::msg(

"CandleEmbed error: embed_batch called with empty texts",

));

}

let mut embeddings = vec![];

for text in texts {

let embedding = self.embed_one(text)?;

embeddings.push(embedding);

}

Ok(embeddings)

}

Perhaps you can add this feature by padding the texts and then unpadding the embeddings? It is just a thought, but I think it would improve performance significantly for real-world use cases!

The text was updated successfully, but these errors were encountered:

ShelbyJenkins pinned this issue Jul 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`embed_batch` does not use parallel computation #1

`embed_batch` does not use parallel computation #1

EricLBuehler commented Jul 6, 2024

embed_batch does not use parallel computation #1

embed_batch does not use parallel computation #1

Comments

EricLBuehler commented Jul 6, 2024

`embed_batch` does not use parallel computation #1

`embed_batch` does not use parallel computation #1