Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement InlineVector and use it for lens and strides in TensorDescriptor #3408

Open
CAHEK7 opened this issue Nov 28, 2024 · 0 comments
Open

Comments

@CAHEK7
Copy link
Contributor

CAHEK7 commented Nov 28, 2024

During some host-side perf measurements in #3346 we've found out that keeping 2 dynamic vectors for an array of 5 elements is inefficient, since we are extensively creating and coping TensorDescriptor across the library.

To minimize host side overhead we need a simple vector-like container with statically allocate storage of "at most" 5 elements.

It has to be something like template <typename Type, size_t Capacity> class InlineVector;

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant