Computer resources consuming #180

Open
ekorolevanyrun opened this issue Apr 5, 2022 · 1 comment

Comments


ekorolevanyrun commented Apr 5, 2022

Hi!

I am using ursadb to search malware samples with YARA rules, and my goal is to make ursadb as fast as possible. However, queries take 12-15 seconds when I run a select over gigabyte-sized datasets (there are approximately 40-50 datasets). I also noticed that ursadb uses only 10-15% of a single CPU thread and 4-5 MB/sec of disk throughput (measured using iotop).

So my question is: is it possible to speed up ursadb searches and make ursadb use the machine's full resources?

We have an Optane SSD, so I assume the search speed should be higher.


I have tried decreasing the dataset size.
I have also set the config parameter (database_workers) to different values.

Neither helped.

msm-code (Contributor) commented Nov 26, 2022

Hi, are you still interested in having this fixed?

Ursadb should be either using 100% CPU or spending 100% of its time waiting on disk. Even with an SSD, in most cases it should be blocked on the SSD. 12-second queries when running a select over gigabyte-sized datasets are extremely suspicious. A query should take milliseconds, or seconds at most.

Is your dataset unusual in any way? For example, are the files large, or are there a lot of very small files? Is there a way for me to reproduce the issue?
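
To tell whether the process is CPU-bound or waiting on disk while a query runs, it may help to sample its counters directly rather than rely only on iotop. The following is a minimal sketch, not part of ursadb: it assumes Linux with `/proc` available and permission to read `/proc/<pid>/io` for the ursadb process (same user or root). The function names `parse_cpu_ticks`, `parse_read_bytes`, and `sample` are made up for illustration.

```python
import os
import sys
import time


def parse_cpu_ticks(stat_text: str) -> int:
    """Return utime + stime (in clock ticks) from the text of /proc/<pid>/stat."""
    # The command name is wrapped in parentheses and may contain spaces,
    # so split on the last ')' before splitting on whitespace.
    fields = stat_text.rsplit(")", 1)[1].split()
    # fields[0] is the process state (field 3 in proc(5));
    # utime is field 14 and stime is field 15, i.e. indexes 11 and 12 here.
    return int(fields[11]) + int(fields[12])


def parse_read_bytes(io_text: str) -> int:
    """Return the read_bytes counter from the text of /proc/<pid>/io."""
    for line in io_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "read_bytes":
            return int(value)
    raise ValueError("read_bytes not found")


def sample(pid: int, interval: float = 1.0) -> tuple[float, float]:
    """Return (cpu_percent, read_mb_per_sec) for a process over one interval."""
    ticks_per_sec = os.sysconf("SC_CLK_TCK")

    def snapshot() -> tuple[int, int]:
        with open(f"/proc/{pid}/stat") as f:
            cpu = parse_cpu_ticks(f.read())
        with open(f"/proc/{pid}/io") as f:
            read = parse_read_bytes(f.read())
        return cpu, read

    cpu0, read0 = snapshot()
    time.sleep(interval)
    cpu1, read1 = snapshot()

    # CPU time is summed over all threads, so this can exceed 100%.
    cpu_percent = 100.0 * (cpu1 - cpu0) / ticks_per_sec / interval
    read_mb_per_sec = (read1 - read0) / 1e6 / interval
    return cpu_percent, read_mb_per_sec


if __name__ == "__main__":
    # Usage: python sample.py <pid of the ursadb process>
    cpu, mb = sample(int(sys.argv[1]))
    print(f"cpu={cpu:.1f}%  disk_read={mb:.1f} MB/s")
```

If both numbers stay low for the whole duration of a slow query, the process is likely waiting on something other than computation or storage reads, which would be worth including in a reproduction report.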
