Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for classifiers accepting sparse matrix formats only #155

Open
beevabeeva opened this issue Sep 21, 2019 · 0 comments
Open

Support for classifiers accepting sparse matrix formats only #155

beevabeeva opened this issue Sep 21, 2019 · 0 comments

Comments

@beevabeeva
Copy link

Hi @csala!,

Sorry it's been so long!

I've been making progress with my research but hit a snag recently when trying to implement the state of the art SVM ThunderSVM.
Unfortunately, ThunderSVM only accepts sparse matrix format as the data input.
It is based on a another SVM implementation (LIBSVM). LIBSVM provides a tool to translate between input formats.
My question is, how hard would it be to implement a feature into ATM that would:

  1. Use the LIBSVM conversion tool in a 'pre-process' sort of stage in ATM, to convert standard input data to LIBSVM sparse format.
  2. Use the converted data from step 1 when calling the ThunderSVM (or any other similar classifier) classifier.
  3. Use the results from step 2 as normal, storing them in the ModelDB hub.

I am looking at alternatives to ThunderSVM in case this is not possible/ really hard.
I think implementing standard input into ThunderSVM itself would be even harder, but I will open an issue on their repo too.

** As a side note, when I do use ThunderSVM in ATM, it seems that ATM runs it as usual (but it does get stuck sometimes). It's probably just that ATM is interpreting the sparse format as standard input. But it is curious that is isn't breaking. Let me know if I should open a separate issue and fill in all the technical details.

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant