-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Before after scaling-2L #15
base: main
Are you sure you want to change the base?
Conversation
Updated files after running count_scaling
@@ -32,10 +43,10 @@ merge_batch_size: 512 | |||
min_doc_freq: 2 | |||
|
|||
# feature extraction | |||
n_features: 30 | |||
n_features: 16 | |||
n_rand_dim: 3000 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove this
@@ -32,10 +43,10 @@ merge_batch_size: 512 | |||
min_doc_freq: 2 | |||
|
|||
# feature extraction | |||
n_features: 30 | |||
n_features: 16 | |||
n_rand_dim: 3000 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove this
@@ -32,10 +43,10 @@ merge_batch_size: 512 | |||
min_doc_freq: 2 | |||
|
|||
# feature extraction | |||
n_features: 30 | |||
n_features: 16 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Add comment
import numpy as np | ||
|
||
|
||
def load_rand_proj(flname): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove this
def load_rand_proj(flname): | ||
return load(flname) | ||
|
||
def load_bloomfilter(flname): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
REmove this
@@ -0,0 +1,116 @@ | |||
#!/usr/bin/env python |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why is this file in here twice? Is this version ever used or just the one under scripts
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
just the one under scripts.
@@ -7,6 +7,9 @@ | |||
from pybloom import BloomFilter | |||
import scipy.sparse as sp | |||
from sklearn.feature_extraction import FeatureHasher | |||
from sklearn.preprocessing import Binarizer | |||
import numpy as np | |||
|
|||
|
|||
def load_rand_proj(flname): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove this
@@ -7,6 +7,9 @@ | |||
from pybloom import BloomFilter |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove this
args = parse_args() | ||
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
boo white space :)
@@ -57,7 +57,8 @@ def parse_args(): | |||
matrices.append(matrix) | |||
labels.append(group_name) | |||
|
|||
feature_matrix = sp.vstack(matrices).toarray() | |||
#feature_matrix = sp.vstack(matrices).toarray() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove this line since you commented it out
Then you shouldn't commit the file if it isn't used.
…On Tue, Jun 8, 2021 at 10:12 PM Daniel Leskiewicz ***@***.***> wrote:
***@***.**** commented on this pull request.
------------------------------
In feature_extractor.py
<#15 (comment)>:
> @@ -0,0 +1,116 @@
+#!/usr/bin/env python
just the one under scripts.
—
You are receiving this because your review was requested.
Reply to this email directly, view it on GitHub
<#15 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAIQGCDQSUXPRPYY52FGYBTTR3L2ZANCNFSM45MT5MXA>
.
|
These are the updated files pulled from the remote machine that were used to create plots on doing the count scaling on the 2L chromosome arm.