Before and After 2R #16

DanielLeski · 2021-05-24T06:44:59Z

These are the updated files that are used to create the plots after count scaling on the 2R chromosome arm.

rnowling

Good job! Some comments that will help remove dead (old) code and increase code clarity.

rnowling · 2021-06-08T21:06:30Z

Snakefile

-        "jellyfish dump -c -o {output.counts} {input.jf}"
+        "jellyfish dump -c -L 2 {input.jf} > {output.counts}"
+
+rule extract_features:


Since you aren't using this rule, you should just delete it for clarity

rnowling · 2021-06-08T21:07:30Z

Snakefile

+    shell:
+        "scripts/pca.py --feature-matrices {input.feature_matrices} --groups-fl {params.groups_fl} --plot-fl {output.plot}"
+
+rule pca_count_binary:


You can add the pca outputs to a list in a top-level rule below. This will allow you to run everything with a single command.

rnowling · 2021-06-08T21:08:22Z

config.yaml

@@ -32,10 +25,9 @@ merge_batch_size: 512
 min_doc_freq: 2

 # feature extraction
-n_features: 30
+n_features: 16 
 n_rand_dim: 3000


Since we aren't using random projection, you should remove this to avoid confusion

rnowling · 2021-06-08T21:08:42Z

config.yaml

@@ -32,10 +25,9 @@ merge_batch_size: 512
 min_doc_freq: 2

 # feature extraction
-n_features: 30
+n_features: 16 
 n_rand_dim: 3000
 use_binary_features: "--binary"


Same as above -- with your changes, this isn't used so you should just remove it

rnowling · 2021-06-08T21:10:01Z

config.yaml

@@ -32,10 +25,9 @@ merge_batch_size: 512
 min_doc_freq: 2

 # feature extraction
-n_features: 30
+n_features: 16 


Add a comment to explain that this means 2^16 -- I made this ambiguous by calling it n_features but expecting it to be log2_n_features

rnowling · 2021-06-08T21:10:38Z

feature_extractor.py

+import numpy as np
+
+
+def load_rand_proj(flname):


Remove this

rnowling · 2021-06-08T21:10:47Z

feature_extractor.py

+def parse_args():
+    parser = argparse.ArgumentParser()
+
+    parser.add_argument("--rand-proj-fl",


Remove this

rnowling · 2021-06-08T21:11:40Z

feature_extractor.py

+                        type=str,
+                        required=True)
+
+    parser.add_argument("--binary",


Remove this -- no longer used

rnowling · 2021-06-08T21:11:44Z

feature_extractor.py

+                        type=str,
+                        required=True)
+
+    parser.add_argument("--passlist-bf",


Remove this

Add files via upload

9112c42

DanielLeski requested a review from rnowling June 8, 2021 20:47

rnowling requested changes Jun 8, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Before and After 2R #16

Before and After 2R #16

DanielLeski commented May 24, 2021

rnowling left a comment

rnowling Jun 8, 2021

rnowling Jun 8, 2021

rnowling Jun 8, 2021

rnowling Jun 8, 2021

rnowling Jun 8, 2021

rnowling Jun 8, 2021

rnowling Jun 8, 2021

rnowling Jun 8, 2021

rnowling Jun 8, 2021

		import numpy as np


		def load_rand_proj(flname):

Before and After 2R #16

Are you sure you want to change the base?

Before and After 2R #16

Conversation

DanielLeski commented May 24, 2021

rnowling left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment