Create BunisHSPCData #32

dtm2451 · 2021-02-21T18:47:52Z

Addresses #28.
Still need to fill in the citation & add to longtests/.

dtm2451 · 2021-02-21T18:50:53Z

inst/extdata/manifest.csv

@@ -6,6 +6,7 @@ Reference,Taxonomy,Part,Number,Call
 @baron2016singlecell,10090,pancreas,1886,BaronPancreasData('mouse')
 @bhaduri2020cell,9606,cortical organoids,242349,BhaduriOrganoidData()
 @buettner2015computational,10090,embryonic stem cells,288,BuettnerESCData()
+@bunis2021haematopoietic,9606,haematopoietic stem and progenitor,???,BunisHSPCData


Where does the manifest's Number come from? Is it a cell number or taxonomy item? And if cell number, the unfiltered number?

Just give the number of cells that you get from running BunisHSPCData() with the default arguments. Note that the last thing is the actual command (it'll get wrapped in backticks when it gets printed in the vignette).
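
As a minimal sketch of how that number could be obtained (assuming a build of scRNAseq that already includes this PR's BunisHSPCData() and a reachable ExperimentHub), count the columns of the returned object, since cells are columns:

```r
library(scRNAseq)

# Default arguments, which is what the manifest entry is meant to reflect.
sce <- BunisHSPCData()

# Cells are stored as columns, so this is the value for the Number field.
ncol(sce)
```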

LTLA

Add some stub tests in longtests/ to make sure that things run with a variety of options; see some of the examples in there for more details.
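
A stub along these lines might be enough to start with. This is only a sketch: the file location is a guess at the layout, and the only argument used is the filtered option mentioned in the docs of this PR, so adjust it to the final signature.

```r
# Hypothetical file: longtests/testthat/test-BunisHSPCData.R
library(testthat)
library(scRNAseq)

test_that("BunisHSPCData runs with default arguments", {
    sce <- BunisHSPCData()
    expect_s4_class(sce, "SingleCellExperiment")
    expect_gt(ncol(sce), 0)
})

test_that("BunisHSPCData runs with filtered=FALSE", {
    # Assumes the 'filtered' argument described in the roxygen docs of this PR.
    sce <- BunisHSPCData(filtered=FALSE)
    expect_s4_class(sce, "SingleCellExperiment")
    expect_gt(ncol(sce), 0)
})
```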

LTLA · 2021-02-21T20:06:07Z

R/BunisHSPCData.R

+#' 
+#' @details
+#' Column metadata is recreated from GEO using the author-supplied TSV of per-cell annotations, or retrieved from a processed version of the data shared by authors via figshare.
+#' This contains information such as the tissue & sample of origin, age group, likely cell type, and Developmental Stage Scoring. Cevelopmental Stage 


Cevelopmental Stage?

LTLA · 2021-02-21T20:06:25Z

R/BunisHSPCData.R

+#' This contains information such as the tissue & sample of origin, age group, likely cell type, and Developmental Stage Scoring. Cevelopmental Stage 
+#'
+#' If \code{filtered=TRUE}, only the cells used by the authors in their final analysis are returned.
+#' Otherwise, an additional \code{filtered} field will be present in the \code{\link{colData}}, indicating whether the cell was retained by the authors. 


retained field?

LTLA · 2021-02-21T20:06:33Z

R/BunisHSPCData.R

+#' Otherwise, an additional \code{filtered} field will be present in the \code{\link{colData}}, indicating whether the cell was retained by the authors. 
+#'
+#' All data are downloaded from ExperimentHub and cached for local re-use.
+#' Specific resources can be retrieved by searching for \code{scRNAseq/bacher-tcell}.


LTLA · 2021-02-21T20:06:44Z

R/BunisHSPCData.R

+#' @author Daniel Bunis
+#'
+#' @references
+#' Bunis et al. 2021


See format of other references here.

Shouldda noted that I hadn't finalized the docs text yet, so I was gonna get to this later + likely wouldda caught the others above myself! But noted and I'll fix all of these.

Ah, I thought you had cottoned on to the fact that the best way to get shit past me is to distract me with lots of little things that need fixing!

LOL no, but now that you mention it, I did notice you missed a filtered/retained mistake in the actual code.

LTLA · 2021-02-21T20:08:11Z

inst/scripts/2.6.0/make-bunis-hspc-metadata.R

@@ -1,7 +1,7 @@
 write.csv(file="../../extdata/2.6.0/metadata-bunis-hspc.csv",
    data.frame(
        Title = sprintf("Bunis human HSPC %s", c("counts", "colData", "rowData")),
-        Description = sprintf("%s for the Bunis human haematopoietic stem-progenitor single-cell RNA-seq dataset", 
+        Description = sprintf("%s for the Bunis human haematopoietic stem and progenitor single-cell RNA-seq dataset", 


Note: changing this will need a re-propagation to ExperimentHub, as the manifests there do not update automatically. Compile it to recreate the CSVs and then we'll notify the EHub maintainers that this is altered.

It's not that important really. I'll just revert this.

dtm2451 · 2021-02-24T02:10:38Z

=/. Now that I've tried actually testing, I seem to get an error with the colData add because the hub coldata is not full-size...

I can fix it by changing line 18 of create_sce.R from

args$colData <- hub[hub$rdatapath==file.path(host, sprintf("coldata%s.rds", suffix))][[1]]

to

args$colData <- hub[hub$rdatapath==file.path(host, sprintf("coldata%s.rds", suffix))][[1]][colnames(all.assays[[1]]),, drop = FALSE]

but I can't imagine that's what you would actually want to happen.
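
For illustration only, the size mismatch can be reproduced with toy base R objects (these are stand-ins, not the actual hub resources). Aligning the annotations to the matrix columns pads unannotated barcodes with NA rows, which is part of why a blanket fix inside create_sce.R seems doubtful:

```r
# Toy stand-ins: a count matrix with four barcodes, annotations for only three.
counts <- matrix(0L, nrow = 2, ncol = 4,
    dimnames = list(c("gene1", "gene2"), c("A", "B", "C", "D")))
coldata <- data.frame(tissue = c("BM", "BM", "UCB"),
    row.names = c("A", "B", "C"))

# The annotation table is smaller than the matrix, so it cannot be attached as-is.
nrow(coldata) == ncol(counts)

# Align annotations to the matrix columns; barcodes without an entry become NA rows.
m <- match(colnames(counts), rownames(coldata))
aligned <- coldata[m, , drop = FALSE]
rownames(aligned) <- colnames(counts)
aligned
```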

LTLA · 2021-02-24T03:22:31Z

Probably set has.coldata=FALSE in the .create_sce() call and then add it outside, once we've decided whether or not we want just the filtered cells or everything. Note that there are three filtered modes:

1. Filtered, the cells you used. This is what you get with filtered=TRUE.
2. Unfiltered cells. This is what you get with... I dunno, filtered="cells", perhaps?
3. Unfiltered barcodes (i.e., the full matrix). This is what you get with filtered=FALSE.

A little different from the other cases, and you'll have to use isTRUE and isFALSE to do the checks.
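
A rough base R sketch of that three-way check follows. The helper name, the toy inputs, and the retained column are all hypothetical; the point is only that isTRUE and isFALSE let filtered be either a logical or a string, whereas a plain if (filtered) would error on "cells".

```r
# Hypothetical helper: choose which barcodes to keep for each value of 'filtered'.
# 'coldata' stands in for the per-cell annotations; its row names are barcodes,
# and 'retained' (an assumed column name) marks cells kept by the authors.
select_barcodes <- function(all.barcodes, coldata, filtered = TRUE) {
    if (isTRUE(filtered)) {
        # Only the cells used in the final analysis.
        rownames(coldata)[coldata$retained]
    } else if (isFALSE(filtered)) {
        # Every barcode in the full count matrix.
        all.barcodes
    } else if (identical(filtered, "cells")) {
        # Every annotated cell, retained or not.
        rownames(coldata)
    } else {
        stop("'filtered' should be TRUE, FALSE or \"cells\"")
    }
}

# Toy inputs: four barcodes in the matrix, three of them annotated as cells.
barcodes <- c("A", "B", "C", "D")
cd <- data.frame(retained = c(TRUE, FALSE, TRUE), row.names = c("A", "B", "C"))

select_barcodes(barcodes, cd, filtered = TRUE)     # "A" "C"
select_barcodes(barcodes, cd, filtered = "cells")  # "A" "B" "C"
select_barcodes(barcodes, cd, filtered = FALSE)    # "A" "B" "C" "D"
```

The colData can then be subset to whichever barcodes come back and attached after the .create_sce() call, rather than inside it.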

dtm2451 · 2021-04-28T23:14:25Z

ready to go?

LTLA · 2021-04-29T04:28:25Z

Looks good to me.

Create BunisHSPCData, add to manifest & authors@R

cc8ccf5

dtm2451 changed the title from "Create BunisHSPCData, add to manifest & authors@R" to "Create BunisHSPCData" Feb 21, 2021

LTLA reviewed Feb 21, 2021

Update BunisHSPCData.R

186ffd8

revert metadata creation .R change

052d173

dtm2451 and others added 4 commits February 26, 2021 17:29

Update BunisHSPCData.R

2d18173

add test for BunisHSPCData

508971e

Update bunis fxn, manifest, and roxygenize

a5540ea

Move most bunis tests to longtests

a119e88

Minor streamlining, get rid of a performance regression.

55e64d5

LTLA merged commit 75862a1 into LTLA:master Apr 29, 2021
