- Fixed bug in smart-seq2 pileup_cells reporting
- raer added to the Bioconductor project
- Changes made to prepare for bioc submission
- Replaced base R fisher test with c-wrapper to call htslib fisher test, which speeds up execution many fold.
- The options to write to tabix indexed output files have been removed from
pileup_sites()
as they have limited utility and introduce unwanted code complexity.
- The
genomic-unstranded
option for thelibrary-type
argument inFilterParam()
has been renamed tounstranded
, and theunstranded
option has been removed.
- Function arguments involving a fasta file have been renamed to all be
fasta
- added a single cell specific AEI calculation (
calc_scAEI()
)
- added method to count base consensus base when counting UMIs with
pileup_cells()
using the sum of base qualities to select consensus.
-
pileup_cells()
now allows for multiple alleles to be queried at a site. -
Fixed an indexing bug in
pileup_cells()
that misassigned sites to counts.
-
annot_snps
will now compare the site allele to the SNP allele and return a columnsnp_matches_site
indicating if the site matches the snp. -
added new function,
find_scde_sites()
to identify differentially editing sites in single cell data using fishers exact tests. -
pileup_cells
now respects themin_depth
andmin_variant_reads
FilterParameters.
- support
BamFile
andBamFileList
inputs topileup_sites()
andpileup_cells()
, which provides an option to provide custom BAI index file names.
- rename
prep_for_de()
andperform_de()
tomake_de_object()
andfind_de_sites()
.
-
default values for
edit_from
andedit_to
forcalc_edit_frequency()
have been changed toA
andG
respectively. -
renamed
type
argument inperform_de
totest
and removedtype
argument inprep_for_de
-
added support for processing multiple BAM files with
calc_AEI()
. -
Dropped minimally used
bad_reads
andreads
parameters frompileup_sites()
-
Added utility to screen scRNA-seq bam files for regions with oligo-dT mispriming (
find_mispriming_sites()
). -
add option to query ref and alt SNP alleles
-
added tests for SummarizedExperiment filtering approaches
-
added a strand bias stat
sor
using approach from GATK (StrandOddsRatio), and a confidence scorecalc_confidence()
from SAILOR pipeline. -
'N' bases in read or reference are ignored
-
Removed outdated or unused functionality:
- bed indexing (
indexBed
and related C code) - bam tag indexing (
build_tag_index
,show_tag_index
,get_tag_bam
, ) - bam tag index based single cell approach (
sc_editing
) - bam tag indexing C code from bri (
src/bri/*
) - sparse matrix merging for
merge_pileups()
. - unneeded utilities (
filter_by_coverage
) - Remaining (and mostly unused) Rcpp code
- Removed fastmap, Rcpp, zlibbioc, RColorBrewer, and BiocGenerics dependencies
- Removed system requirements for C libraries used by bri
- bed indexing (
-
The bed indexing used in
pileup_sites()
has been replaced with the region indexing approach frompileup_cells()
. -
pileup_sites()
now requires a GRanges object rather than a bed file. Thebedfile
parameter has been removed and replaced with asites
parameter. -
Renamed
Ref
andVar
output columns toREF
andALT
andnVar
was renamed tonAlt
. This provides consistency with VCF format and consistency acrosspileup_cells()
andpileup_sites()
function calls -
pileup_cells()
gained functionality to process multiple smart-seq2 style bam files. -
Changed
filterParam
argument inpileup_sites
andpileup_cells
toparam
for simplicity. -
Added
FilterParam
to exclude multi-allelic sitesreport_multiallelic
, or exclude reporting a variant in the Var assay based on allelic frequency (min_allelic_freq
). -
The
bam_flags
parameter used inpileup_sites
andpileup_cells
has been moved into theFilterParam
class. -
The
bedindex
parameter forpileup_sites
has been removed. This option is not needed at the user level and is planned to be replaced by the regional indexing used inpileup_cells()
. -
Added
FilterParam
option to trim reads based on fractional distance from 5' (ftrim_5p
) or 3' end (ftrim_3p
). -
Incorporated RBPZ and VDB statistics from bcftools, now returned as rowData columns when calling
pileup_sites
. -
A
RangedSummarizedExperiment
object is now directly returned frompileup_sites
. Usingmerge_pileups
is no longer necessary and is not an exported function. -
Renamed
get_pileup
topileup_sites
andcreate_se
tomerge_pileups
-
Rename
remove_clustered_variants
,remove_multiallelic
, andremove_splice_variants
tofilter_*
for consistency. -
Rewrote and renamed the single cell editing function
sc_editing
topileup_cells()
.pileup_cells()
does not require sorting and index by cell barcode, uses a new format to specify sites to query and requires providing the reference and alternate alleles of interest, writes to disk in a sparse matrix compatible format to reduce memory usage, and should have more performance as there is no need to query a fasta index. -
Implemented method to collapse reads with duplicate UMIs.
-
Added option to filter sites in pileup based on number of reads containing a variant (#54)
-
Added a
NEWS.md
file to track changes to the package.