Add slice_ functions #68

wvictor14 · 2023-08-19T04:48:02Z

Hi I found this issue on tidyomics and thought it would be good experience to try

Let me know how it looks and if there's any changes needed

This adds the slice_ family:

slice_min
slice_max
slice_head
slice_tail

R/dplyr_methods.R

This reverts commit d0b8591.

stemangiola · 2023-08-20T05:25:46Z

R/dplyr_methods.R

+    tibble::rowid_to_column(var  = 'row_number___')  |>
+    dplyr::select(-everything(), row_number___, {{ .by }}) |>


Thanks a lot!

Sorry if I am being pedantic here, but it would be better to do select first and rowid_to_column later both from a memory and elegance perspective.

Another little feedback, a standard we use is to do (this applies to all new function calls)

@importFrom tibble ...

and not use

tibble:: ...

With these two changes I think the PR is good to go!

No problem, will do

R/dplyr_methods.R

stemangiola · 2023-08-20T12:19:12Z

@wvictor14 have a look at the github action; it is failing. Check maybe on your local CHECK and BiocCheck because here I have to "allow" github action each time, unfortunately. After you become a contributor, github action will be runnable by you in the future.

stemangiola · 2023-08-20T12:31:49Z

@wvictor14 please add your authorship details here https://docs.google.com/spreadsheets/d/19XqhN3xAMekCJ-esAolzoWT6fttruSEermjIsrOFcoo/edit?usp=sharing

wvictor14 · 2023-08-21T02:24:34Z

@stemangiola I fixed the errors for CRAN check. There are two bioccheck errors that I am not sure best to address:

ERROR: Maintainer must add package name to Watched Tags on the support
site; Edit your Support Site User Profile to add Watched Tags.

and

ERROR: At least 80% of man pages documenting exported objects must have
runnable examples.

Also I realized that the option of inputting a tibble to order_by argument for slice_min and slice_max doesn't work with the select |> slice approach. So I added an internal function return_args that returns the variables as symbols of such cases, that can supply the necessary columns to select 7cca3f8

stemangiola · 2023-08-21T03:04:19Z

There are two bioccheck errors that I am not sure best to address

No problem, that could be another "challenge"

Also I realized that the option of inputting a tibble to order_by argument for slice_min and slice_max doesn't work with the select |> slice approach

Could you please give me minimal example ?

wvictor14 · 2023-08-21T03:43:16Z

Also I realized that the option of inputting a tibble to order_by argument for slice_min and slice_max doesn't work with the select |> slice approach

Could you please give me minimal example ?

This is what I mean

# in action
pbmc_small |> slice_min(tibble::tibble(nFeature_RNA, nCount_RNA), n = 2)

# return_args extracts the variables out of an expression
order_by <- expr(tibble::tibble(nFeature_RNA, nCount_RNA))
order_by_vars <- return_args(!!order_by)

# let's you input into select
pbmc_small |> select(!!!order_by_vars) |>

  # and then can apply slice_min
  slice_min(!!order_by, n = 6)

# A tibble: 6 × 2
  nFeature_RNA nCount_RNA
         <int>      <dbl>
1           26         51
2           29        157
3           29        172
4           30        150
5           30        202
6           31         62

Not sure if there is a more straightforward way, or how to make it more robust. What do you think?

stemangiola · 2023-08-21T05:02:08Z

Also I realized that the option of inputting a tibble to order_by argument for slice_min and slice_max doesn't work with the select |> slice approach

Could you please give me minimal example ?

This is what I mean
# in action
pbmc_small |> slice_min(tibble::tibble(nFeature_RNA, nCount_RNA), n = 2)

# return_args extracts the variables out of an expression
order_by <- expr(tibble::tibble(nFeature_RNA, nCount_RNA))
order_by_vars <- return_args(!!order_by)

# let's you input into select
pbmc_small |> select(!!!order_by_vars) |>

  # and then can apply slice_min
  slice_min(!!order_by, n = 6)

# A tibble: 6 × 2
  nFeature_RNA nCount_RNA
         <int>      <dbl>
1           26         51
2           29        157
3           29        172
4           30        150
5           30        202
6           31         62
Not sure if there is a more straightforward way, or how to make it more robust. What do you think?

Great! I left a last comment to address (Above) and then this PR is on its way!

wvictor14 · 2023-08-22T01:16:03Z

A detail. In the tidy transcriptomics framework, we adopt the standard of, no abbreviations and no acronyms.

Would you mind changing return_args into a self-explanatory function name, e.g. return_arguments_of_... or anything else that would make sense.

This also applies to any variable, e.g. args_expr, args_vars, ...

I think I got them all!

stemangiola · 2023-08-22T01:35:59Z

Thanks a lot @wvictor14 for your PR! Well done.

Now that you are an expert, if you want to translate your PR to the other tidy adapters tidySingleCellExperiment and/or tidySummarizedExperiment feel free, that would be welcome!

No expectations of course.

wvictor14 added 10 commits August 18, 2023 20:31

slice_head

80665ae

slice_tail

9fcea46

use <- instead of = assignment

222fef8

fix example

bf18ec4

slice min

d5dd780

tests for slice_head _tail _min

bd5a53a

slice_max

5640700

add by example

bcd55fe

add .by to slice

582f57d

drop exports for slice functions

b80fed3

stemangiola assigned wvictor14 Aug 19, 2023

stemangiola added the enhancement New feature or request label Aug 19, 2023

stemangiola linked an issue Aug 19, 2023 that may be closed by this pull request

modernize tidy transcriptomics and genomics with dplyr > 1.0.0 and new tidyr tidyomics/genomics-todos#6

Open

stemangiola self-requested a review August 19, 2023 05:01

stemangiola requested changes Aug 19, 2023

View reviewed changes

R/dplyr_methods.R Outdated Show resolved Hide resolved

wvictor14 added 5 commits August 19, 2023 11:05

extract minimal number of columns

1d2ed45

approach b - create slice df from scratch

d0b8591

Revert "approach b - create slice df from scratch"

7489035

This reverts commit d0b8591.

extract only necessary columns slice_

140d4ba

add a slice test

c181a3c

stemangiola requested changes Aug 20, 2023

View reviewed changes

wvictor14 added 3 commits August 19, 2023 22:36

slice_sample use only necessary metadata variables

02493b1

slice_sample docs

4e679cb

replace with native pipe

b0f4ef2

wvictor14 commented Aug 20, 2023

View reviewed changes

R/dplyr_methods.R Outdated Show resolved Hide resolved

remove :: , importFrom tibble::rowid_to_column

0e173ff

wvictor14 added 2 commits August 20, 2023 17:34

export slice functions

7420193

fix slice_min slice_max tibble input

7cca3f8

wvictor14 added 3 commits August 20, 2023 17:34

test for slice_min _max tibble input

9e742d1

add return_args to utilities.R

fe67aa5

add cran note

dab807b

wvictor14 added 3 commits August 20, 2023 23:33

expand abbreviations

f2b7772

expand more abbreviations

90fd5b5

fix return_arguments_of docs

468f29f

stemangiola merged commit d08b38b into stemangiola:master Aug 22, 2023
2 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add slice_ functions #68

Add slice_ functions #68

wvictor14 commented Aug 19, 2023

stemangiola Aug 20, 2023 •

edited

Loading

wvictor14 Aug 20, 2023

stemangiola commented Aug 20, 2023

stemangiola commented Aug 20, 2023

wvictor14 commented Aug 21, 2023 •

edited

Loading

stemangiola commented Aug 21, 2023

wvictor14 commented Aug 21, 2023

stemangiola commented Aug 21, 2023

wvictor14 commented Aug 22, 2023

stemangiola commented Aug 22, 2023 •

edited

Loading

		tibble::rowid_to_column(var = 'row_number___') \|>
		dplyr::select(-everything(), row_number___, {{ .by }}) \|>

Add slice_ functions #68

Add slice_ functions #68

Conversation

wvictor14 commented Aug 19, 2023

stemangiola Aug 20, 2023 • edited Loading

Choose a reason for hiding this comment

wvictor14 Aug 20, 2023

Choose a reason for hiding this comment

stemangiola commented Aug 20, 2023

stemangiola commented Aug 20, 2023

wvictor14 commented Aug 21, 2023 • edited Loading

stemangiola commented Aug 21, 2023

wvictor14 commented Aug 21, 2023

stemangiola commented Aug 21, 2023

wvictor14 commented Aug 22, 2023

stemangiola commented Aug 22, 2023 • edited Loading

stemangiola Aug 20, 2023 •

edited

Loading

wvictor14 commented Aug 21, 2023 •

edited

Loading

stemangiola commented Aug 22, 2023 •

edited

Loading