Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

consider renaming workdir to .work so it will be hidden #189

Closed
kelly-sovacool opened this issue Feb 23, 2024 · 6 comments
Closed

consider renaming workdir to .work so it will be hidden #189

kelly-sovacool opened this issue Feb 23, 2024 · 6 comments

Comments

@kelly-sovacool
Copy link
Member

pairs with #188

@slsevilla
Copy link
Contributor

I don't know if this is a good thing - maybe for during the run but since we are having such an issue with space on CCBR I think leaving this exposed so that users can easily delete it post a run is probably a good idea?

@kelly-sovacool
Copy link
Member Author

Yeah, I agree -- I think I'd rather users know about the work dir so they understand how the pipeline works.

@kopardev's idea was if we switch the config to use publish_dir_mode: link (hardlink), the space won't be counted twice for both work and results.

@slsevilla
Copy link
Contributor

Yeah, if people aren't cleaning up the work dir, that's a prob. I THINK this is really the feature we want:

nextflow-io/nextflow#3818

@kelly-sovacool
Copy link
Member Author

Yeah, if people aren't cleaning up the work dir, that's a prob. I THINK this is really the feature we want:

nextflow-io/nextflow#3818

Yes definitely, the lack of temporary outputs is a huge loss for nxf

@kelly-sovacool
Copy link
Member Author

kelly-sovacool commented Feb 26, 2024

Another thing we should think about is whether we actually want to publish all process outputs (we are currently) or turn off publishing for certain intermediate rules so the outputs directory will be less cluttered, while still making sure users can find the files they may want/need

@slsevilla
Copy link
Contributor

Option2 is my vote. Our Snakemake workflows never output everything. Standard was that if a user wanted something kept, they could request it and we'd add it back in. I've done this a few times already for CARLISLE - I think it's a an easy, space-conscious way to handle it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants