-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dataset Group #56
Comments
@ekauffma found out that we can use a |
A related prototype (which does not fully generalize) is in iris-hep/analysis-grand-challenge#107. |
Getting back to this: the workaround in iris-hep/analysis-grand-challenge#107 relies on some |
Hi, I was just trying to find this issue again and noticed that it might be in the wrong repository. |
Example of the from servicex import ServiceXDataset
from func_adl_servicex import ServiceXSourceUpROOT
dataset_opendata = "http://xrootd-local.unl.edu:1094//store/user/AGC/datasets/RunIIFall15MiniAODv2/"\
"TT_TuneCUETP8M1_13TeV-powheg-pythia8/MINIAODSIM//PU25nsData2015v1_76X_mcRun2_asymptotic_v12_"\
"ext3-v1/00000/00DF0A73-17C2-E511-B086-E41D2D08DE30.root"
sx_dataset = ServiceXDataset(dataset_opendata, backend_name='uproot', ignore_cache=False)
ds = ServiceXSourceUpROOT(sx_dataset, "events")
dummy_ds = ServiceXSourceUpROOT("cernopendata://dummy", "events", backend_name="uproot")
dummy_ds.return_qastle = True
jet_pt_query = dummy_ds.Select(lambda event: event.jet_pt).value()
res = sx_dataset.get_data_rootfiles_uri(jet_pt_query, as_signed_url=True)
print(f"output .file {res[0].file}")
print(f"input with / -> : {dataset_opendata.replace('/', ':')}") output:
Related to the above: is there a way to work around the need for a |
Small update here: we are going ahead with our workaround for AGC purposes, but I think it should be upstreamed. The loss of more detailed information in the dashboard (since now everything shows up as one big transform) is certainly inconvenient, and an ideal solution should probably be able to preserve the splitting there. I think it would be good to discuss this UX aspect at the AGC workshop. |
During the last AGC challenge, the idea of a dataset group came up.
We should explore this, taking into account prior work done by @kyungeonchoi on his project
The text was updated successfully, but these errors were encountered: