-
-
Notifications
You must be signed in to change notification settings - Fork 404
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MAST: Cutouts limit #2693
base: main
Are you sure you want to change the base?
MAST: Cutouts limit #2693
Conversation
Codecov Report
@@ Coverage Diff @@
## main #2693 +/- ##
==========================================
- Coverage 66.05% 66.01% -0.05%
==========================================
Files 233 233
Lines 17925 17953 +28
==========================================
+ Hits 11840 11851 +11
- Misses 6085 6102 +17
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wonder whether the InputWarning
is the correct one here, or would rather need a more astroquery/astropy warning type? @keflavich, do you have a preference?
Also, this needs a changelog.
I believe I've addressed everything now so this is ready for a final review @bsipocz @eerovaher . Once I get this approved, I'll do a squash so it's ready for merge. |
astroquery/mast/cutouts.py
Outdated
def _get_default_timeout(self): | ||
""" Gets the default request timeout limit. """ | ||
|
||
return self._service_api_connection.TIMEOUT |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why is this method necessary? (I see in the history that there has been a bit of a back and forth, so maybe it's not needed any more?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I included this for the case where a user might switch between get_cutouts
and download_cutouts
but does not anticipate that the timeout
setting stayed the same. Having the timeout
arg in each individual method implies that it would only change timeout
for a call with that method and not any of the others, so I figured it would make sense to keep the changes contained by resetting that arg each time.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The code has moved around since this last comment so I'm not sure what you're referring to, but I made a call to _get_default_timeout
several lines below L414.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
But it doesn't provide the default timeout, but gets the attribute at all time. If you need the value from the attribute, do call it directly in the code (this is why I don't see the reason for this method), or if you really need the default, do reach out to it directly from the config.
Maybe it's just a naming issue? Though I still think the method is not necessary as it basically just calls one line that you could call directly when needed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I see test failures, so the tests indeed need to be revisited:
FAILED astroquery/mast/tests/test_mast_remote.py::TestMast::test_zcut_download_cutouts - UnboundLocalError: local variable 'limit_reached' referenced before assignment
FAILED astroquery/mast/tests/test_mast_remote.py::TestMast::test_zcut_get_cutouts - UnboundLocalError: local variable 'limit_reached' referenced before assignment
FAILED astroquery/mast/tests/test_mast_remote.py::TestMast::test_hapcut_download_cutouts - UnboundLocalError: local variable 'limit_reached' referenced before assignment
FAILED astroquery/mast/tests/test_mast_remote.py::TestMast::test_hapcut_get_cutouts - UnboundLocalError: local variable 'limit_reached' referenced before assignment
01b70ce
to
8de360b
Compare
Just squashed and rebased, let me know if there's anything else that needs to be addressed before merge. Just a heads up April 7 will be my last day in the office before I leave on vacation for a week @bsipocz @eerovaher |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm afraid some more iterations for the tests are needed to cover the missing (and not working) input case, as well as to fix checking for the raised warning. But overall the PR is pretty close to being ready.
astroquery/mast/cutouts.py
Outdated
def _get_default_timeout(self): | ||
""" Gets the default request timeout limit. """ | ||
|
||
return self._service_api_connection.TIMEOUT |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@@ -1016,6 +1021,15 @@ def test_tesscut_get_cutouts_mt(self): | |||
moving_target=True) | |||
assert error_tica_mt in str(error_msg.value) | |||
|
|||
@pytest.mark.xfail(raises=InputWarning) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is not the correct approach, xfails are more for cases of known buggy behaviour, or expected server side issues, etc.
Instead, please use pytest.warns context manager, there are plenty of usage examples for it in astroquery.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The reason I had to do it this way is because running with pytest.warns will allow the query to run fully, which will take several minutes and also end up failing the test with the timeout error. This is for the queries that exceed the recommended timeout limit of 600 seconds because the cutout size is too large and the timeout
arg hasn't been changed.
if isinstance(size, list) & (mission == 'TESS'): | ||
if len(size) == 2: | ||
if np.isscalar(size[0]): | ||
size = [size[0] * u.pixel, size[1] * u.pixel] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I convert the individual elements in size
to pixels rather than the whole data structure because unit conversion for the whole data structure happens 3 lines below, so doing that twice for the pixel unit case would result in a unit of pix^2
which leads to incompatible dimensions being compared. This is the error that comes up:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, I would think you don't need the size[0].unit
in your example and then it would work (you already have that in the screenshot below)
size = [size[0] * u.pixel, size[1] * u.pixel] | ||
|
||
with u.set_enabled_equivalencies(u.pixel_scale(21 * u.arcsec / u.pixel)): | ||
limit_reached = (size * size[0].unit > 30 * u.pixel).any() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
here you could put the list into a quantity and then do the check.
size2 = u.Quantity(size2)
So, maybe distinguish in the isinstance
conditional between Quantity and not yet quantity inputs?
astroquery/mast/cutouts.py
Outdated
limit_reached = False | ||
|
||
# Checking 2d size inputs for the recommended cutout size | ||
if not isinstance(size, (int, float, u.Quantity)) & (mission == 'TESS'): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My goal here was to catch for any array-like data structures, so lists tuples or np arrays. Unfortunately, I can't call for them explicitly because u.Quantity
objects are arrays themselves (see below), so I went with excluding anything that is a single int/float/u.Quantity value, which will implicitly pick up list/tuple/arrays data structures.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Needed parentheses around the not isinstance(size, (int, float, u.Quantity))
- this has been patched, as reflected in the latest build.
Just made my last commit @bsipocz , this is ready for review. I added comments to certain lines which should hopefully provide more context. Let me know if there's anything else I should address before squashing |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
two more points to come back to for a tiny bit of a cleanup, but otherwise this will be good to go.
size = [size[0] * u.pixel, size[1] * u.pixel] | ||
|
||
with u.set_enabled_equivalencies(u.pixel_scale(21 * u.arcsec / u.pixel)): | ||
limit_reached = (size * size[0].unit > 30 * u.pixel).any() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
here you could put the list into a quantity and then do the check.
size2 = u.Quantity(size2)
So, maybe distinguish in the isinstance
conditional between Quantity and not yet quantity inputs?
astroquery/mast/cutouts.py
Outdated
def _get_default_timeout(self): | ||
""" Gets the default request timeout limit. """ | ||
|
||
return self._service_api_connection.TIMEOUT |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
But it doesn't provide the default timeout, but gets the attribute at all time. If you need the value from the attribute, do call it directly in the code (this is why I don't see the reason for this method), or if you really need the default, do reach out to it directly from the config.
Maybe it's just a naming issue? Though I still think the method is not necessary as it basically just calls one line that you could call directly when needed.
@bsipocz I've removed the I'm not sure I understood this comment here, are you suggesting to remove the
|
I'll do a suggestion commit when next to a computer (travelling atm). Basically I meant to suggest chopping up the conditional to distinguish between vector quantities vs a list of scalar quantities, so the case you show in the screenshot would work. |
…rge.Adding arg to change API request timeout cutoff.
Hi @bsipocz and @keflavich, I'm tagging the interim |
Addresses some points from the conversation here #2447 .
Changes:
timeout
argument toget_cutouts
that allows the user to modify the request processing time limit so that at timeout error doesn't occur at 600s.NOTE: Screenshots are outdated
Users selecting large cutout sizes (either in pixels, degrees, arcmins, or arcseconds) will be met with a warning that they may run into a timeout error. The suggestions are to either change the cutout size to something smaller, or change the request time limit.
The upper limit of 30 pixels (in either direction) is converted to degrees/arcmins/arcseconds for cases where a user may input their cutout size in a unit other than pix:
Users who choose to change the request time limit will receive an
INFO
message informing them of this change:Users can also decrease the request time upper limit. Here is an example where we return to default:
Other cutout services remain unaffected: