-
Notifications
You must be signed in to change notification settings - Fork 222
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bugs fixes, adjustments, and cleaning. #84
Conversation
- For Indeed and Monster, the query string was not properly encoded when a quoted phrase with spaces in-between words were provided. The fix was to encode all spaces with the proper character(+/-). This issue and fix also applied to city names. - For GlassDoorStatic, the query string was encoded for a URL and returned improper results. Since this class searches using a JSON payload, the solution was to combine the keywords with a space instead. -The old query construction function was moved from GlassDoorBase to GlassDoorDynamic to prevent the dynamic scraper class from breaking. Fixes issues PaulMcInnis#80.
- Removed unused requests imports - Changed URL strings that had http in them to https - Set provider header dictionary as the default headers on the provider's session object. Setting headers on the actual post/get method call is only necessary for temporarily overriding the session headers on an individual request. - Adjusted search_page_for_job_soups method for GlassDoorStatic class so that it uses GET instead of POST. Sending payload data when we already have the search page URL is unnecessary and can lead to bot detection measures activating more frequently.
- Updated test URL to test for https instead of http
- Previously futures would be deleted whether they finished parsing or not. - Added code to delete the HTML page after it's parsed. - Added code to log any errors during blurb retrieval and parsing.
Codecov Report
@@ Coverage Diff @@
## master #84 +/- ##
==========================================
- Coverage 58.37% 58.34% -0.03%
==========================================
Files 13 13
Lines 1146 1150 +4
==========================================
+ Hits 669 671 +2
- Misses 477 479 +2
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
excellent!
Thanks @PaulMcInnis ! Btw, the codecov bot doesn't seem to like some my changes. Is there anything specific I need to add when making pull requests? |
Its ok, its just unhappy about reduction in coverage - should be possible
to widen the threshold so it doesn't fail for so little a reduction.
On Sat, Jul 11, 2020 at 3:16 PM Erich M ***@***.***> wrote:
Thanks @PaulMcInnis <https://github.com/PaulMcInnis> ! Btw, the codecov
bot doesn't seem to like some my changes. Is there anything specific I need
to add when making pull requests?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#84 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAYKY2LFNYOLLN42VZJZXRDR3C3A3ANCNFSM4OWZRRPQ>
.
--
- Paul McInnis
|
Description
As the title implies, this pull request encompasses a few bug fixes, a few adjustments and some code cleaning. Major changes I've made will be described here; any minor details or specifics can be found within the commit messages.
Fixes
Adjustments
Context of change
Please add options that are relevant and mark any boxes that apply.
Type of change
Please mark any boxes that apply.
How Has This Been Tested?
Checklist:
Please mark any boxes that have been completed.