Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Starts and then fails? #144

Closed
Pittsie72 opened this issue May 12, 2021 · 6 comments
Closed

Starts and then fails? #144

Pittsie72 opened this issue May 12, 2021 · 6 comments
Labels

Comments

@Pittsie72
Copy link

I try running the attached two files in an anaconda prompt on my windows 10 PC.
When I run I get this error:
funnel load -s Settings.yaml
[2021-05-12 14:24:47,379] [INFO] JobFunnel: Scraping local providers with: ['IndeedScraperUSAEng', 'MonsterScraperUSAEng']
[2021-05-12 14:24:48,647] [INFO] IndeedScraperUSAEng: Found 13 pages of search results for query=HR+Human Resource+Human Resources
[2021-05-12 14:24:51,164] [INFO] IndeedScraperUSAEng: Scraped 0 job listings from search results pages
[2021-05-12 14:24:51,165] [ERROR] JobFunnel: Failed to scrape jobs for IndeedScraperUSAEng
[2021-05-12 14:24:51,174] [INFO] MonsterScraperUSAEng: No get() or set() will be done for Job attrs: ['REMOTENESS']
[2021-05-12 14:24:51,730] [ERROR] JobFunnel: Failed to scrape jobs for MonsterScraperUSAEng
[2021-05-12 14:24:51,731] [INFO] JobFunnel: Completed all scraping, found 0 new jobs.
[2021-05-12 14:24:51,737] [WARNING] JobFunnel: No new jobs were added to CSV.

Notice it start then stop. I have this on the basic branch not sure if I should try a different one or not
Steel.txt
Bill.txt

@Pittsie72 Pittsie72 added the bug label May 12, 2021
@thebigG
Copy link
Collaborator

thebigG commented May 12, 2021

Yes, unfortunately. I'm currently working on a fix on here b804ff5, but I'm very busy at the moment. So no promises. But hopefully I'll get this fixed soon :).

@thebigG
Copy link
Collaborator

thebigG commented May 13, 2021

What's happening is that all websites are being loaded dynamically and we'll have to switch to using selenium at some point. Turns out it does have a headless option, which is awesome because now we'll be able to test it on CI. If you look at my code, you'll notice I'm passing options to the firefox driver, and one of those options is a headless flag :).

@PabloJT
Copy link

PabloJT commented May 25, 2021

I have the same problem, I hope it is solved soon! :)

[2021-05-25 17:59:01,067] [INFO] JobFunnel: Scraping local providers with: ['IndeedScraperFRFre']
[2021-05-25 17:59:02,176] [ERROR] JobFunnel: Failed to scrape jobs for IndeedScraperFRFre
[2021-05-25 17:59:02,176] [INFO] JobFunnel: Completed all scraping, found 0 new jobs.
[2021-05-25 17:59:02,231] [WARNING] JobFunnel: No new jobs were added to CSV.

@PaulMcInnis
Copy link
Owner

PaulMcInnis commented Sep 21, 2021

Echoing @thebigG I also would like to see this improved, I think it will be some work to get this project up and running again, though a number of people still star this every week, so it feels warranted.

Ideally we avoid re-writing too much as there is a fair bit of code currently.

@PaulMcInnis
Copy link
Owner

added a notice to project README directing people to the discussion.

@PaulMcInnis
Copy link
Owner

noting here that discussion is in #148

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants