Uncaught exception with ImmoScout/ScrapingAnt
See original GitHub issueHi,
yesterday I tried the ImmoScout provider for the first time. At least once, scraping/retrieval worked fine and yielded results. After a few hours, though, Fredy crashed with the following error:
node:internal/process/promises:246
triggerUncaughtException(err, true /* fromPromise */);
^
Error: Request failed with status code 404
at createError (/usr/home/.../fredy/node_modules/axios/lib/core/createError.js:16:15)
at settle (/usr/home/.../fredy/node_modules/axios/lib/core/settle.js:17:12)
at IncomingMessage.handleStreamEnd (/usr/home/.../fredy/node_modules/axios/lib/adapters/http.js:293:11)
at IncomingMessage.emit (node:events:402:35)
at endReadableNT (node:internal/streams/readable:1340:12)
at processTicksAndRejections (node:internal/process/task_queues:83:21) {
[...]
Does this need to be caught somewhere or am I doing something wrong?
Another issue I faced is:
TypeError: Cannot read properties of undefined (reading 'substring')
at normalize (/usr/home/.../fredy/lib/provider/immoscout.js:8:58)
at Array.map (<anonymous>)
but this one could be easily fixed by checking if o.link
is defined and setting it to empty if not. Apparently some ImmoScout entries do not have a link or the parsing goes wrong.
Issue Analytics
- State:
- Created 2 years ago
- Comments:10 (4 by maintainers)
Top Results From Across the Web
web scraping - Python webscraping blocked - Stack Overflow
I want to webscrape german real estate website immobilienscout24.de. I would like to download the HTML of a given URL and then work...
Read more >ScrapingAnt - Twitter
The easiest way to scrape websites via #API. ScrapingAnt uses the latest Chrome browser and rotates proxies to automate your data mining tasks....
Read more >API Basics | ScrapingAnt
Documentation of ScrapingAnt web scraping REST API that enables to scrape websites with a headless Chrome browser.
Read more >React 18 Support - Nolimits4web/Swiper - IssueHint
Uncaught exception with ImmoScout /ScrapingAnt, 10, 2021-11-08 ; "mute" or exclude forked co-roosting dataset from being indexed, 5, 2022-02-23 ; Unauthorized ...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Has been running fine for almost 18 hours now so appears to be fixed. Thanks a lot.
For the record, this is what works somewhat reliably now: