[Bug] BOT detected when headless mode
See original GitHub issueDescribe the bug if I set headless mode, open the web that I scrap detects as BOT. I know that this is because of WAF. so that it is taken to /_Incapsula_Resource? blah…bla…bla
Any ideas to bypass this problem?
Because if I force it to set headless=false, then the problem I face is that my OS is Linux, which doesn’t have a GUI.
Versions
"dependencies": { "cheerio": "*", "express": "^4.17.1", "moment": "^2.29.1", "mongodb": "^4.2.0", "mysql": "^2.18.1", "puppeteer-extra": "^3.2.3", "puppeteer-extra-plugin-adblocker": "^2.12.0", "puppeteer-extra-plugin-stealth": "^2.9.0", "request-promise": "^4.2.6", "shelljs": "^0.8.4", "socket.io": "^4.4.0", "socket.io-client": "^4.4.0", "sprintf-js": "^1.1.2", "telegraf": "^4.5.2", "util": "^0.12.4" }
Issue Analytics
- State:
- Created 2 years ago
- Comments:8
Top GitHub Comments
for me google detects i’m headless. refuses to log me in because ‘browser may not be secure’ or whatever.
does anyone have a fix for this?
I don’t want to add a display to my environment, I need headless.
I have the exact same issue… except I’m on Windows. The protection service on the website I’m trying to scrape is “StackPath.” It passes after a few seconds without headless mode, but it gets blocked instantly with headless mode.