question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[Bug] BOT detected when headless mode

See original GitHub issue

Describe the bug if I set headless mode, open the web that I scrap detects as BOT. I know that this is because of WAF. so that it is taken to /_Incapsula_Resource? blah…bla…bla

Any ideas to bypass this problem?

Because if I force it to set headless=false, then the problem I face is that my OS is Linux, which doesn’t have a GUI.

Versions

"dependencies": { "cheerio": "*", "express": "^4.17.1", "moment": "^2.29.1", "mongodb": "^4.2.0", "mysql": "^2.18.1", "puppeteer-extra": "^3.2.3", "puppeteer-extra-plugin-adblocker": "^2.12.0", "puppeteer-extra-plugin-stealth": "^2.9.0", "request-promise": "^4.2.6", "shelljs": "^0.8.4", "socket.io": "^4.4.0", "socket.io-client": "^4.4.0", "sprintf-js": "^1.1.2", "telegraf": "^4.5.2", "util": "^0.12.4" }

Issue Analytics

  • State:open
  • Created 2 years ago
  • Comments:8

github_iconTop GitHub Comments

3reactions
Posturcommented, Feb 21, 2022

for me google detects i’m headless. refuses to log me in because ‘browser may not be secure’ or whatever.

does anyone have a fix for this?

I don’t want to add a display to my environment, I need headless.

2reactions
soshimeecommented, Feb 11, 2022

I have the exact same issue… except I’m on Windows. The protection service on the website I’m trying to scrape is “StackPath.” It passes after a few seconds without headless mode, but it gets blocked instantly with headless mode.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Headless Chrome: What it is & how to detect it
The best way to stop automated threats coming from headless browsers is to use a specialized bot detection solution that blocks all ...
Read more >
It is *not* possible to detect and block Chrome headless
We basically just launch the browser in headless mode, visit the test page, take a screenshot of the results table, and exit. The...
Read more >
Detecting Chrome headless
One day I was accessing a bot remotely, and told it to access a page that was NoCaptcha protected (because the site wasn't...
Read more >
Detecting Headless Chrome: Spotlight on Puppeteer-Extra- ...
Puppeteer—a framework some bot developers use to instrument Headless Chrome—lets users create bots faster, but leaves small traces in the ...
Read more >
4. How to detect puppeteer with 100% accuracy Support ...
... the first of a series on bot detection. I have never told anyone about this. com for testing. 0. We basically just...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found