Random Read timed out.
See original GitHub issueIt seems that under heavy load my API service is throwing the following errors at random intervals:
Traceback (most recent call last):
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/elasticapm/transport/http.py", line 86, in send
response = self.http.urlopen(
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/urllib3/poolmanager.py", line 417, in urlopen
return self.urlopen(method, redirect_location, **kw)
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/urllib3/poolmanager.py", line 375, in urlopen
response = conn.urlopen(method, u.request_uri, **kw)
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/elasticapm/instrumentation/packages/base.py", line 205, in call_if_sampling
return wrapped(*args, **kwargs)
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/urllib3/connectionpool.py", line 755, in urlopen
retries = retries.increment(
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/urllib3/util/retry.py", line 532, in increment
raise six.reraise(type(error), error, _stacktrace)
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/urllib3/packages/six.py", line 770, in reraise
raise value
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/urllib3/connectionpool.py", line 699, in urlopen
httplib_response = self._make_request(
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/urllib3/connectionpool.py", line 447, in _make_request
self._raise_timeout(err=e, url=url, timeout_value=read_timeout)
File "/Users/phil/.pyenv/versions/nvp/lib/python3.9/site-packages/urllib3/connectionpool.py", line 336, in _raise_timeout
raise ReadTimeoutError(
urllib3.exceptions.ReadTimeoutError: HTTPSConnectionPool(host='xxxx', port=443): Read timed out. (read timeout=5)
Its weird because some traces do successfully come before and after I get those errors without anything changing server side.
Have you seen this before? I am running APM Server on EKS through ECK.
Issue Analytics
- State:
- Created 2 years ago
- Comments:6 (6 by maintainers)
Top Results From Across the Web
Dropwizard integration test: random read timeouts · Issue #2878
dropwizard version: 1.3.14 I have written integration test following the document The test sometime works and randomly fails sometime with ...
Read more >java.net.SocketTimeoutException: Read timed out under Tomcat
It just means the client isn't sending. You don't need to worry about it. Browser clients come and go in all sorts of...
Read more >java.net.SocketTimeoutException: Read timed out exception ...
Steps to Reproduce. Unknown. This error seemingly appears at random intervals depending on time of day and network load. Root Cause. This ...
Read more >Random read time out - Google Analytics Query - KNIME Forum
I´m recently running into issues when i query Google Analytics. Randomly i get read time outs while executing the Google Analytics Query node....
Read more >Random "Read Timed Out" error - Bukkit Forums
I run a Minecraft server for about 20-30 people, and lately it's been acting really weird. After a random amount of time (anywhere...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Retry sounds nice until we’re impacting your host adversely due to retries.
The assumption is that users who are running at high load are also probably handling a lot of traffic, and thus tuning their sample rate to something closer to 10% or 1%. At that point, we’re only sampling a subset anyway, which means that losing traces is more acceptable than using more resources, since you’re not seeing all the traces anyway.
Granted, we still collect durations for unsampled transactions, which means that dropped events will result in slight inaccuracies for overall metrics. But we still think this is a valid tradeoff to avoid impacting a system.
I’ll close this as, as far we can tell, the agent works as intended.