question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

I have read the documentation and did not find any place talking about Server Queue Size. As far as I understood from the TRTIS Architecture, incoming inference requests are queued by Model Schedulers and when Execution Context is available, the request is passed for inference. I would like to know the Server Queue Size or if possible how to set it. This would help to control the incoming request traffic.

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Comments:8 (4 by maintainers)

github_iconTop GitHub Comments

2reactions
deadeyegoodwincommented, Oct 28, 2020

We will consider queue depth as an enhancement to the statistics API. Note that the statistics already report average time that requests spend in the queue which is likely a good substitute.

1reaction
royinxcommented, Sep 3, 2021

is there any update?

Actually what we want is real time, pending (data received and scheduled ) request queue size for auto scaling. According to the last reply, average time of requests spend is cumulative , developer cannot obtain the last 15 mins queue size for auto-scaling. is that the latency is the only way to detect the request loads?

Read more comments on GitHub >

github_iconTop Results From Across the Web

Activision Support on Twitter: "We're aware some players are ...
We're aware some players are experiencing long wait times in the server queue for #WARZONE. Stay tuned for updates. ... Are you aware...
Read more >
Identifying Server Queue Information - Oracle Help Center
Server queue information controls the creation and access of server message queues. On a BEA Tuxedo system, you can create Multiple Server, Single...
Read more >
Queue servers and queues - Product Documentation
Queue servers are programs used to control the background processing of print and processing tasks (Jobs). These programs are Progress ...
Read more >
6 Ways To Fix Warzone Server Queue - YouTube
Some players have been experiencing Warzone " Server Queue " and stating on screen: “Servers are undergoing temporary maintenance”.
Read more >
How to fix the Call of Duty: Warzone Server Queue bug
The queues usually last around five to 10 minutes. If you've been sitting in the queue longer than that, there may be an...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found