It is possible to kill the raylet by writing some bytes to it over TCP.

Start Ray with, for example,

ray start --head --node-manager-port=12345

Then, in a separate process (or on a separate machine, if you change the host accordingly), run

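import socket

# Open a plain TCP connection to the raylet's node manager port.
s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
s.connect(('localhost', 12345))

# Send arbitrary bytes that are not a valid Ray protocol message.
s.send(1000 * b'asdf')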

Then if you look at the raylet logs under /tmp/ray/..., you will see an error like

F0207 21:28:52.615898 2396021632 client_connection.cc:245]  Check failed: read_version_ == RayConfig::instance().ray_protocol_version() 
*** Check failure stack trace: ***
*** Aborted at 1549603732 (unix time) try "date -d @1549603732" if you are using GNU date ***
PC: @                0x0 (unknown)
*** SIGABRT (@0x7fff5633fb66) received by PID 30417 (TID 0x7fff8ed06380) stack trace: ***
    @     0x7fff564fdf5a _sigtramp
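
Besides reading the logs, one way to confirm that the raylet has gone down is to check whether the node manager port still accepts TCP connections. The helper below is only a sketch: the function name `port_accepts_connections` is made up for illustration, and it assumes the port 12345 from the `ray start` command above. It says nothing about why a port is closed, only whether a connection attempt succeeds.

```python
import socket


def port_accepts_connections(host, port, timeout=1.0):
    """Return True if a TCP connection to (host, port) succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the host and connects; the context
        # manager closes the socket again immediately.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False
```

Calling `port_accepts_connections('localhost', 12345)` before and after sending the bad bytes should show the port going from reachable to unreachable if the raylet aborted.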

cc @pcmoritz @richardliaw @hartikainen

Issue Analytics

  • State: closed
  • Created: 5 years ago
  • Reactions: 2
  • Comments: 5 (3 by maintainers)

Top GitHub Comments

raulchen commented, Feb 8, 2019 (1 reaction)

This is a duplicate of https://github.com/ray-project/ray/issues/3915. We’ve actually seen this error in prod, but we didn’t figure out who was sending the bad bytes. @zhijunfu fixed this by requiring all clients to send a connect_client message before sending other messages. Then we can ignore this error if it’s from an unknown client. Do you think the fix makes sense? We can open a PR for this.
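
The idea in the comment above can be illustrated roughly as follows. This is not Ray's implementation: the `Connection` class, the `MessageType` enum, and the `handle` method are hypothetical names invented for this sketch, and treating a version mismatch from an already registered client as a hard error is an assumption based on the comment's wording. The point is only that a connection that has not yet sent a valid connect_client message is dropped quietly instead of crashing the process.

```python
from enum import Enum, auto


class MessageType(Enum):
    CONNECT_CLIENT = auto()  # hypothetical stand-in for the connect_client message
    SUBMIT_TASK = auto()     # hypothetical example of any later message


class Connection:
    """Tracks whether a peer has completed the initial handshake."""

    def __init__(self):
        self.registered = False
        self.closed = False

    def handle(self, msg_type, version, expected_version):
        if self.closed:
            return "dropped"
        if not self.registered:
            # Unknown client: anything other than a well-formed connect
            # message closes this one connection and is otherwise ignored.
            if msg_type is not MessageType.CONNECT_CLIENT or version != expected_version:
                self.closed = True
                return "dropped"
            self.registered = True
            return "registered"
        # Registered client: a bad version is still treated as a real error
        # (an assumption made for this sketch).
        if version != expected_version:
            raise RuntimeError("protocol version mismatch from registered client")
        return "processed"
```

With this gating, the random bytes from the reproduction script would arrive on an unregistered connection and be dropped, while clients that have registered keep the strict version check.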

robertnishihara commented, Feb 8, 2019 (0 reactions)

Ah thanks @raulchen!
