xhr poll error: using cluster #300

konieshadow · 2014-12-29T03:04:04Z

server.js

var cluster = require('cluster')
var numCPUs = require('os').cpus().length
var engine = require('engine.io')

if (cluster.isMaster) {
  for (var i = 0; i < numCPUs; i++) {
    cluster.fork()
  }
}
else {
  var server = engine.listen(1337, function () {
    console.log('server bound')
  })

  server.on('connection', function (socket) {
    socket.on('error', function (err) {
      console.error(err)
    })

    socket.on('message', function (data) {
      console.log(data)
    })

    socket.on('close', function (reason) {
      console.log(reason)
      socket.close()
    })

    socket.send('hello from ' + '\r\n')
  })
}

client.js

var eio = require('engine.io-client')

function request() {
  setTimeout(function () {
    var socket = eio('ws://localhost:1337')

    socket.on('error', function (err) {
      console.error(err)
    })

    socket.on('open', function () {
      socket.on('message', function (data){
        console.log(data)
        socket.close()
      })
    })

    request()
  }, 10)
}

request()

engine.io version: 1.4.3
engine.io-client version: 1.4.3
node.js version: 0.10.34
os: Windows 8.1 64-bit
c++ compiler: Microsoft Visual Studio Community 2013 (Visual C++ 2013)

defunctzombie · 2015-01-10T18:41:51Z

Can confirm this issue. What is even stranger is that with a cluster size of 2 there is no problem, but with 3 or more the problem starts to happen.

defunctzombie · 2015-01-10T18:52:00Z

Interesting find, but the reason this is happening is the same reason we require sticky sessions on load balancers when running engine.io servers.

The xhr poll error occurs because successive poll requests are sent to different cluster backends. Each cluster backend is a separate Node.js process and does not share memory with the other processes. The session is established by the first request (which assigns the session id), but later requests get routed to a different process that does not know about that session id.
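To make the failure mode concrete, here is a minimal simulation, not engine.io code. It assumes plain round-robin dispatch for simplicity (the real distribution depends on the platform and the cluster scheduling policy), and the names `createWorker`, `createRoundRobin`, and the 400 status are illustrative assumptions. The error body matches the one reported in this thread.

```javascript
// Simplified model: each worker keeps its own in-memory session table,
// the way separate Node.js processes would. All names are hypothetical.
function createWorker(id) {
  const sessions = new Map(); // sid -> session data, private to this worker
  return {
    id,
    // First request: create a session and hand back its id.
    handshake() {
      const sid = 'sid-' + id + '-' + (sessions.size + 1);
      sessions.set(sid, { createdBy: id });
      return sid;
    },
    // Later polling request: only succeeds if this worker created the sid.
    poll(sid) {
      if (!sessions.has(sid)) {
        return { status: 400, body: { code: 1, message: 'Session ID unknown' } };
      }
      return { status: 200, body: 'ok' };
    },
  };
}

// Round-robin dispatch with no stickiness (an assumption for this sketch).
function createRoundRobin(workers) {
  let next = 0;
  return function pick() {
    const worker = workers[next];
    next = (next + 1) % workers.length;
    return worker;
  };
}

const workers = [createWorker(1), createWorker(2), createWorker(3)];
const pick = createRoundRobin(workers);

const sid = pick().handshake();     // handled by worker 1
const firstPoll = pick().poll(sid); // handled by worker 2, which never saw sid
console.log(firstPoll.status, JSON.stringify(firstPoll.body));
```

Sending the same poll to the worker that performed the handshake (`workers[0].poll(sid)`) succeeds, which is exactly what sticky sessions guarantee.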

Further, the actual error from the response is being masked by the 'xhr poll error' hardcoded string. Upon inspecting the responseText, the following message is shown:

{"code":1,"message":"Session ID unknown"}

This is an amusing way to expose the fact that we require sticky sessions so that requests can be routed to the correct backend that is aware of active session ids.

If you want to use cluster, you will need an adapter on top of the engine.io server that shares session ids and session data between workers. Alternatively, avoid cluster and run multiple separate processes behind a load balancer that supports sticky sessions.

I think we should update our README/docs/guide to mention that cluster should be avoided due to this limitation. We should also pass along the response text error so that debugging this is easier in the future.
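As a rough illustration of passing the response text along, the sketch below builds an error that keeps the server's explanation. `describePollError` is a hypothetical helper, not part of engine.io-client, and the 400 status in the usage line is an assumption. Only the JSON body is taken from this thread.

```javascript
// Hypothetical helper, not part of engine.io-client: build an Error that keeps
// the server's explanation instead of only the generic 'xhr poll error' text.
function describePollError(status, responseText) {
  const err = new Error('xhr poll error');
  err.status = status;
  try {
    const body = JSON.parse(responseText);
    if (body && typeof body.message === 'string') {
      err.message = 'xhr poll error: ' + body.message;
      err.code = body.code;
    }
  } catch (e) {
    // Response was not JSON; keep the generic message and attach the raw text.
    err.responseText = responseText;
  }
  return err;
}

const err = describePollError(400, '{"code":1,"message":"Session ID unknown"}');
console.log(err.message); // xhr poll error: Session ID unknown
```

A non-JSON body (for example an HTML error page from a proxy) falls back to the generic message, so existing handlers that match on 'xhr poll error' would keep working.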

defunctzombie · 2015-01-10T19:24:36Z

Additional references: https://github.com/indutny/sticky-session

(though it may not work 100%, it is a good starting point for an engine.io-cluster-support module)

neemah · 2015-02-11T13:50:21Z

sticky-session does not help if the project runs on Heroku with several dynos. This has become a show-stopper for horizontal scaling in our app :(

What happens is:

1. The client connects to the website (let's say backend no. 1 handles this connection). engine.io saves the socket id in a local variable.
2. During the protocol upgrade, the client reconnects to the website (backend no. 2 handles this connection). engine.io looks up the socket id of the second request in its local client hash and fails (server.js: Server.verify()) with an UNKNOWN_SID error.

As far as I can tell, this is the cause of the problem. Any suggestions on how to handle this would be very helpful.

Thanks in advance.

3rd-Eden · 2015-02-12T14:22:13Z

@neemah Just get off Heroku and use a hosting provider that actually supports real-time applications (and has a load balancer that uses sticky sessions).

3rd-Eden · 2015-02-12T14:23:35Z

@defunctzombie sticky-session is seriously flawed because it does the sticky load balancing based on the incoming IP address. When you run it behind another load balancer, every connection carries the load balancer's IP, so all connections go to one single node process.
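The effect is easy to see in a small model of IP-based stickiness. This is a sketch under assumptions, not the sticky-session source: `hashIp` and `pickWorker` are hypothetical names, the hash is a simple string hash chosen for illustration, and the addresses are documentation and private-range examples.

```javascript
// Hypothetical sketch of IP-based stickiness; not the sticky-session source.
function hashIp(ip) {
  // Simple deterministic string hash (assumed here for illustration only).
  let hash = 0;
  for (let i = 0; i < ip.length; i++) {
    hash = (hash * 31 + ip.charCodeAt(i)) >>> 0;
  }
  return hash;
}

// Map a remote address to a worker index in the range [0, workerCount).
function pickWorker(remoteAddress, workerCount) {
  return hashIp(remoteAddress) % workerCount;
}

// Direct clients: different addresses can land on different workers.
const direct = ['203.0.113.7', '198.51.100.23', '192.0.2.45'].map(
  (ip) => pickWorker(ip, 4)
);

// Behind a proxy: every connection arrives from the proxy's own address,
// so every connection hashes to the same worker.
const proxied = ['10.0.0.5', '10.0.0.5', '10.0.0.5'].map(
  (ip) => pickWorker(ip, 4)
);

console.log(direct, proxied);
```

Whatever hash is used, identical input addresses always produce the same worker index, so the three workers that the proxied traffic never reaches stay idle.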

defunctzombie · 2015-02-12T17:42:52Z

@3rd-Eden yep, that is why we don't recommend it outright

defunctzombie · 2015-02-12T17:43:57Z

@3rd-Eden there are problems with using Amazon as well, since their ELB doesn't support HTTP 1.1, so you have to pick between having websockets (TCP load balancing) or polling (HTTP with sticky sessions).

neemah · 2015-02-13T10:54:58Z

@3rd-Eden I'd be very pleased if you could suggest one that handles sticky sessions.

3rd-Eden · 2015-02-13T11:32:09Z

HAProxy, nginx, http-proxy(node) and many others.

defjamuk · 2015-05-06T08:47:16Z

Is there any reason we need sticky sessions? It's an anti-pattern. I would like to store the session information in a distributed database such as Cassandra. If someone could point me in the right direction, I would be willing to develop a module to do this with a Cassandra data store. It would help our application scale horizontally on AWS.

wzrdtales · 2016-02-10T10:04:03Z

@3rd-Eden That is why I built https://github.com/wzrdtales/socket-io-sticky-session, which hashes on layer 4 information instead. With layer 4 information it is now possible to balance in a somewhat more controlled way, but I would also prefer to use something other than sticky sessions. The best thing would be to just balance clients without caring too much about the handshake.

Thus the best option would be for engine.io to finally support a handshake that works across servers, for example in combination with an intermediate store such as Redis.

darrachequesne · 2021-01-25T14:41:38Z

For future readers: I think it is implemented this way because without sticky sessions you would have something like this:

Since the event handlers are registered upon connection (in the current implementation, at least), any subsequent HTTP request should be forwarded to the 1st instance, but that wouldn't scale well, would it? Same with outgoing packets, if you call socket.send() on the 1st instance and the HTTP long-polling connection is established on the 3rd instance.

Besides, we have published @socket.io/sticky in order to use Socket.IO within a cluster. Unlike sticky-session and socketio-sticky-session, it is based on the sid query parameter.

Sample usage:

const cluster = require("cluster");
const http = require("http");
const { Server } = require("socket.io");
const redisAdapter = require("socket.io-redis");
const numCPUs = require("os").cpus().length;
const { setupMaster, setupWorker } = require("@socket.io/sticky");

if (cluster.isMaster) {
  console.log(`Master ${process.pid} is running`);

  const httpServer = http.createServer();
  setupMaster(httpServer, {
    loadBalancingMethod: "least-connection", // either "random", "round-robin" or "least-connection"
  });
  httpServer.listen(3000);

  for (let i = 0; i < numCPUs; i++) {
    cluster.fork();
  }

  cluster.on("exit", (worker) => {
    console.log(`Worker ${worker.process.pid} died`);
    cluster.fork();
  });
} else {
  console.log(`Worker ${process.pid} started`);

  const httpServer = http.createServer();
  const io = new Server(httpServer);
  io.adapter(redisAdapter({ host: "localhost", port: 6379 }));
  setupWorker(io);

  io.on("connection", (socket) => {
    /* ... */
  });
}

The documentation was updated accordingly: https://socket.io/docs/v3/using-multiple-nodes/#Using-Node-JS-Cluster
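For readers wondering what routing on the sid query parameter could look like, here is a simplified model. It is not the @socket.io/sticky implementation: `createSidRouter`, `route`, and `register` are hypothetical names, the request paths are made up for the example, and new sessions are assumed to be spread round-robin.

```javascript
// Hypothetical model of sid-based routing; not the @socket.io/sticky source.
const { URL } = require('url');

function createSidRouter(workerCount) {
  const sidToWorker = new Map(); // sid -> worker index
  let next = 0;

  return {
    // Pick a worker for a request URL such as '/engine.io/?transport=polling&sid=abc'.
    route(requestUrl) {
      const sid = new URL(requestUrl, 'http://localhost').searchParams.get('sid');
      if (sid && sidToWorker.has(sid)) {
        return sidToWorker.get(sid); // known session: always the same worker
      }
      const worker = next; // new session (no known sid): plain round-robin
      next = (next + 1) % workerCount;
      return worker;
    },
    // Called once the chosen worker has completed the handshake and issued a sid.
    register(sid, worker) {
      sidToWorker.set(sid, worker);
    },
  };
}

const router = createSidRouter(3);
const first = router.route('/engine.io/?EIO=3&transport=polling');
router.register('abc123', first);
const later = [1, 2, 3].map(() =>
  router.route('/engine.io/?EIO=3&transport=polling&sid=abc123')
);
console.log(first, later);
```

Because the decision depends on the session id rather than the client address, this kind of routing keeps working when all traffic arrives through a single upstream proxy.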

defunctzombie mentioned this issue Jan 10, 2015

CORS pre-flight breaks socket.io behind load balancer #279

Closed

darrachequesne pushed a commit that referenced this issue May 8, 2020

Merge pull request #300 from pyhrus/fix_jsonp_newline_escape

edb4c55

JSONP transport fails when sending JSON stringified message

darrachequesne closed this as completed Jan 25, 2021
