Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

The definitive replication and authorization guide

See original GitHub issue

I think we need to hash out and clarify the replication and authorization processes a bit. I have been struggling with this for many days now and based on issues here I’m not alone, so I’m hoping we can use this issue to clear things up.

After reading the documentation, reading the tests, going over the code and issues, I still can’t get this right, so I think I’m not far off if I say that the whole replication process is not intuitive to implement. There are so many details to grok so let’s go over it one step at a time. The biggest issue seems to be that it’s not clear how the hyperdbs need to be set up for replication to work.

Here are two scenarios that I want to figure out, but it seems I can’t.

Scenario 1: I have hyperdb 1 that I want to read & write with hyperdb 2

My logic:

Create 2nd hyperdb with 1st hyperdb’s local.key
Authorize 2nd hyperdb to write 1st hyperdb
Create replicate() streams from both hyperdbs
Create socket connection between machines and do stream1st.pipe(socket) and socket.pipe(stream_2nd)

That’s the structure I got from the docs etc. but it doesn’t seem to work. There’s the issue with “first hypercore must be the same” which I guess means I have to create 2nd hyperdb with hyperdb(storage, hyperdb_1st.local.key). With that, I can see the connection happening, but nothing gets replicated. What steps are missing here?

Scenario 2: I want to replicate another hyperdb without writing

My logic:

Create own hyperdb with remote hyperdb’s local.key?
Create replicate({ live: true }) stream for my hyperdb
Connect to remote hyperdb’s socket
Do socket.pipe(my_hyperdb_stream)

With this I don’t get any errors, I see data going over the socket, and I can see the data in the remote hyperdb, but nothing shows up in my hyperdb. It’s a bit like the replication doesn’t start for some reason.

Issue Analytics

State:
Created 5 years ago
Comments:8

Top GitHub Comments

6reactions

lachenmayercommented, Nov 8, 2018

Hey folks, I wrote a pretty detailed guide about authorization & replication in hyperdb. Hope it’s useful!

3reactions

pfrazeecommented, Oct 13, 2018

@0fork Your broader point about needing a guide is on point. I debugged your script and was only able to do so because I know about some gotchas.

I made a few changes but there were only two that mattered:

I changed the use of @hyperswarm/network to only have one side announce and the other side lookup. That’s because hyperswarm doesn’t yet have connection deduplication builtin, and so you were getting more connections than you need. We’re either going to have dedup builtin to the code, or we’ll put that pattern in the readme once we’ve got one written.
You were providing a public key to both hyperdb instances, but that will only work if you already have the private key to match. Not supplying the public key to the first instance solved that – if the archive already exists it’ll load the key from disk, and if it doesnt already exist it’ll mint a new keypair.

Here’s the fixed snippet:

const pump = require("pump")
const hyperdb = require("hyperdb")
const network = require("@hyperswarm/network")
const cr = require("crypto")

// this is meant to "simulate" two separate servers so two networks
const net1 = network()
const net2 = network()

const db1 = hyperdb(`./_test1`, { valueEncoding: "utf-8" })
let db2
db1.ready(() => {
  console.log("hyper1 created")

  console.log('swarming')
  const $key = db1.key
  const id = cr.createHash("sha256").update($key).digest()
  net1.discovery.holepunchable((err, yes) => {
    if (err || !yes) {
      console.log("no hole")
      process.exit()
    }
  })

  net1.join(id, {
    lookup: false,
    announce: true,
  })
  net1.on("connection", (socket) => {
    console.log('net1 got connection')
    // this is suppose to "push" so piping from
    // rep1 to socket (rep2) to rep1
    var rep = db1.replicate({ live: true })
    pump(rep, socket, rep, function() {
      console.log("socket1 pipe end")
    })
    socket.on("data", (data) => {
      console.log("socket1 got data", data)
    })
  })
  db2 = hyperdb(`./_test2`, $key, { valueEncoding: "utf-8" })
  db2.ready(() => {
    console.log("hyper2 created")
    net2.join(id, {
      lookup: true,
      announce: false,
    })
    net2.on("connection", (socket) => {
      console.log('net2 got connection')
      // this is suppose to replicate so piping from
      // socket (rep1) to rep2 to socket (rep1)
      var rep = db2.replicate({ live: true })
      pump(rep, socket, rep, function() {
        console.log("socket2 pipe end")
      })
      socket.on("data", (data) => {
        console.log("socket2 got data", data)
      })
    })
    db2.watch("/test", (err, data) => {
      console.log("socket2 /test", data)
    })
  })
})

setInterval(function() {
  db1.put("/test", "test", () => {
    db1.list((err, list) => {
      console.log("1", list)
    })
    db2.list((err, list) => {
      console.log("2", list)
    })
  })
}, 3000)

Top Results From Across the Web

MongoDB: The Definitive Guide, 3rd Edition [Book] - O'Reilly

Updated for MongoDB 4.2, the third edition of this authoritative and accessible guide shows you the advantages of using document-oriented databases. You'll ...

24. Replication and High Availability — AMPS User Guide 5.2 ...

This chapter discusses the support that AMPS provides for replication, and how AMPS features help to build systems that provide high availability. Overview...

Dell EMC Avamar Administration Guide

Replication authentication. ... Avamar and Data Domain System Integration Guide ... considered definitive or exhaustive.

Amazon Adds Replication to EFS | Petri IT Knowledgebase

... check out What is AWS (Amazon Web Services)? – The Ultimate Guide on Petri. ... IAM authorization & access points were added...

OSIsoft® PI System Historian Replication Use Cases

This document outlines 8 use cases for the implementation of Owl data diode cybersecurity to defend a variety of industrial and critical infrastructure...