RFC: (Engineering) Introduce A Reflectable Deserialization Helper

Current State

We have several places where we validate/deserialize JSON. Validation can fail with error messages, defaults need to be inserted and a JSON schema is also usually provided.

Here are some examples of how we do it currently:

Problem

The current style of doing JSON deserialization/validation is very verbose and inconsistent. There is no guarantee that the JSON-Schema matches the validation (though sometimes this is done on purpose).

Proposed Solution

I suggest to explore introducing a generic, descriptive deserialization helper that allows to specify the typescript type, the validation logic, the default value and human readable descriptions at a single place. The JSON schema can be derived from that.

The outcome should be an internal helper (not an external library) that is similar to io-ts or my own library hediet/semantic-json (example).

In case of editor.minimap.side, the code should look like this:

const editorMinimap = objectDeserializer({
    properties: {
        ...
        side: property(enumDeserializer(['left', 'right']), {
            defaultValue: 'left',
            description: nls.localize('minimap.side', "Controls the side where to render the minimap.")
        })
        ...
    }
});

const editorSettings = objectDeserializer({
    properties: {
        ...
        minimap: editorMinimap
        ...
    }
});

assert.deepStrictEqual(flatten(objectDeserializer({ properties: { editor: editorSettings } })).getJSONSchema(), {
    ...
    "editor.minimap.side": {
        type: 'string',
        enum: ['left', 'right'],
        default: 'left',
        description: "Controls the side where to render the minimap."
        ...
    }
    ...
});

People interested in this might be: @alexdima @connor4312 @aeschli

I’m open to explore this idea further if there is interest in this.

Issue Analytics

State:
Created 2 years ago
Comments:6 (6 by maintainers)

Top GitHub Comments

1reaction

alexdimacommented, Sep 27, 2021

In general, I think we should explore code generation. All our settings could be described in one or more JSON files (possibly using JSON schema or JSON schema++) and then type definitions (with nice JSDoc comments) or efficient type validation code could be generated. But this needs to be discussed in the team and agreed team-wide.

From the editor options point of view, I am very interested in seeing a clear proposal that addresses the following:

the TS type must be available with JSDoc comments in the monaco.d.ts because that is what editor embedders end up compiling against. We also sometimes instantiate editors from our codebase (e.g. the repl editor) and we need type validation there.
the JSDoc comment must contain inside of it the default value to inform developers about the default value for an option.
the editor options must also be made available via JSON schema to the settings service/registry. Here a default value must also be given.
settings passed in to the editor from the configuration service must be validated
options passed in to the editor from API users (in the embedder case) must be validated
options must be computed efficiently in case of multiple calls to updateOptions
calling updateOptions should end up creating an option change event that is queryable for the option that has changed.
editor options must be quick to be read, as lots of code inside the view / viewModel / layout / etc need to read options.

0reactions

connor4312commented, Sep 27, 2021

If we go with a code generation approach, which would solve the .d.ts problem, I would vote to simply use JSON schema. I’ve actually done this when building service APIs before and like it a lot – use the JSON schema as the point of truth and way to validate input, and derive types from it. Plus, there’s lots of existing tooling around JSON schemas.

Also, while this design focuses on settings, something adjacent to this and possibly solvable with the same toolchain is extension host communication. Currently communication to and from the extension host is done mostly though JSON serialization with small bits of special handling. As I referenced in https://github.com/microsoft/vscode/pull/133200, the abstractions we currently have here are leaky and there can be benefits, even without straying from JSON, to knowing the shape of serialized data ahead of time. If we use a system that provides runtime information for the protocol types, there’s opportunity for improvement.

Top Results From Across the Web

A thorough team guide to RFCs - LeadDev

RFCs originated as documents meant to facilitate technical decision-making related to the Internet and its infrastructure. To put it simply, ...

draft-ietf-ppm-dap-03 - Distributed Aggregation Protocol for ...

RFC stream, Internet Engineering Task Force (IETF) ... Introduction This document describes a distributed aggregation protocol for privacy preserving ...

XSL Transformations (XSLT) Version 3.0 - W3C

Abstract. This specification defines the syntax and semantics of XSLT 3.0, a language designed primarily for transforming XML documents into ...

Scaling Engineering Teams via RFCs: Writing Things Down

The scale on how thorough the documentation should be, who the docs are pushed to, how strictly it is reviewed or how standardized...

to Software Engineering - OAPEN Library

1 Introduction. Large-scale software design has shifted from the classic monolithic architecture to one where applications are structured in ...