Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

RFC: Add existing Relational Database (Aurora Serverless) as a Data Source for an AppSync API

See original GitHub issue

Feature - ‘add-datasource’

This issue will primarily detail the design regarding adding an existing Relational Database as a data source for an AppSync API via the amplify-cli. In the future, we can extend the functionality beyond Aurora Serverless to other data sources.

We propose a subcommand ‘add-datasource’, under the amplify ‘api’ command, with the intended functionality:

Allow the customer to specify an existing Aurora Serverless cluster and corresponding Database within the cluster to use as a data source for an AppSync API
Perform an introspection on the Relational Tables and generate the GraphQL Schema for the API and relevant Resolvers based on the tables (and relationships between the tables, i.e. foreign-keys, etc.).
Output a CFN template with the AppSync resources needed to utilize the Aurora Serverless database

The proposed subcommand ‘add-datasource’ will execute a question walkthrough that will present the user with the following workflow:

fun@aws:~$ amplify api add-datasource
What kind of data source do you wish to support?
> Aurora Serverless (Relational Database)
Do you wish to import a database? (Y/n)
> Yes

Subsequent prompts will be used to collect the Aurora Cluster Identifier, Secrets Manager ARN (used to access the cluster), Database name, JDBC inputs, and other information regarding the data source.

Further information on the walkthrough questions:

Regarding: ‘What kind of data source do you wish to support?’
- This will display a selectable list of data sources, for now Aurora Serverless will be the only option in the list. In the future other data sources will be added to the list.
Regarding: ‘Do you wish to import a database?’
- The reason for this question is that in the future we wish to support the flow where the customer does not necessarily have a Aurora Data Source or wants to start with a new one

With the above information, we will have enough to perform the Database introspection and generate the CFN template with the AppSync Resources ready to utilize the Aurora Serverless Database.

An example to clarify the intended use-case and Introspection:

Imagine you have an existing database with two tables, ‘Post’ and ‘Author’, and wish to use it as data-source for AppSync.

Upon going through the workflow mentioned above, amplify will perform the introspection and automatically generate the GraphQL Schema for the data source as well as the VTL Resolvers. In addition to the basic resolvers based on the Types (i.e. CreatePost, GetPost, ListPosts, etc.), it will generate additional query resolvers that leverage SQL’s intended queryable nature.

Examples of the additional query resolvers would be:

GetPostsWithUpvotes, think:

SELECT * FROM Post WHERE upvotes > 0

GetPostsWithTitleLike, think:

SELECT * FROM Post WHERE title LIKE '%{resolverInput}%

GetAuthorsWithNameLike, think:

SELECT * FROM Author WHERE name LIKE '%{resolverInput}%

The selection of the fields will be based on non-primary id Ints and Strings (as the basic resolvers cover the primary id already).

As a result of performing the introspection, the command will be able to help users get started with AWS AppSync by allowing them to simply bring in their already existing data source and not need to worry about creating the GraphQL schema or writing resolvers in Velocity (VTL). The outputted CFN template will be stored in the ‘/backend/api/[apiname]/stacks’ directory as mentioned in #574. Therefore, the next amplify push (post a successful add-datasource command) will provision the AppSync resources in the cloud via CFN and the user will be ready to use AppSync with their existing data source.

Related Issues

#87, #520

Feedback on this proposal is highly encouraged!

Issue Analytics

State:
Created 5 years ago
Reactions:29
Comments:15 (3 by maintainers)

Top GitHub Comments

7reactions

olivoilcommented, Feb 28, 2019

This may be outside of the scope of this RFC, but it would be useful to plan out how the directives that currently work with dynamoDB data sources could be ported over to AS data sources (@model, @auth, @searchable, @connection, @versioned).

With an AS data source, I don’t expect the @model directive to create the tables as this is managed outside of cloud formation. But I would see more value in being able to write up specific @model directives rather than using the introspection feature. It would let me declare a model without first creating the database schema, and would let me filter down which tables to expose via AppSync endpoints. To support this, maybe the @model directive could take an argument to specify the data source it should connect to, and the table:

type Post @model(datasource: "datasource name", table: "database table name") {
    id: ID!
    title: String!
}

and prompt the user to specify a name for the datasource when running the add-datasource subcommand.

Support for the other directives would be useful regardless of the data source a model is connected to, and having directive parity between the two would make it easier to switch from a dynamoDB backed model to an AS one.

The @auth and @connection directives should also work together, as array of strings are not typically found in mysql schemas:

type Post
  @model(datasource: "datasource name", table: "posts")
  @auth(rules: [
      { allow: owner },
      { allow: owner, ownerField: "editors", mutations: [update] }
  ]) {
  id: ID!
  title: String!

  # expect the field `owner` to be the ID of a user, typically a foreign key constraint to the `users` table
  owner: User @connection

  # expect no `editors` field to exist in the `posts` table, but rather use the `post_editors` table
  editors: [PostEditor] @connection(name: "PostEditors")
}

type PostEditor @model(datasource: "datasource name", table: "post_editors", queries: null) {
  id: ID!
  post: Post! @connection(name: "PostEditors")
  editor: User! @connection(name: "UserEditors")
}

type User @model(datasource: "datasource name", table: "users") {
  id: ID!
  username: String!

  # expect no `posts` field to exist in the `users` table, but rather use the `post_editors` table
  posts: [PostEditor] @connection(name: "UserEditors")
}

7reactions

ghostcommented, Jan 25, 2019

It be more orthogonal to use

$amplify api add
? Please select from one of the below mentioned services (Use arrow keys)
> GraphQL
  REST
What kind of data source do you wish to support?
> Aurora Serverless (Relational Database)
  DynamoDB (Non Relational Database)

Top Results From Across the Web

Tutorial: Aurora Serverless - AWS AppSync

AWS AppSync provides a data source for executing SQL commands against Amazon Aurora Serverless clusters which have been enabled with a Data API....

API (GraphQL) - Relational Databases - AWS Amplify Docs

The following instruction show how to create an Amazon Aurora Serverless database, import this database as a GraphQL data source and test it....

How to use RDS as a Data Source for AppSync

Use an AWS-managed relational database with GraphQL.

Gary A. Stafford | Programmatic Ponderings | Page 6

Access to AWS Glue data sources is also available from within the Editor. ... Choose 'Amazon Athena' from the list of available Database...

2020

Amazon MSK adds support for Apache Kafka version 2.7.0 ... existing replication instance either through the Amazon Database Migration Console or the SDK....