question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

How to parametrise the dataform schedule

See original GitHub issue

We are exploring dataform to integrate it with Airflow. We are not able to figure out options to pass the parameterized value to the schedule. The most generic use case could be like performing insert as select.

insert into new_table select * from source_dat where source_dat.created_at> ‘$create_At’ definitions/new_table.sqlx config { type: "table" } select * from source_dat

– We wanted to pass the value for create_at at scheduled time.

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:6

github_iconTop GitHub Comments

2reactions
mohammadskhancommented, Dec 16, 2020

@BenBirt Thanks for your response, the incremental could only satisfy the scenarios when source data is time-series and contiguous. In our case, the data is time-series but it is very much common the x+1 hours data could come before x hours data. Hence chances of missing intervals in the dest table will be high. Is there any way to avoid the same supporting below options? Backfilling as & when required Concurrent execution for different interval

1reaction
astorrscommented, Feb 12, 2021

At my company we’re also incredibly excited about Dataform and cannot wait to use it with Airflow. Similarly to @mohammadskhan we do generally implement DAGs that need to run parametrized queries. Dataform would be perfect to finally stop the sould crushing issues that come from running SQL through distributed files.

Is there are feature in consideration to parametrize runs? E.g. dataform run --dataform.projectConfig.vars.run_date="'2021-02-12'". Or just have placeholders in the dataform.json for that matter?

https://github.com/dataform-co/dataform/pull/1187 looks like it might be in the next release 👍

Read more comments on GitHub >

github_iconTop Results From Across the Web

Setting up a schedule - Documentation | Dataform
Setting up a schedule · Navigate to the Scheduling page by clicking on the hamburger menu in the top left hand corner. ·...
Read more >
Schedule executions with Cloud Composer | Dataform
To manage schedules of Dataform executions with Cloud Composer 2, you can use Dataform operators in Airflow DAGs. You can create an Airflow...
Read more >
Dataform Office Hours: Managing Environments - YouTube
At Dataform, we believe that analytics should follow software engineering best practices and therefore ... SSIS Parameters and Environments.
Read more >
Use parameters in queries, forms, and reports
Create a form that collects parameters · The ability to use data-type-specific controls, such as calendar controls for dates. · Persistence of the...
Read more >
Dataform: Change datasets with branches - Roel Peters
json, one can overwrite the settings from dataform.json, using the configOverride parameter. This is the environment named production, that I ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found