Missing classification-parameter when creating table in Glue
Hey. I haven’t reported bugs before, so I hope I’m doing things correctly here.
When creating a Glue table using aws_cdk.aws_glue.Table with data_format = _glue.DataFormat.JSON, the classification is set to Unknown, and querying the table fails.
Reproduction Steps
# Inside a Stack's __init__; s3_bucket and account_id are defined elsewhere.
from aws_cdk import aws_glue as _glue

glue_table = _glue.Table(
    self, 'GlueTable',
    database=_glue.Database.from_database_arn(
        self, 'GlueDatabase',
        'arn:aws:glue:region:{}:database/abc'.format(account_id),
    ),
    table_name='def_ghi',
    data_format=_glue.DataFormat.JSON,
    bucket=s3_bucket,
    s3_prefix='prefix/',
)
If I manually add “classification” with the value “json” to the table properties after deploying with CDK, the query works fine.
Error Log
Amazon Invalid operation: Invalid DataCatalog response for external table “abc”.“def_ghi”: Cannot deserialize table. Missing mandatory field: Parameters in response from external catalog. ;
Environment
- CLI Version:
- Framework Version: 1.37.0
- OS: Windows 10
- Language: Python
This is 🐛 Bug Report
Top GitHub Comments
This is still an issue. Is there any update on this?
To get around this I have added a post-deploy code snippet using boto3 to update the table, like this:
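The commenter's own snippet is not available here. What follows is only a minimal sketch of what such a post-deploy step might look like, not their code. It assumes `glue_client` is a `boto3.client("glue")` instance; the helper names `build_table_input` and `set_classification` are made up for illustration. `get_table` returns read-only fields that `update_table` does not accept in `TableInput`, so the sketch copies only the keys `TableInput` allows.

```python
# Keys accepted by Glue's TableInput structure. get_table returns extra
# read-only fields (DatabaseName, CreateTime, ...) that update_table rejects.
TABLE_INPUT_KEYS = {
    "Name", "Description", "Owner", "LastAccessTime", "LastAnalyzedTime",
    "Retention", "StorageDescriptor", "PartitionKeys", "ViewOriginalText",
    "ViewExpandedText", "TableType", "Parameters", "TargetTable",
}


def build_table_input(table, classification):
    """Copy the updatable fields of `table` and set Parameters.classification."""
    table_input = {k: v for k, v in table.items() if k in TABLE_INPUT_KEYS}
    # Preserve any existing table parameters and add/overwrite classification.
    parameters = dict(table_input.get("Parameters") or {})
    parameters["classification"] = classification
    table_input["Parameters"] = parameters
    return table_input


def set_classification(glue_client, database_name, table_name, classification="json"):
    """Fetch the table, add the classification parameter, and write it back.

    `glue_client` is assumed to be a boto3 Glue client (boto3.client("glue")).
    """
    table = glue_client.get_table(DatabaseName=database_name, Name=table_name)["Table"]
    table_input = build_table_input(table, classification)
    glue_client.update_table(DatabaseName=database_name, TableInput=table_input)
    return table_input
```

Run after `cdk deploy`, this could be called as `set_classification(boto3.client("glue"), "abc", "def_ghi")`, using the database and table names from the report. Passing the client in as an argument keeps the function easy to exercise with a stub instead of a real AWS account.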