Issue with CSV ingestion with non-default delimiter
See original GitHub issueThe code assumes that the delimiter for CSV in the reader config should also be used when specifying the header string in the ingestion spec. While this is (arguably) a valid assumption, it might be non intuitive for the user since these are two different configs in two different places.
Filing this issue for a better user experience, perhaps:
- If the CSV delimiter does not work for header String in job spec, try
comma
? - At the very least, error out notifying the user that the two are inconsistent.
Issue Analytics
- State:
- Created 2 years ago
- Comments:13 (13 by maintainers)
Top Results From Across the Web
6 Common CSV Import Errors and How to Fix Them - Flatfile
One of the most common CSV import errors is that the file is simply too large. That can be caused by too many...
Read more >Dealing with commas in a CSV file - Stack Overflow
First, get the customer to follow RFC-4180 and enclose in double quotes the fields that contain commas or newline characters. If you are...
Read more >Copying CSV files delimited by commas where commas are ...
The problem we're faced with is that commas within a given column, enclosed in double quotes, are being perceived as a column delimiter,...
Read more >CSV not parsed into columns despite Get Data delimiter set
My Excel does not parse CSV file correctly. The separator is comma (,). Even if I use the Get Data tool and set...
Read more >handling malformed CSV (unenclosed delimiters) in an ELT ...
My situation is that a source system provides data in a CSV file with unenclosed delimiters. example file illustrating the unenclosed ...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
@mayankshriv I think gracefully erroring out if they don’t match is the least we can do here. Shall we start with that?
@suddendust: In the ingestion job spec yaml file, we provide
delimiter
,header
as two configs. It is implicitly assumed thatheader
strings will also be separated bydelimiter
.