question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Skip non-valid data in mr

See original GitHub issue

How can a mr job skip non-valid json document in input data? An exception is thrown:

Java.lang.IllegalStateException: Found unrecoverable error [Bad Request(400) - MapperParsingException[failed to parse [failField]]; nested: JsonParseException[Numeric value (18446744073594037153) out of range of long (-9223372036854775808 - 9223372036854775807)

It is catched in mapper and logged.

I’m using:

es.input.json=true
es.batch.write.retry.policy=none

But this exception is appearing over an over again and map task gets really slow (duration increases from minutes to hours).

Thanks for any advise…

Issue Analytics

  • State:closed
  • Created 10 years ago
  • Comments:5 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
jbaieracommented, Feb 5, 2018

Visit from the Backlog Fairy: With the introduction of bulk write failure handlers in 6.2.0 we are very much looking to target adding serialization failure handlers for both reading data and writing data.

0reactions
jbaieracommented, Mar 28, 2018

Closing this in favor of #1128

Read more comments on GitHub >

github_iconTop Results From Across the Web

how to ignore / skip invalid data row in LOAD DATA INFILE ...
You cant skip those rows, if you are using load in file function. Do one thing, initially load all data into a temporary...
Read more >
Skip rows when Charting data with invalid/missing data - Mr. Excel
I am trying to chart project milestone dates off of a status spreadsheet. A simplified example follows: PROJECT1 10/01/2004 PROJECT2 1/1/2004 PROJECT3...
Read more >
MR-DT Displays Invalid Data - AccessIt!
MR -DT displays invalid data when entering a command and will not execute the command. Cause A. An alarm zone has no points...
Read more >
DATA VALIDATION ANNOYANCES - Excel Annoyances [Book]
The secret is using data validation. Click a cell (or a group of selected cells), and then turn on the validation feature by...
Read more >
myrepos/mr at master - GitHub
Another way to use skip is for a lazy checkout. This makes mr skip. operating on a repo unless it already exists. To...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found