question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Questions & Requests?

See original GitHub issue
  1. eval_relations(gt=[ground_truth_relations], res=[your_relations], cmp_blank=True)
  • As the objective is to find the merged relations of the neighboring cells, isn’t the blank relation comparison a wrong evaluation metric?
  1. rel_gen.py this
  • That assumes every cell has unique text and case sensitive. Considering the fact this is only to generate relations for comparisons against the prediction. This may result in false negatives?
  1. Evaluation stats & trained models
  • Can we expect to get the evaluation stats & trained models getting published?

Issue Analytics

  • State:open
  • Created 3 years ago
  • Comments:27 (9 by maintainers)

github_iconTop GitHub Comments

6reactions
Darenarcommented, Feb 4, 2021

Hi, everyone. For those who also struggles with the identification of boundary boxes in the pdf - I think I’ve got a solution. First of all, to read PDF correctly through python, you have to be sure that the size of loaded PDF is exactly the same as the original one. For example, I’ve been loading pages from pdf as images using pdf2image convert_from_path function. As the default, it uses parameter dpi as 200, which is appeared to be not correct. The thing you need to do is to identify the correct shape of the PDF, using for example PyPDF2 from https://stackoverflow.com/questions/6230752/extracting-page-sizes-from-pdf-in-python , and then you could have used convert_from_path with parameter ```size`` specifying the correct shape of the pdf.

Another vital point here is that coordinates are different from those you wish to apply to your PIL.Image object, for example. The thing you have to do is too adjust your y1 and y2 by y1 = PAGE_HEIGHT-y2 and y2 = PAGE_HEIGHT-y1. It comes from the different measures of Y coordinates in python and in PDF itself.

I hope it will be handy to someone, cause I’ve struggled with it for almost the whole day!

2reactions
kbrajwanicommented, Nov 23, 2020

@rmporsch @akshowhini @abhyantrika can you help me to inference the trained model. I have only image from which contains the table. so how can i create the other required files and get the structure of table from image.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Use Indirect Questions to Make Requests and Offer Subtle ...
Learn how to make requests and offer subtle suggestions tactfully and politely with indirect questions, also known as embedded questions.
Read more >
Ways of asking questions and making requests - thesaurus
Comprehensive list of synonyms for ways of asking questions and making requests, by Macmillan Dictionary and Thesaurus.
Read more >
How to ask the right customer feedback questions - Canny Blog
With Canny, you can respond directly to feedback, and ask follow-up questions. Customers who submitted the request will be notified, and they'll be...
Read more >
To Use or Not Use the "?" in Requests - Merriam-Webster
You also probably noticed the terminal periods where one might expect a question mark, and (perhaps after an initial bolt of worry that...
Read more >
Responding To The Other Side's Requests For Information
Learn what to do if you have received written discovery requests from the other ... requests to produce documents, or to answer written...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found