reading order representation (XML order vs index)
See original GitHub issueThe reading order detection capabilities of eynollah look pretty amazing AFAICS – if viewed through the glasses of PageViewer. But it is noteworthy that the actual representation for PAGE-XML does not correspond to that schema’s documentation regarding @index
.
It surprisingly turns out that PageViewer gets it wrong too. See here for full report.
So IMO eyhollah needs to actually inverse its representation: the (currently correct) XML ordering needs to also become the (currently broken) @index
ordering.
Issue Analytics
- State:
- Created 3 years ago
- Comments:5 (5 by maintainers)
Top Results From Across the Web
In XML, is order important? - Stack Overflow
Order of elements is significant in XML, so in your example the two documents are different. Attribute order is not significant, though.
Read more >XML Indexes (SQL Server) - Microsoft Learn
The primary XML index is a shredded and persisted representation of the XML BLOBs in the xml data type column. For each XML...
Read more >XML Data Using Extended Order and Path Index
ABSTRACT. The eXtensible Markup Language (XML) is becoming a new standard for information representation and exchange over the Internet. How to index XML...
Read more >Getting Started With XML Indexes - Simple Talk
This information is made up of XML node values, types, and paths. A primary index is a 'pre-shredded' representation of the XML blob...
Read more >XML Schema Part 2: Datatypes Second Edition - W3C
The order in which the ·memberTypes· are specified in the definition (that is, the order of the <simpleType> children of the <union> element,...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Dear Robert, first let me thank you for your nice words. We already had the same issue with Page Viewer and another tool which was used by @cneud (I’ve forgotten the name). But the point is that by eynollah we can find the reading order as Page Viewer shows that or better to say regardless of viewers we know the orders of text regions in the right manner. So we can actually have a call (including @cneud , @kba and @mikegerber) to discuss about this and to see how we can write it into output in order to get desired results.
not yet 😦