question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Ignore HTML tags in source text

See original GitHub issue

Would be nice if TextVide had an option to ignore HTML tags in the source text.

I have a source text that is an HTML string, previously rendered from markdown. The only way I have been able to use TextVide is by using jquery to render the HTML, looping over each tag, and running TextVide on each individual text node. Very inelegant and probably inefficient.

Also as a bonus feature it would be good to be able to provide a blacklist of specific tags to exclude their content from TextVide, for example existing <b> or <strong> tags.

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:6 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
teakcommented, Aug 9, 2022

Hey thank you for getting this done, honestly not ‘late’ at all. So far it is all working great. If all goes well we’ll be adding this as an optional accessibility feature in our education app.

1reaction
teakcommented, Jul 30, 2022

I had a quick look at your source, your right it does not seem to be a simple change. I think how I would implement it would be to create a separate function (not an option on the main) and use an HTML parsing library like ‘htmlparser2’ to preprocess the text.

Read more comments on GitHub >

github_iconTop Results From Across the Web

How to make html ignore code that is part of text?
Short Answer. Encode your code using an online HTML Encoder and then put it inside pre <pre> <%--your encoded code goes here--%> </pre>....
Read more >
if I give a text with html tags Spacy NER will ignore ... - GitHub
One very simple way to do this initially is to replace all the HTML tags with spaces rather than removing them, so the...
Read more >
How to remove HTML data tags from text fields
To eliminate HTML tags from your fields you can create a new field and use any of the following two expressions: REGEXREPLACE (Field_Name,...
Read more >
Remove HTML Tags Using Java | Baeldung
Learn how to remove all HTML tags and extract the text from an HTML document string.
Read more >
HTML Stripper to remove / scrub / strip HTML tags helps ...
HTML Stripper removes HTML tags and convert HTML code to text, which scrub text formatting of the HTML to save and share TEXT....
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found