Doesn't checks for valid termination
See original GitHub issueFor the following input:
from urlextract import URLExtract
extractor = URLExtract()
text="""
http://httpbin.org/status/204, http://httpbin.org/status/204.
"""
urls = extractor.find_urls(text)
print(urls)
The output generated is:
['http://httpbin.org/status/204,', 'http://httpbin.org/status/204.']
The set [.,?!-]
are not valid terminal symbols for the url and thus should be checked.
Issue Analytics
- State:
- Created 5 years ago
- Comments:5 (4 by maintainers)
Top Results From Across the Web
'Employment at Will' Isn't a Blank Check to Terminate ... - SHRM
Many small employers and, especially, their CEOs believe "employment at will" allows them to fire a worker for just about any reason.
Read more >Final Paycheck Rules by State - LawInfo
For contract employees, the final paycheck amount cannot be determined until the contract is officially terminated. Once that occurs, the ...
Read more >12 Things Everyone Should Know About Employment Law
A termination that is unfair is not necessarily illegal. In most circumstances, employers can terminate employees “at will,” meaning at any time for...
Read more >Arizona Labor Laws on Termination Pay and Final Paychecks
If an employee is laid off, the employer is required to dispense their final paycheck within seven business days or by the next...
Read more >Employee Terminations - MRSC
What are the Grounds for Which an Employee May Not Be Terminated? · The Termination May Not Be Based on the Employee's Age,...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
OK, thanks. I will look to all your reported issues.
In the meantime, one can hack (at least commas) via
Perhaps sensible default is treating unconventional special characters as forbidden in url and adding a nicer constructor argument to allow to configure that if someone really wants them in URL?