python - How do you remove html tags using Universal Feed Parser? -
Documentation lists the tags that are allowed / removed by default:
But it does not tell anything which extra tags you can remove.
Is there a way to use the Universal Feed Parser to do this or do you like something like your own rajx and / or beautiful soup?
I took a quick look at the code and I do not think there is a way to overwrite them directly But you can overwrite the feedparser._HTML. List of tags to be removed before Sanitizer.acceptable_elements , feedparser.parse
Comments
Post a Comment