Commit

Avoids spaces after inline code
steineggerroland committed Jan 13, 2025
1 parent 959560e commit 25085f7
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions trafilatura/xml.py
@@ -35,6 +35,7 @@
CONTROL_PARSER = XMLParser(remove_blank_text=True)

NEWLINE_ELEMS = {'graphic', 'head', 'lb', 'list', 'p', 'quote', 'row', 'table'}
SPECIAL_FORMATTING = {'code', 'del', 'head', 'hi', 'ref'}
WITH_ATTRIBUTES = {'cell', 'row', 'del', 'graphic', 'head', 'hi', 'item', 'list', 'ref'}
NESTING_WHITELIST = {"cell", "figure", "item", "note", "quote"}
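
As a rough illustration of how a tag set like `SPECIAL_FORMATTING` could suppress the space after inline code, here is a minimal sketch. The helpers `trailing_separator` and `render` are hypothetical names introduced for this example and are not taken from `trafilatura/xml.py`; the assumption is that the serializer picks a separator to emit after each element based on its tag.

```python
# Hypothetical sketch: NOT the actual trafilatura implementation.
# Assumption: after each element the serializer emits a newline, nothing,
# or a single space, depending on which set the element's tag belongs to.

NEWLINE_ELEMS = {'graphic', 'head', 'lb', 'list', 'p', 'quote', 'row', 'table'}
SPECIAL_FORMATTING = {'code', 'del', 'head', 'hi', 'ref'}


def trailing_separator(tag: str) -> str:
    """Return the separator emitted after an element with this tag."""
    if tag in NEWLINE_ELEMS:
        return "\n"
    if tag in SPECIAL_FORMATTING:
        # Inline formatting: no separator, so punctuation can follow directly.
        return ""
    return " "


def render(parts: list[tuple[str, str]]) -> str:
    """Concatenate (tag, text) pairs, each followed by its separator."""
    return "".join(text + trailing_separator(tag) for tag, text in parts)


if __name__ == "__main__":
    # With 'code' in SPECIAL_FORMATTING, the comma attaches to the code span.
    print(repr(render([("code", "`x`"), ("span", ", then more text")])))
```

Under this assumption, a tag missing from `SPECIAL_FORMATTING` would fall through to the final branch and pick up a trailing space, which matches the symptom named in the commit title.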
