Skip to content
  • Volker Krause's avatar
    Improve HTML to text conversion · fcc5767f
    Volker Krause authored
    Specifically, this avoid generating spurious whitespaces and does line
    breaks that more closely follow the expected layout in HTML.
    
    This helps extractors that have to use the text representation due to the
    corresponding extractors containing no meaningful DOM structure to work
    with.
    fcc5767f