Click logo to clean text
Click logo to clean text
What gets cleaned
- Zero-width characters, non‑breaking spaces
- Markdown symbols:
##,**, backticks, brackets, link syntax - Typographic dashes and quotes normalization
- HTML tags and entities
- URLs and fragments (optional)
- Comment lines starting with
# - Emojis and pictographs (optional)
- Extra spaces and duplicate blank lines