HTML-to-markdown
This is a robust Golang tool to convert HTML, even entire websites, into clean Markdown. I find it particularly neat for LLM website use cases where structured content is critical.
This Go library provides robust language detection, accurately identifying the language of both short and long texts. It's designed to overcome common limitations of other solutions, offering offline functionality without external APIs.
Visit github.com →