#html #markdown #converter

bin+lib html2md

Library and binary to convert simple html documents into markdown

8 releases

0.2.5 Aug 7, 2018
0.2.4 Jul 28, 2018
0.2.1 Jun 17, 2018
0.2.0 May 11, 2018
0.1.1 Apr 2, 2018

#50 in Parser implementations

Download history 10/week @ 2018-05-16 3/week @ 2018-05-23 18/week @ 2018-05-30 3/week @ 2018-06-06 14/week @ 2018-06-13 19/week @ 2018-06-20 34/week @ 2018-06-27 9/week @ 2018-07-04 5/week @ 2018-07-11 23/week @ 2018-07-18 18/week @ 2018-07-25 29/week @ 2018-08-01 93/week @ 2018-08-08

92 downloads per month

GPL-3.0+

44KB
691 lines

HTML2MD

Library to convert simple html documents into markdown flavor. Implements markdown as written on its inception page.

Features

Currently supported:

  • Lists (and inner lists)
  • Headers
  • Quotes (and inner quotes)
  • Paragraphs
  • Horizontal rulers
  • Images and links
  • Tables
  • Formatting (bold, italic, strikethrough, underline)
  • Code

Limitations

  • no markdown flavors support (-/+ unordered list styles, ##/== headers etc.)
  • doesn't yet detect code style

Used libraries

html5ever - Servo egine HTML parsing library, used to convert html input to DOM

regex - PCRE support in Rust, used to correct whitespaces

Contributions

You may create merge request or bug/enhancement issue right here on GitLab, or send formatted patch via e-mail. For details see CONTRIBUTING.md file in this repo.

License

Copyright (C) 2018  Oleg `Kanedias` Chernovskiy

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

Dependencies

~3.5MB
~63K SLoC