Convert PDF to HTML format using C# and .NET


PDF to HTML
  • Supports all PDF versions (1.0 - 2.0, PDF/A)
  • Opens password-protected documents.
  • Converts selected PDF pages as well as whole documents.
  • Produces HTML5 with CSS.
  • Can store images inside the HTML using base64 encoding.
  • Output is compatible with Chrome, Firefox, and IE.

Without belaboring the point, let's see how to add a PDF to HTML feature to any .NET application. First, to give your .NET application the ability to convert PDF to HTML, add a reference to the "SautinSoft.PdfFocus.dll" assembly. You may download it here (63.3 MB).
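Once the assembly is referenced, a basic conversion might look like the minimal sketch below. The member names (OpenPdf, PageCount, HtmlOptions.IncludeImageInHtml, ToHtml) and the "0 means success" return value follow the vendor's published samples as best recalled, so treat them as assumptions and check them against the API reference for your version. The file paths are hypothetical.

```csharp
using System;

class PdfToHtmlSketch
{
    static void Main()
    {
        // Hypothetical paths; replace with your own files.
        string pdfPath = @"c:\docs\sample.pdf";
        string htmlPath = @"c:\docs\sample.html";

        // Assumed API, based on the vendor's samples; verify for your version.
        SautinSoft.PdfFocus f = new SautinSoft.PdfFocus();
        f.OpenPdf(pdfPath);

        if (f.PageCount > 0)
        {
            // Assumption: this option embeds images into the HTML as base64
            // instead of writing them to separate files.
            f.HtmlOptions.IncludeImageInHtml = true;

            // Assumption: ToHtml(string) returns 0 when the conversion succeeds.
            int result = f.ToHtml(htmlPath);
            Console.WriteLine(result == 0 ? "Converted." : "Conversion failed.");
        }
        else
        {
            Console.WriteLine("The PDF could not be opened or has no pages.");
        }
    }
}
```

Checking the page count before converting is a cheap way to detect a file that failed to load.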

PDF Focus .Net generates documents in HTML5 format. It can also embed images directly in the HTML as base64-encoded data, so the output is a single self-contained document. Because no separate image files have to be written, it is possible to convert PDF to HTML completely in memory, without touching the file system.
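To see why base64 images remove the need for the file system, here is a small illustration that uses only the .NET base class library. It is not the library's internal code; the class name, the ToImgTag helper, and the sample bytes are made up for this sketch. It shows the general technique of placing image bytes in an img tag through a data URI.

```csharp
using System;

class DataUriSketch
{
    // Hypothetical helper: wraps raw image bytes in an <img> tag whose
    // src attribute is a base64 data URI, so no image file is needed.
    static string ToImgTag(byte[] imageBytes, string mimeType)
    {
        string base64 = Convert.ToBase64String(imageBytes);
        return "<img src=\"data:" + mimeType + ";base64," + base64 + "\" />";
    }

    static void Main()
    {
        // Placeholder data: the eight-byte PNG file signature, not a full image.
        byte[] bytes = { 0x89, 0x50, 0x4E, 0x47, 0x0D, 0x0A, 0x1A, 0x0A };

        // The resulting tag can be kept in a string and sent straight to a
        // browser or an HTTP response without creating any files.
        Console.WriteLine(ToImgTag(bytes, "image/png"));
    }
}
```

The trade-off is size: base64 text is roughly a third larger than the binary image, so embedded images suit single-file output better than very image-heavy pages.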

All HTML documents produced by SautinSoft.PdfFocus.dll conform to W3C standards and pass the W3C Markup Validation Service.