Convert PDF to text, HTML or CSV, extract images, and read information about PDF files through .NET and ActiveX interfaces with Bytescout PDF Extractor SDK.
PDF Extractor SDK lets developers convert PDF to text, PDF to HTML and PDF to CSV for Excel, and extract images from PDF. It works without any additional software.
Here are some key features of "Bytescout PDF Extractor SDK":
· extracts text from PDF files according to original PDF layout;
· converts PDF to Excel by reading cells from given rectangle;
· converts PDF to HTML with layout preserved;
· extracts PDF file metadata (title, author, description) and gets other information about the file (number of pages, encrypted or not);
· extracts images from PDF documents;
· doesn't require Adobe Reader or any other PDF reader software to be installed;
· provides .NET and ActiveX interfaces;
· made with 100% managed C# code;
Limitations:
· 30-day trial
What's New in This Release:
· improved support for Unicode text extraction
· improved support for PDF/A files
· fixed white stripes appearing when multiple images are combined
· data extraction internal optimizations
· improved support for 8-bit images inside PDF
· vector drawings improved to provide better support for multiple small objects
· fixed color representation in images with indexed colors
· improved Type2 font support
· improved support for embedded fonts in PDFs produced by the Ghostscript engine
· fixed CCITT image compression issues
· improved support for LZW-compressed PDF
· improved support for shading objects
· improved PDF fonts support
· improved support for PDF with 4-bit images

Via: Bytescout PDF Extractor SDK 2.40.650