ConvertSpot.com PDF to XML, Convert PDF to XML, PDF to XML Converter, PDF XML Converter
PDF to Text, Excel to XML, Excel to CSV, Excel to Text, Word to Text, PowerPoint to Text
 
   

Products->Convert PDF to XML (COM Edition)

PDF to XML Converter(P2X) extract the text information from the pdf file and output them into a xml file. All the functionalities were encapsulated into a COM component, the exposed methods/interface is as same as PDF Plain Text Extractor(P2T)-COM Edition, but the output file is in XML format. You can integrate it into your own application and redistribute it royalty free. The trial version can only convert the first 5 pages of every document. it's 15 days free.

Download Now          FAQ'S            Buy It Now

Features:
    1.Supports PDF specification 1.0-1.5
    2. Don't need any third party products such as Adobe Acrobat or Adobe Acrobat reader
    3. Support LZW encoded pdf documents.
    4. Supports English, French, German, Italian and other European languages.
    5. Can output page ranges of the document.
    6. Support Unicode, multi-bytes languages.
    7. Support Type0, Type1, Type3, TrueType, CIDFont fonts.   
    8. The component is multiple threads safe.

System Requirements:
    1.Pentium 75 PC or higher, 32MB RAM or more. 
    2.Win95, Win98, Win ME, Win2000, WinNT 4.0, Win XP, Vista, Windows 7.

The definition of output XML file:   
The format of output xml file was defined in PDFDocument.xsd

Output XML file example:

<?xml version="1.0" encoding="UTF-8"?>
<PDFDocument>
  <PDFInfo>
      <title><![CDATA[ PDF Reference ]]></Title>
      <Subject><![CDATA[PDF Reference 1.4]]></Subject>
      <Author><![CDATA[Smith.H]]></Author>
      <Creator><![CDATA[PDF Writer]]></Creator>
      <Producer><![CDATA[Adobe Acrobat]]></Producer>
      <CreateDate><![CDATA[2002/06/15]]></CreateDate>
      <KeyWords><![CDATA[PDF Reference]]></KeyWords>
  </PDFInfo>
  <Pages>
    <Page>
      <PageNumber>1</PageNumber>
      <PDFElement>
          <Coordinate_X>12</Coordinate_X>
          <Coordinate_Y>34</Coordinate_Y>
          <DataString>
<![CDATA[
Hello, this is a data chunk with
special chars "~@@^%^$(^#\''"'and
line break.CDATA will deal with
this kind of data perfectly.
]]>
         </DataString>
      </PDFElement>
      .
      .
      .
    </Page>
    .
    .
    .
  </Pages>
</PDFDocument>

 

Installation and Sample code:

 Please click here for detail installation guide and sample code.

Purchase Online:

We currently only provide developer license for PDF XML Converter-(DLL Edition),
Please read the software license carefully before you place order.
Developer License:

With a developer license you can integrate it into your product and distribute it with your own product royalty free.    Developer License  

$1300 Buy Now

 

 
© 2004 Retsina Software Solutions Ltd. All rights reserved.           Site Map  |  Contact us  |  Privacy Policy