When a document is processed, a series of XML entries is created, each combining a piece of text with its location on the page.  Certain cleanup logic is then applied to the newly created entries, which are stored in PDocRowXml.  Coordinates are specified relative to the bottom-left corner of the page, with 72 units per inch.  A value of LeftPdf=144 therefore means that the left-hand side of the box enclosing the text is 2 inches from the left edge of the page.  In DocVacEnterprise, where the user has a much greater ability to customize and optimize how searches are performed, it is often instructive to examine the underlying XML data, both to determine which search strategy will best produce the desired extraction results and to debug why a given search strategy fails to extract a data element as expected.
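The coordinate convention above (origin at the bottom-left corner, 72 units per inch) can be sketched as a pair of helpers. This is only an illustration: the class name PdfUnits and both method names are hypothetical and not part of DocVac, and the page height passed to FromTop is an input the caller must supply, since it is not one of the stored coordinates.

```csharp
using System;

// Hypothetical helper, not part of DocVac: converts the stored
// coordinates (72 units per inch, origin at bottom-left) to other forms.
public static class PdfUnits
{
    // 72 units per inch, as stated in the text above.
    public const decimal UnitsPerInch = 72m;

    // Converts a coordinate such as LeftPdf to inches from the page edge.
    public static decimal ToInches(decimal pdfUnits)
    {
        return pdfUnits / UnitsPerInch;
    }

    // Converts a coordinate measured up from the bottom edge (such as TopPdf)
    // to a distance measured down from the top edge. pageHeightPdf is an
    // assumed, caller-supplied page height in the same units.
    public static decimal FromTop(decimal topPdf, decimal pageHeightPdf)
    {
        return pageHeightPdf - topPdf;
    }
}
```

For example, PdfUnits.ToInches(144m) evaluates to 2, matching the LeftPdf=144 case described above.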

PDocRowXmlNc is a simplified version of PDocRowXml.  A DocVacBasic user can retrieve the XML entries for a document in this form through the web service GetPDocRowXmlNc:

using System.ComponentModel.DataAnnotations;  // provides [Display]

public class PDocRowXmlNc
{
    [Display(Name = "PdrxId")]
    public int PDocRowXmlId { get; set; }

    [Display(Name = "PdId")]
    public int PDocId { get; set; }

    [Display(Name = "PdemId")]
    public int PDocExtractionModeId { get; set; }

    [Display(Name = "PdpId")]
    public int PDocPageId { get; set; }

    // Box enclosing the text: 72 units per inch,
    // measured from the bottom-left corner of the page.
    [Display(Name = "Left")]
    public decimal LeftPdf { get; set; }

    [Display(Name = "Bottom")]
    public decimal BottomPdf { get; set; }

    [Display(Name = "Right")]
    public decimal RightPdf { get; set; }

    [Display(Name = "Top")]
    public decimal TopPdf { get; set; }

    // The text found inside the box.
    [Display(Name = "Text")]
    public string TextContent { get; set; }
}
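
When examining the entries to choose or debug a search strategy, it can help to list them in reading order. The following is a minimal sketch, not DocVac code: RowEntry is a trimmed stand-in for PDocRowXmlNc so that the snippet compiles on its own, RowInspector and its methods are hypothetical names, and the entries are assumed to have been retrieved already (the call to GetPDocRowXmlNc is not shown). It also assumes that PDocPageId values increase in page order.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Trimmed stand-in for PDocRowXmlNc (only the fields used here).
public class RowEntry
{
    public int PDocPageId { get; set; }
    public decimal LeftPdf { get; set; }
    public decimal BottomPdf { get; set; }
    public decimal RightPdf { get; set; }
    public decimal TopPdf { get; set; }
    public string TextContent { get; set; }
}

// Hypothetical helper for inspecting entries by eye.
public static class RowInspector
{
    // Orders entries page by page, top to bottom, then left to right.
    // The origin is the bottom-left corner, so a larger BottomPdf
    // means the text sits higher on the page.
    public static List<RowEntry> InReadingOrder(IEnumerable<RowEntry> rows)
    {
        return rows
            .OrderBy(r => r.PDocPageId)
            .ThenByDescending(r => r.BottomPdf)
            .ThenBy(r => r.LeftPdf)
            .ToList();
    }

    // One line per entry: page, box as left/bottom/right/top, then the text.
    public static string Describe(RowEntry r)
    {
        return string.Format(
            "page {0} [{1}, {2}, {3}, {4}] \"{5}\"",
            r.PDocPageId, r.LeftPdf, r.BottomPdf, r.RightPdf, r.TopPdf, r.TextContent);
    }
}
```

A typical use would be to pass the retrieved entries through InReadingOrder and write Describe for each one to the console, then compare the boxes of neighbouring entries with the region a search is expected to match.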



