Archive

Trying to find more information on something? Use the search function to quickly find what you need.


I'm trying to find...

Date
Title Summary:
4/20/2018
DocVac Use Cases

Some common usage scenarios (use cases) where DocVac can be helpful:

1. Financial statements, or other documents containing tables that can be run in table extraction ...

more
2/9/2021
MySearch - DocVacGold / Enterprise

MySearch contains a number of items helpful in the construction of searches.  Searches at a document level are controlled by one or more

more
4/14/2021
Web Services

With DocVacBasic, DocVacGold or DocVacEnterprise, you can upload documents, check on their status and retrieve information about documents using our web services.  Our documentation ...

more
4/14/2021
DocVac Basic vs. DocVac Gold vs DocVac Enterprise

DocumentVacuum provides two offerings that you can use -  DocVacBasic and DocVacGold, with a third option DocVacEnterprise which is invitation only for existing DocVacGold users and not ...

more
4/2/2018
Using OCR

Optical character recognition or OCR allows text to be extracted with varying degrees of accuracy from images.  A PDF that contains embedded text can usually be extracted with 100% accuracy, ...

more
2/20/2021
Raw Text Extraction

Raw text extraction where you look for a word like 'COMPANY' can be accomplished in two ways.  95%+ of the time, the best way is to search the raw text (that you can view in MyDoc - ...

more
2/9/2021
Search - OneDocMaster

There are 11 user configurable functions to classify the document and the individual pages within available to a Gold user e.g search for instances of the word COMPANY).  Example output: ...

more
10/11/2018
Search - OnePageMaster

There are 21 functions available to extract data from individual data points inside the document available to Gold and Enterprise users.  Of these, 3 are cleanup related e.g. when the cell ...

more
3/3/2021
Combining Multiple Docs into One Doc

If you have multiple JPG/TIF/PNG files that represent the same document, you can combine them as if they were one doc.  This is not currently supported for PDFs.  So for example, if you ...

more
2/17/2021
Billing - DocVacBasic & DocVacGold

Monthly Fees

A monthly account fee of $5 (DocVacBasic) or $50 (DocVacGold) is charged and is payable in advance.

Reference Documents

For ...

more
4/14/2021
Web Services - Usage Charges

Usage charges for web services are as follows:

PDocDetailApi/GetPddListLatest, BscId=7, $0.01 per call, first 288/day free (= once every 5 minutes)

PDocDetailApi/GetPddList, ...

more
10/27/2018
CSV Files

We provide comma separated value or CSV extracts in a number of places to help users download and review tables of data in Excel or other spreadsheets.  A few things to note - firstly, ...

more
6/18/2018
Key Term Search with Wildcards

You can wildcards to a key term to refine your searches of XML data.  We'll describe the character set used by DocVacBasic here; DocVacEnterprise has two additional character sets ...

more
4/14/2021
Postman

To test our web services using Postman, do the following:

1. Download and install Postman if you don't currently have ...

more
4/14/2021
Web Services - Ws - PDocDetailApi.GetPddList

Sample C# code to use the web service - similar code will work for all the web services except uploading documents.  Note the ability to loop through model validation ...

more
2/9/2021
Upload Mode

There are 5 modes of uploading a document:

1. Simple mode - a plain text search to find words e.g. COMPANY in a document and store the page number where found.

2. ...

more
4/14/2021
External Software & Services

We are very grateful to a number of individuals and organizations whose software and services help power documentvacuum.com:

Microsoft for its C# language, .Net Core 3.1 and .Net 5 and ...

more
4/9/2021
Web Services - Data Defns - ExtrMode

There are 3 modes of document extraction:

Id=1 - PDF treated as having embedded text (only first 100 pages extracted)

Id=2 - PdfAsImage : treated pdf as image file with ...

more
4/14/2021
Web Services - Data Definitions - PDocStatus

In general, an uploaded file is stored in the cloud and entries are made to create a new PDocId and generally prepare for data extraction.  The PDoc then goes through 4 extraction steps - an ...

more
2/9/2021
Web Services - Data Definitions - PdrxDir

PdrxDir (PDocRowXmlDirection) contains 20 directions used by a DocVacGold user.  A few of the more commonly used are listed here:

=XmlCell - the pdrx cell itself contains both the ...

more