Documents/Uploads (Raw Original Documents)

Documents/Uploads are the first step of the data pipeline, where raw data enters the system. Uploads are typically PDFs containing text, images, and tables. They can originate from multiple sources, including scanned forms, digital reports, and generated documents.


We support a wide range of PDF structures, including:

  • Single-page PDFs of any format
  • Scanned images converted to PDFs
  • Multiple documents bundled into a single PDF
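
Because uploads are expected to be PDFs, it can help to confirm that a file really is one before sending it. The following is a minimal sketch, not part of the product's API: the names `looks_like_pdf` and `check_upload` are hypothetical, and the check relies only on the fact that a PDF file begins with the header bytes `%PDF-`. It does not verify that the file is complete or readable.

```python
PDF_MAGIC = b"%PDF-"  # standard header at the start of a PDF file


def looks_like_pdf(data: bytes) -> bool:
    """Return True if the given bytes begin with the PDF header."""
    return data.startswith(PDF_MAGIC)


def check_upload(path: str) -> bool:
    """Hypothetical pre-upload check: read only the first few bytes of the file."""
    with open(path, "rb") as f:
        return looks_like_pdf(f.read(len(PDF_MAGIC)))
```

A check like this catches files that were renamed to `.pdf` without being converted, such as a raw image scan. It applies equally to single-page, scanned, and bundled PDFs, since all of them share the same header.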

What’s Next

Uploads become Extractions