Document Pipelines

Document pipelines let you upload documents through an API. First create an API key and attach it to a dataset. Documents uploaded with that key are added to that dataset.


Accessing the API

The following curl example shows you how to upload a file.

curl -X 'POST' \
'http://localhost:7800/v1/document_upload' \
-H "Authorization: Bearer $UPLOAD_API_KEY" \
-H 'accept: application/json' \
-H 'Content-Type: multipart/form-data' \
-F 'files=@YOURDOCUMENT.pdf'

Replace YOURDOCUMENT.pdf with the path to the document you want to upload, and set $UPLOAD_API_KEY to the API key that was generated in the user interface.

Your document will appear on the datasets screen, where you can see its progress through the pipeline.
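The same upload can be scripted without curl. The following is a minimal sketch using only the Python standard library. It assumes the endpoint, the `files` form field, and the bearer-token header behave exactly as in the curl example. The function names `build_upload_request` and `upload_document` are illustrative and not part of any official client, and the response is returned as raw text because its format is not described here.

```python
import mimetypes
import urllib.request
import uuid
from pathlib import Path

# Endpoint taken from the curl example; adjust host and port for your deployment.
UPLOAD_URL = "http://localhost:7800/v1/document_upload"


def build_upload_request(path, api_key, url=UPLOAD_URL):
    """Build a multipart/form-data POST carrying one file in the 'files' field."""
    file_path = Path(path)
    boundary = uuid.uuid4().hex
    content_type = mimetypes.guess_type(file_path.name)[0] or "application/octet-stream"

    # Multipart body: one part header, the raw file bytes, then the closing boundary.
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="files"; filename="{file_path.name}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    body = head + file_path.read_bytes() + tail

    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Accept": "application/json",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def upload_document(path, api_key, url=UPLOAD_URL):
    """Send the file and return the HTTP status code and the raw response body."""
    request = build_upload_request(path, api_key, url)
    with urllib.request.urlopen(request) as response:
        return response.status, response.read().decode("utf-8", errors="replace")
```

Calling `upload_document("YOURDOCUMENT.pdf", api_key)` with the key generated in the user interface sends the request. Building the request is kept separate from sending it so the headers and body can be inspected without a running server.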

Tools

You can call this API directly from your own code, but many tools can help you set up automated pipelines without writing any code. For example, the image below shows an n8n workflow that periodically checks a specific Google Drive folder for new files. When it detects new files, it uploads them to a designated dataset via an API call.

[Image: n8n workflow that monitors a Google Drive folder and uploads new files to a dataset]
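If you prefer code to a workflow tool, the same polling pattern can be sketched in a few lines. This is not the n8n workflow itself: it watches a local folder instead of Google Drive, and `find_new_files`, `watch_folder`, and the `upload` callable are hypothetical names chosen for illustration. Pass in whatever function performs the API call for a single file.

```python
import time
from pathlib import Path


def find_new_files(folder, seen):
    """Return the files in `folder` that are not in `seen`, sorted by name."""
    return sorted(p for p in Path(folder).iterdir() if p.is_file() and p not in seen)


def watch_folder(folder, upload, interval_seconds=60, max_polls=None):
    """Poll `folder` and call `upload(path)` once for each file not yet uploaded.

    `upload` is any callable that sends one file to the document upload API.
    `max_polls=None` keeps polling until the process is stopped.
    """
    seen = set()
    polls = 0
    while max_polls is None or polls < max_polls:
        for path in find_new_files(folder, seen):
            upload(path)    # if this raises, the file stays unmarked
            seen.add(path)  # mark as done only after a successful upload
        polls += 1
        if max_polls is None or polls < max_polls:
            time.sleep(interval_seconds)
```

The first poll treats every file already in the folder as new, and the set of uploaded files lives only in memory, so restarting the script uploads everything again. A longer-running setup would need to persist that state.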