The S3 connection will keep a S3 Bucket Folder and everything in it, including subfolders, in sync.

Creating a Connection

  1. First, follow our guide to Create your AWS credentiasl

  2. Select S3 from the Connector Dropdown

  3. Fill out the form with your Access Key, Secret Access Key and Region

  4. Once you submit, you will be redirected to a new page

  5. Select an API key from the dropdown at the top. All documents created by the connection will be attributed to the selected API key.

  6. For S3 Bucket, type in the bucket you want to sync

  7. For S3 Bucket Prefix, type in which folder you want to sync. If you want to sync all files and folders, leave this blank. If you want to sync everything under images/cats type in images/cats.

  8. Fill out any metadata you want to associate with the files in the folder. You can use this to filter the data later. This is in JSON format. You can leave it blank, or you could set it to something like

    1. {
        "company": "acme"
      }
      

  9. Select an Import mode. This is only applicable for PDFs. Fast only extracts text, but Hi-res also extracts images and tables. Fast can be up to 20x fast than Hi-res

  10. Enter a name for your partition where you want your documents to live.

  11. Click "Create Connection" at the top right.

You will be taken back to the connectors page, and the system will schedule a sync process for you as soon as possible.

What is synced?

  • Files - all files supported by Ragie are synced. See the api reference for more information.