Data Formats

We support the following formats:

  • CSV
  • NDJSON
  • JSON

You may upload this data via raw text, file upload, or remote URL.

UI

Navigate to “Sources” on the left menu bar, and click on ”+“:

Infer from a Text sample

Paste a sample JSON, CSV or NDJSON data into the text field. Click Next

Validate the schema and give a name to the source

Click “Create” when done.

Infer from a File

Upload your data via your preferred format.

Then press “Next”

Validate the schema and give a name to the source

Click “Create” when done.

Infer from a URL

Paste the public URL into the field.

Then press “Next”


Validate the schema and give a name to the source

Click “Create” when done.

CLI

Define the schema for your source in a YAML file:

sales_calls.yaml
name: sales_calls
description: sales call transcripts
cols:
  ID: UUID
  Transcript: String
settings:
  engine: MergeTree()
  order_by: '`ID`'
  partition_by: tuple()

Push sources to your workspace with af push.

For example, to push sales_calls.yaml, run:

af push sales_calls.yaml