This Quickstart should take around 6 minutes. Speedy!
1. Creating an account
Sign up to the Segna platform by visiting https://platform.segna.io/ and clicking "Create account".
2. Creating a Pipeline
A Pipeline defines what you want your output schema to look like.
Create a Pipeline by clicking the "Build new Pipeline" button.
3. Adding an output destination for your data (optional)
You can configure to have csvs/files imported using this Pipeline to output to a database or data-lake of your choice.
Select "Add Database"
Select "Add New Connector"
Select your desired database or data-lake
Enter the relevant credentials then select "Add Connector"
You may need to add additional details once the connector has been selected such as the table name or destination folder
4. Defining your output schema
Add fields to your output schema - i.e. the columns you expect there to be once the csv file has been uploaded, cleaned and processed.
The expected data type of the field can be configured, as well as in some cases, the expected unit and format. Segna will automatically do this conversion for you when a csv is uploaded!
If you don't want to allow any additional fields to be added as the csv is uploaded, you can deselect "Allow additional fields on job run".
5. Publishing and testing your Pipeline
You can now test your Pipeline by clicking on "Run Test Job". Upload a csv file that you would be expected to be uploaded - remember your input columns don't need to match! Segna will also try cleaning the file.
Once you have completed the upload process, if you had supplied an output destination in Step 3, you will see the data in your specified destination.