Parquet output format makes it easy to set up data pipelines for data lakes. Parquet is more efficient than CSV for storing and querying the data, and it makes processing the data easy as it contains metadata such as the data types of each field.
You can select the Parquet output format for cloud storage destinations when setting up a destination on the Supermetrics Hub.
Instructions
- Log in to the Supermetrics Hub.
- In the sidebar, go to Storage → Storage destinations.
- Click Add storage, or if you have already set up some destinations, click New storage.
- Fill in the necessary details as you would normally do. Take a look at our prerequisite and configuration guides for various data warehouse destinations.
- Set a unique upload path for the data, avoiding conflicts with existing destinations.
- In the Output format dropdown, select Parquet as the output format for the destination. This setting is only visible for cloud storage destinations, not data warehouses.
- Click Save to apply the changes.
- Create a transfer to your data lake destination. The standard steps for creating transfers apply, so as usual, specify the source, destination, and any additional transfer settings required.