Let's get started!¶
Skippr is a data pipeline tool that makes it easy to ingest, transform and output optimised data.
Hello World Example¶
In this simple example we'll convert json source data to Parquet files for analysis.
This is a simple copy and paste example, to get started quickly in your local terminal. You'll be ready to deploy Skippr and integrate enterprise systems in no time!
- Install Skippr cli tool
- Ingest example Bike Hire IoT data from S3
- Finally, we'll do some analysis on the output data.
Install Skippr¶
curl -sL "https://raw.githubusercontent.com/skipprd/skipprd/main/install.sh" | bash
Ingest Example Data With Skippr¶
Copy and run the below command below in your terminal.
DATA_SOURCE_PLUGIN_NAME=s3 \
DATA_SOURCE_S3_BUCKET=skippr-public-sample-data \
DATA_SOURCE_S3_PREFIX=bike-hire \
AWS_DEFAULT_REGION=us-east-1 \
PIPELINE_NAME=bikehire \
skippr sync
Analyse the output data¶
skippr query --query 'SELECT COUNT(*) FROM bikehire;'
Help¶
Join us on Slack.