Since version 5.0, Squey is able to import and export Apache Parquet files!
«
Apache Parquet is an open-source file format that stores data efficiently in columnar format, provides different encoding types, and supports predicate filtering. With good compression ratios and efficient encoding, VPC flow logs stored in Parquet reduce your Amazon S3 storage costs.
» ― AWS Blog
Let’s take advantage of the fact that AWS VPC Flow Logs can be natively stored in Apache Parquet format to seamlessly visualize our network and understand traffic patterns, identify security issues, audit usage, and diagnose network connectivity.
In AWS VPC console
select a VPC, click Create flow log
and set the following parameters:
Then you can open in Squey as many Parquet files as you want in one go, as show in the quick video below: