-
collect_data.py
– the main script that does the data collection -
collect_data_configuration.json
which holds all the configuration parameters -
my-company-sa.json
– A service account file to be used to upload the collected data.
collect_data_configuration.json
The configuration file looks as follows:
regatta_deploy_dir
This is the location where the Regatta software was deployed in. In this example the default is:
regatta_logs_dir
This is the location where the Regatta cluster stores its logs. The default is:
file_ignore_list
This list parameter allows you to filter out files that you wish would not be collected by the
script. It’s basically a list of patterns that are applied to remove any file that matches it. The
default value is:
NOTE: Filtering out files without consulting with Regatta Support prior to doing so may hinder
the ability of support to analyze the data and understand the state of the system.
upload_bucket_name
This is the Regatta bucket where we hold the uploaded data. The default is:
upload_bucket_folder
This is your dedicated personal folder in the Regatta bucket. The default value is:
archive_output_dir
This is the location where the script will create an archive of all the data collected on the server.
The default here is:
service_account_file
The service account file is the authentication file to the folder. The default is:
collect_data.py
script. If you
decide to locate it elsewhere, please add the path to it as well.
upload
This flag indicates whether to upload the data or not. The default value is FALSE. In case you
have an internet connection accessible, and you would like the script to upload the collected
data for you, please change it to TRUE.
Note that you can use the script later to upload the data that was collected and not uploaded
yet.
delete_tar
This flag indicates whether to delete the data after a successful upload. The default value is
FALSE.
tar_path
This parameter allows you to upload a tar that was generated but not uploaded as a result of the
flag upload being set to FALSE. The default of this flag is NULL. If it is set to a path, for example:
- The script WILL NOT collect data as it is assumed that the data was collected and it’s in the tar_path. Note that this path should contain a tar.zst file to be uploaded.
- The collect_data script will try to upload the data to the Regatta cloud storage bucket.
folder_path
This parameter allows you to request uploading the data located in the folder_path
. The
default value for it is NULL. If it is set to some path, it will collect the files from there instead of
the 2 predefined locations, i.e., regatta_deploy_dir
and regatta_logs_dir
. It will
apply the file_ignore_list
filter on the data collected from the path defined in
folder_path
.