python generate_train_test_split.py \
--dataset "MY_DATASET_PATH"
--dataset
Path to the DataSet which contain .dat files.
--train_ratio,
--min_cat_train_size,
--output_path
Use pandas to speed up calculation of training and testing split
Bad implementation