usage: genoml discrete supervised munge [-h] [--prefix PREFIX]
                                        [--impute {median,mean}] [--geno GENO]
                                        --pheno PHENO [--addit ADDIT]
                                        [--feature_selection FEATURE_SELECTION]
                                        [--gwas GWAS] [--p P] [--vif VIF]
                                        [--iter ITER]
                                        [--ref_cols_harmonize REF_COLS_HARMONIZE]
                                        [-v]
optional arguments:
  -h, --help            show this help message and exit
  --prefix PREFIX       Prefix for your output build.
  --impute {median,mean}
                        Imputation: (mean, median). Governs secondary
                        imputation and data transformation [default: median].
  --geno GENO           Genotype: (string file path). Path to PLINK format
                        genotype file, everything before the *.bed/bim/fam
                        [default: None].
  --pheno PHENO         Phenotype: (string file path). Path to CSV phenotype
                        file [default: lost].
  --addit ADDIT         Additional: (string file path). Path to CSV format
                        feature file [default: None].
  --feature_selection FEATURE_SELECTION
                        Run a quick tree-based feature selection routine prior
                        to anything else, here you input the integer number of
                        estimators needed, we suggest >= 50. The default of 0
                        will skip this functionality. This will also output a
                        reduced dataset for analyses in addition to feature
                        ranks. [default: 0]
  --gwas GWAS           GWAS summary stats: (string file path). Path to CSV
                        format external GWAS summary statistics containing at
                        least the columns SNP and P in the header [default:
                        nope].
  --p P                 P threshold for GWAS: (some value between 0-1). P
                        value to filter your SNP data on [default: 0.001].
  --vif VIF             Variance Inflation Factor (VIF): (integer). This is
                        the VIF threshold for pruning non-genotype features.
                        We recommend a value of 5-10. The default of 0 means
                        no VIF filtering will be done. [default: 0].
  --iter ITER           Iterator: (integer). How many iterations of VIF
                        pruning of features do you want to run. To save time
                        VIF is run in randomly assorted chunks of 1000
                        features per iteration. The default of 1 means only
                        one pass through the data. [default: 1].
  --ref_cols_harmonize REF_COLS_HARMONIZE
                        Are you now munging a test dataset following the
                        harmonize step? Here you input the path to the to the
                        *_refColsHarmonize_toKeep.txt file generated at that
                        step.
  -v, --verbose         Verbose output.