Model training
Training with GenoML evaluates several different algorithms and outputs the best algorithm based on a specific metric that can be tweaked using the --metric_max flag (default is AUC).
Required arguments for GenoML are the following:
data: Is the datacontinuousordiscrete?method: Do you want to usesupervisedorunsupervisedmachine learning? (unsupervised currently under development)mode: would you like tomunge,train,tune, ortestyour model?--prefix: Where would you like your outputs to be saved?
The most basic command to train your model looks like the following; it looks for the *.dataForML file that was generated in the munging step:
If you would like to determine the best competing algorithm by something other than the AUC, you can do so by changing the --metric_max flag (Options include AUC, Balanced_Accuracy, Sensitivity, and Specificity) :
caution
The --metric_max flag is only available for discrete datasets.