- generate test/training sets for different fragment sizes using the
  generate_fragment_testset.py script
- optimize the weights for different feature combinations using 
  optimize_weight_group.py
