Read this paper describing a search for peptides using machine learning. Then answer the questions.
You can download Weka if you want to experiment.
This version of the pepseq data file is in the Weka arff format
This version is smaller. Try experimenting with it first.