Protein classification based on text document classification techniques | doi.page