TopCat: data mining for topic identification in a text corpus | doi.page