Topic-based document segmentation with probabilistic latent semantic analysis | doi.page