Paraphrase recognition using machine learning to combine similarity measures
2009pp. 27–27
Citations Over TimeTop 10% of 2009 papers
Abstract
This paper presents three methods that can be used to recognize paraphrases. They all employ string similarity measures applied to shallow abstractions of the input sentences, and a Maximum Entropy classifier to learn how to combine the resulting features. Two of the methods also exploit WordNet to detect synonyms and one of them also exploits a dependency parser. We experiment on two datasets, the MSR paraphrasing corpus and a dataset that we automatically created from the MTC corpus. Our system achieves state of the art or better results.
Related Papers
- Enlarging Paraphrase Collections through Generalization and Instantiation(2012)
- Finnish Paraphrase Corpus(2021)
- → Chinese Paraphrase Dataset and Detection(2021)2 cited
- Paraphrase extraction from interactive Q&A communities(2012)
- → A Study on the Application of Paraphrase Strategy in the Translation from Chinese to English(2018)1 cited