Text mining: Generating hypotheses from MEDLINE
Citations Over TimeTop 10% of 2003 papers
Abstract
Abstract Hypothesis generation, a crucial initial step for making scientific discoveries, relies on prior knowledge, experience, and intuition. Chance connections made between seemingly distinct subareas sometimes turn out to be fruitful. The goal in text mining is to assist in this process by automatically discovering a small set of interesting hypotheses from a suitable text collection. In this report, we present open and closed text mining algorithms that are built within the discovery framework established by Swanson and Smalheiser. Our algorithms represent topics using metadata profiles. When applied to MEDLINE, these are MeSH based profiles. We present experiments that demonstrate the effectiveness of our algorithms. Specifically, our algorithms successfully generate ranked term lists where the key terms representing novel relationships between topics are ranked high.
Related Papers
- → Analysis of Operational Airborne ISR Full Motion Video Metadata(2013)2 cited
- → A case study in designing Chinese metadata(2000)2 cited
- → Finding coincident data from satellites: using "meta-metadata" to reduce load on archive(2003)1 cited
- The Exploration of the Standardization in Data Image on Metadata(2007)
- Review on Audiovisual Metadata Elements Abroad(2005)