DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data
2014, pp. 61–66
Abstract
We present DKPro TC, a framework for supervised learning experiments on textual data. The main goal of DKPro TC is to enable researchers to focus on the actual research task behind the learning problem and let the framework handle the rest. It enables rapid prototyping of experiments by relying on an easy-to-use workflow engine and standardized document preprocessing based on the Apache Unstructured Information Management Architecture (Ferrucci and Lally, 2004). It ships with standard feature extraction modules, while at the same time allowing the user to add customized extractors. The extensive reporting and logging facilities make DKPro TC experiments fully replicable.