The dataset supports the thesis "Semantic Impact - A novel approach for domain concept selection in ontology learning". Document-based Ontology (DbO) sets for the second experiment - 3000 in each corpus. Constructed by adding additional 2000 documents into the original 1000 corpus set. Folder X -> Target Corpus Folder Y -> Source Corpus