Prompting the data transformation activities for cluster analysis on collections of documents