Data miners' little helper: data transformation activity cues for cluster analysis on document collections