An explainable data-driven approach to web directory taxonomy mapping