Spatial Temporal Transformer Network for Skeleton-Based Action Recognition