Efficient model-free Q-factor approximation in value space via log-sum-exp neural networks