Accelerating Distributed K-FAC with Smart Parallelism of Computing and Communication Tasks