Mitigating Performance Saturation in Neural Marked Point Processes: Architectures and Loss Functions