DIVISION: Memory Efficient Training via Dual Activation Precision