A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models