Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently