Bitnet.cpp: Efficient Edge Inference for Ternary LLMs