I tried running AI chatbots locally on my laptop -- and they kinda suck

PCWorld 

Newer open LLMs often come with boasts of big benchmark improvements, and that was certainly the case with DeepSeek-R1, which came close to OpenAI's o1 in some benchmarks. But the model you run on your Windows laptop isn't the same one that's scoring high marks. This simple question, and the LLM's rambling answer, shows how easily smaller models can go off the rails. They frequently fail to notice context or pick up on nuances that should be obvious. In fact, recent research suggests that smaller, less capable large language models with reasoning capabilities are prone to such faults.