HealthQA-BR: A System-Wide Benchmark Reveals Critical Knowledge Gaps in Large Language Models