Is Function Similarity Over-Engineered? Building a Benchmark