Efficient Global String Kernel with Random Features: Beyond Counting Substructures