Efficient Approximation Algorithms for Strings Kernel Based Sequence Classification
Muhammad Farhan, Juvaria Tariq, Arif Zaman, Mudassir Shabbir, Imdad Ullah Khan
–Neural Information Processing Systems
Sequence classification algorithms, such as SVM, require a definition of distance (similarity) measure between two sequences. A commonly used notion of similarity is the number of matches between k-mers (k-length subsequences) in the two sequences. Extending this definition, by considering two k-mers to match if their distance is at most m, yields better classification performance. This, however, makes the problem computationally much more complex. Known algorithms to compute this similarity have computational complexity that render them applicable only for small values of k and m.
Neural Information Processing Systems
Oct-4-2024, 08:11:48 GMT
- Country:
- Asia
- Middle East > Jordan (0.04)
- Pakistan > Punjab
- Lahore Division > Lahore (0.05)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- California > Los Angeles County
- Long Beach (0.04)
- New Jersey (0.04)
- New York (0.04)
- California > Los Angeles County
- Asia
- Industry:
- Technology: