Sequence and Tree Kernels with Statistical Feature Mining

Dec-31-2006–Neural Information Processing Systems

This paper proposes a new approach to feature selection based on a statistical featuremining technique for sequence and tree kernels. Since natural language data take discrete structures, convolution kernels, such as sequence and tree kernels, are advantageous for both the concept and accuracy of many natural language processing tasks. However, experiments haveshown that the best results can only be achieved when limited small substructures are dealt with by these kernels. This paper discusses thisissue of convolution kernels and then proposes a statistical feature selection that enable us to use larger substructures effectively. The proposed method, in order to execute efficiently, can be embedded into an original kernel calculation process by using substructure mining algorithms.Experiments on real NLP tasks confirm the problem in the conventional method and compare the performance of a conventional method to that of the proposed method.

artificial intelligence, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Dec-31-2006

Conferences PDF

Add feedback

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
Sequence and Tree Kernels with Statistical Feature Mining
Sequence and Tree Kernels with Statistical Feature Mining

Similar Docs Excel Report more

Title	Similarity	Source
None found