Comparison of Classification Methods for Very High-Dimensional Data in Sparse Random Projection Representation

Dec-18-2019–arXiv.org Machine Learning

Machine learning is a mature scientific field with lots of theoretical results, established algorithms and processes that address various supervised and unsupervised problems using the provided data. In theoretical research, such data is generated in a convenient way, or various methods are compared on standard benchmark problems - where data samples are represented as dense real-valued vectors of fixed and relatively low length. Practical applications represented by such standard datasets can successfully be solved by one of a myriad of existing machine learning methods and their implementations. However, the most impact of machine learning is currently in the big data field with the problems that are well explained in natural language ("Find malicious files", "Is that website safe to browse?") but are hard to encode numerically. Data samples in these problems have distinct features coming from a huge unordered set of possible features. Same approach can cover a frequent case of missing feature values [10, 28].

dataset, projection, sparse data, (14 more...)

arXiv.org Machine Learning

Dec-18-2019

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > New York
    - New York County > New York City (0.04)
  - Canada > Quebec
    - Montreal (0.04)
- Europe
  - Netherlands > South Holland
    - Dordrecht (0.04)
  - Finland > Uusimaa
    - Helsinki (0.04)
- Asia > Middle East
  - Israel > Jerusalem District > Jerusalem (0.04)

Genre:
- Research Report (1.00)

Industry:
- Information Technology > Security & Privacy (0.96)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Neural Networks (1.00)
  - Performance Analysis > Accuracy (0.70)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found