Algorithmic and Statistical Perspectives on Large-Scale Data Analysis

Oct-8-2010–arXiv.org Machine Learning

In recent years, ideas from statistics and scientific computing have begun to interact in increasingly sophisticated and fruitful ways with ideas from computer science and the theory of algorithms to aid in the development of improved worst-case algorithms that are useful for large-scale scientific and Internet data analysis problems. In this chapter, I will describe two recent examples---one having to do with selecting good columns or features from a (DNA Single Nucleotide Polymorphism) data matrix, and the other having to do with selecting good clusters or communities from a data graph (representing a social or information network)---that drew on ideas from both areas and that may serve as a model for exploiting complementary algorithmic and statistical perspectives in order to solve applied large-scale data analysis problems.

bioinformatics, data mining, machine learning, (21 more...)

arXiv.org Machine Learning

Oct-8-2010

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.93)

Genre:
- Research Report (0.82)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining (1.00)
  - Information Management > Search (0.93)
  - Communications > Networks (0.67)
  - Biomedical Informatics > Translational Bioinformatics (0.66)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language (0.67)
    - Machine Learning > Statistical Learning
      - Clustering (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found