mahoney
Optimal Subsampling with Influence Functions
As datasets grow, the question arises of how best to handle them. Computational platforms such as Spark [28] and Ray [23] help process large datasets once a desired model is chosen, but simply using less data can be a faster solution for exploratory data modeling, rapid prototyping, or other tasks where the accuracy obtainable from the full dataset is not needed.