AITopics

1609.00661

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > New Jersey > Essex County > Newark (0.04)
Asia > China > Sichuan Province > Chengdu (0.04)
(15 more...)

Genre: Research Report > New Finding (0.46)

Industry: Telecommunications (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Klaassen, Sven, Kueck, Jannis, Spindler, Martin

Transformation Models in High-Dimensions

arXiv.org Machine LearningDec-20-2017

Transformation models are a very important tool for applied statisticians and econometricians. In many applications, the dependent variable is transformed so that homogeneity or normal distribution of the error holds. In this paper, we analyze transformation models in a high-dimensional setting, where the set of potential covariates is large. We propose an estimator for the transformation parameter and we show that it is asymptotically normally distributed using an orthogonalized moment condition where the nuisance functions depend on the target parameter. In a simulation study, we show that the proposed estimator works well in small samples. A common practice in labor economics is to transform wage with the log-function. In this study, we test if this transformation holds in CPS data from the United States.

artificial intelligence, information fusion, transformation model, (15 more...)

1712.07364

Country:

Europe > Germany > Hamburg (0.04)
North America > United States > New York (0.04)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Data Science > Data Integration (0.83)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.83)

@machinelearnbotDec-16-2017, 15:16:18 GMT

data-integration-is-one-thing-the-cloud-makes-worse.html

One, enterprises have too many decisions to make. Two, it's difficult to find success with complex data integration. Those are the two main excuses I hear these days, as enterprises move to the cloud. Whatever the justification, the lack of attention to data integration is beginning to cause some real damage. Enterprises have so much coming at them that they don't think about every approach and technology that they need to think about.

artificial intelligence, data, information fusion, (11 more...)

Industry: Information Technology (0.39)

Technology:

Information Technology > Data Science > Data Integration (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)

@machinelearnbotDec-15-2017, 11:40:30 GMT

10 Best Big Data Management Tools

The revenue from data management tools is going to increase by 50% to around $187 billion by the year 2019. By using data management tools, you get to utilize a lot of built in functions rather than having to design the same from scratch. 4. Tools are classified by the stage of Big Data analytics process: 1. ETL (data preparation) 2. Data analysis (actual number crunching) 3. Data visualization (transforming numbers to actionable insights) 5. In Data analytics, ETL is a process in which Data is collated from the source system and transferred to a Data warehouse. It is the primary step in the Data analytics chain. Following are the top tools for ETL. 6. IBM Infosphere Information Server, with its massive parallel processing capabilities can deliver a hugely scalable and flexible platform to process multiple varieties of Data volumes.

data management tool, data preparation, visualization, (10 more...)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.79)

@machinelearnbotDec-8-2017, 22:31:44 GMT

ETL Frameworks and why not just use a GPL (Python, Node, Scala)? • r/datascience

Welcome to /r/datascience, a place to discuss data, data science, becoming a data scientist, data munging, and more! If you're brand new to this subreddit and want to ask a question, please use the search functionality first before posting. This way you can search if someone has already asked your question.

artificial intelligence, datascience, social media, (7 more...)

Industry: Media > News (0.40)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.40)

Zhang, Chihao, Zhang, Shihua

Bayesian Joint Matrix Decomposition for Data Integration with Heterogeneous Noise

arXiv.org Machine LearningDec-8-2017

Matrix decomposition is a popular and fundamental approach in machine learning and data mining. It has been successfully applied into various fields. Most matrix decomposition methods focus on decomposing a data matrix from one single source. However, it is common that data are from different sources with heterogeneous noise. A few of matrix decomposition methods have been extended for such multi-view data integration and pattern discovery. While only few methods were designed to consider the heterogeneity of noise in such multi-view data for data integration explicitly. To this end, we propose a joint matrix decomposition framework (BJMD), which models the heterogeneity of noise by Gaussian distribution in a Bayesian framework. We develop two algorithms to solve this model: one is a variational Bayesian inference algorithm, which makes full use of the posterior distribution; and another is a maximum a posterior algorithm, which is more scalable and can be easily paralleled. Extensive experiments on synthetic and real-world datasets demonstrate that BJMD considering the heterogeneity of noise is superior or competitive to the state-of-the-art methods.

artificial intelligence, bayesian inference, machine learning, (20 more...)

1712.03337

Country:

Asia > China > Beijing > Beijing (0.04)
South America > Paraguay > Asunción > Asunción (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.81)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.66)

@machinelearnbotDec-6-2017, 22:06:28 GMT

5 Predictions About the Future of Machine Learning - Talend Real-Time Open Source Data Integration Software

Machine Learning is currently one of the hottest topics in IT. The reason stems from the seemingly unlimited use cases where machine learning can play from fraud detection to self-driving cars, and even identifying your'gold card' customers to price prediction. But what is the future for this fascinating field? What will be the next best thing? Where will we be in ten years time?

artificial intelligence, information fusion, machine learning, (10 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.57)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.40)

@machinelearnbotNov-27-2017, 17:30:17 GMT

80/20 Rule of Data Science: Hear How Fast, Easy Data Integration Can Break It

At this year's Strata Data Conference in New York City, Syncsort's Paige Roberts sat down with John Myers (@johnlmyers44) of Enterprise Management Associates to discuss what he sees in the evolving Big Data landscape. In this final blog in the three-part interview, we'll discuss the 80/20 rule of data science which points out that most data scientists spend 80% of their time getting data ready for analysis, rather than doing what they do best. In case you missed the earlier parts of our interview… In the first part of the discussion, Myers pointed out a shift away from technology and toward business value and some advantages of in-memory processing for machine learning. In part two, we talked about how to deal with cultural pushback against machine learning applications and how to get machines and people working together to take advantage of the strengths of each. Most of what a scientist has to do is you get the right data together so they can apply to their model, or to manipulate the data that they have.

artificial intelligence, data mining, machine learning, (6 more...)

Country: North America > United States > New York (0.25)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.61)
Information Technology > Artificial Intelligence > Machine Learning (0.47)
Information Technology > Data Science > Data Integration (0.41)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.41)

#artificialintelligenceNov-25-2017, 16:30:19 GMT

Using Neural Networks with Talend Data Integration and ESB

Many times during Data Integration projects we have situations where we have to analyze the data in order to come up with acceptance criteria for it. In a lot of cases, this is pretty straight forward and can be easily written into simple rule-based logic. But in some situations, it is not so cut and dry. In these situations a lot of people will generate rule of thumb logic which will isolate certain rows to be double-checked by a human. It is time consuming and requires human intervention, but it works. However, in a lot of those situations we can use Neural Networks to do that job for us.

artificial intelligence, information fusion, machine learning, (17 more...)

#artificialintelligence

Country:

Oceania > Australia (0.04)
North America > United States > California > Orange County > Irvine (0.04)
Europe (0.04)
(2 more...)

Industry:

Leisure & Entertainment > Games (0.30)
Banking & Finance (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.61)

Ouyang, Ruofei, Low, Kian Hsiang

Gaussian Process Decentralized Data Fusion Meets Transfer Learning in Large-Scale Distributed Cooperative Perception

arXiv.org Machine LearningNov-16-2017

This paper presents novel Gaussian process decentralized data fusion algorithms exploiting the notion of agent-centric support sets for distributed cooperative perception of large-scale environmental phenomena. To overcome the limitations of scale in existing works, our proposed algorithms allow every mobile sensing agent to choose a different support set and dynamically switch to another during execution for encapsulating its own data into a local summary that, perhaps surprisingly, can still be assimilated with the other agents' local summaries (i.e., based on their current choices of support sets) into a globally consistent summary to be used for predicting the phenomenon. To achieve this, we propose a novel transfer learning mechanism for a team of agents capable of sharing and transferring information encapsulated in a summary based on a support set to that utilizing a different support set with some loss that can be theoretically bounded and analyzed. To alleviate the issue of information loss accumulating over multiple instances of transfer learning, we propose a new information sharing mechanism to be incorporated into our algorithms in order to achieve memory-efficient lazy transfer learning. Empirical evaluation on real-world datasets show that our algorithms outperform the state-of-the-art methods.

artificial intelligence, information fusion, machine learning, (19 more...)

1711.06064

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (1.00)