IBM TJ Watson Research Center
Optimal Sequential Drilling for Hydrocarbon Field Development Planning
Torrado, Ruben Rodriguez (Repsol S.A.) | Rios, Jesus (IBM TJ Watson Research Center) | Tesauro, Gerald (IBM TJ Watson Research Center)
We present a novel approach for planning the development of hydrocarbon fields, taking into account the sequential nature of well drilling decisions and the possibility to react to future information. In a dynamic fashion, we want to optimally decide where to drill each well conditional on every possible piece of information that could be obtained from previous wells. We formulate this sequential drilling optimization problem as a POMDP, and propose an algorithm to search for an optimal drilling policy. We show that our new approach leads to better results compared to the current standard in the oil and gas (O&G) industry.
How Does the Data Sampling Strategy Impact the Discovery of Information Diffusion in Social Media?
Choudhury, Munmun De (Arizona State University) | Lin, Yu-Ru (Arizona State University) | Sundaram, Hari (Arizona State University) | Candan, Kasim Selcuk (Arizona State University) | Xie, Lexing (IBM TJ Watson Research Center) | Kelliher, Aisling (Arizona State University)
Platforms such as Twitter have provided researchers with ample opportunities to analytically study social phenomena. There are however, significant computational challenges due to the enormous rate of production of new information: researchers are therefore, often forced to analyze a judiciously selected “sample” of the data. Like other social media phenomena, information diffusion is a social process–it is affected by user context, and topic, in addition to the graph topology. This paper studies the impact of different attribute and topology based sampling strategies on the discovery of an important social media phenomena–information diffusion. We examine several widely-adopted sampling methods that select nodes based on attribute (random, location, and activity) and topology (forest fire) as well as study the impact of attribute based seed selection on topology based sampling. Then we develop a series of metrics for evaluating the quality of the sample, based on user activity (e.g. volume, number of seeds), topological (e.g. reach, spread) and temporal characteristics (e.g. rate). We additionally correlate the diffusion volume metric with two external variables–search and news trends. Our experiments reveal that for small sample sizes (30%), a sample that incorporates both topology and user context (e.g. location, activity) can improve on naive methods by a significant margin of ~15-20%.