Africa
Who’s Calling? Demographics of Mobile Phone Use in Rwanda
Blumenstock, Joshua Evan (University of California, Berkeley) | Gillick, Dan (University of California, Berkeley) | Eagle, Nathan (Santa Fe Institute)
But whereas in the general Rwandan populace males tend Despite the increasing ubiquity of mobile phones in the developing to be much better educated (76.3% of males are literate, but world, remarkably little is known about the structure only 64.7% of females), among mobile phone users it is the and demographics of the mobile phone market. While a women who achieve higher levels of education: the median few qualitative studies have detailed social norms of phone woman completes secondary school, while the median man use in specific communities (Donner 2007; Burrell 2009), does not (t 4.79). Table 1 shows a few statistics on asset and a handful of quantitative researchers have begun to analyze ownership, with associated sampling error.
Using Data Mining to Combat Infrastructure Inefficiencies: The Case of Predicting Nonpayment for Ethiopian Telecom
Yigzaw, Mariye (Addis Ababa University) | Hill, Shawndra (University of Pennsylvania) | Banser, Anita (University of Pennsylvania) | Lessa, Lemma (Addis Ababa University)
Data mining and machine learning technologies for business applications have evolved over the past two decades, and are regularly applied in contemporary organizations to everything from manufacturing to online advertising in fields ranging from health care to motor racing. Unfortunately, data mining techniques are not applied as often to problems in the developing world. Despite the fact that some industries, such as banks, airlines, courts, and telecommunications firms, necessitate data storage as part of their business process. We argue that data mining could be used to reduce infrastructure inefficiencies, which is one of the largest problems faced by Africa. We demonstrate that we can potentially reduce the infrastructure inefficiency of the Ethiopian telecommunications industry by ranking customers according to their likelihood of nonpayment using a data mining approach.
Traffic Flow Monitoring in Crowded Cities
Quinn, John Alexander (Makerere University) | Nakibuule, Rose (Makerere University)
Traffic monitoring systems usually make assumptions about the movement of vehicles, such as that they drive in dedicated lanes, and that those lanes rarely include non-vehicle clutter. Urban settings within developing countries often present extremely chaotic traffic scenarios which make these assumptions unrealistic. We show how a standard approach to traffic monitoring can be made more robust by using probabilistic inference, and in such a way that we bypass the need for vehicle segmentation. Instead of tracking individual vehicles but treat a lane of traffic as a fluid and estimate the rate of flow. Our modelling of uncertainty allows us to accurately monitor traffic flow even in the presence of substantial clutter.
People, Quakes, and Communications: Inferences from Call Dynamics about a Seismic Event and its Influences on a Population
Kapoor, Ashish (Microsoft Research) | Eagle, Nathan (The Santa Fe Institute) | Horvitz, Eric (Microsoft Research)
We explore the prospect of inferring the epicenter and influences of seismic activity from changes in background phone communication activities logged at cell towers. In particular, we explore the perturbations in Rwandan call data invoked by an earthquake in February 2008 centered in the Lac Kivu region of the Democratic Republic of the Congo. Beyond the initial seismic event, we investigate the challenge of assessing the distribution of the persistence of needs over geographic regions, using the persistence of call anomalies after the earthquake as a proxy for lasting influences and the potential need for assistance. We also infer uncertainties in the inferences and consider the prospect of identifying the value of surveying the areas so that surveillance resources can be best triaged.
Reality Mining Africa
Hill, Shawndra (University of Pennsylvania) | Banser, Anita (University of Pennsylvania) | Berhan, Getachew (Addis Ababa University) | Eagle, Nathan (Santa Fe Institute)
Cellular phones can be used as mobile sensors, continuously logging users’ behavior including movement, communication and proximity to others. While it is well understood that data generated from mobile phones includes a record of phone calls, there are also more sophisticated data types, such as Bluetooth or cell tower proximity logging, which reveal movement patterns and day-to-day human interactions. We explore the possibility of using mobile phone data to compare movement and communication patterns across cultures. The goal of this proof-of-concept study is to quantify behavior in order to compare different populations. We compare our ability to predict future calling behavior and movement patterns from the cellular phone data of subjects in two distinct groups: a set of university students at MIT in the United States and the University of Nairobi in Kenya. In addition, we show how Bluetooth data may be used to estimate the diffusion of an airborne pathogen outbreak in the different populations.
A Gender-Centric Analysis of Calling Behavior in a Developing Economy Using Call Detail Records
Frias-Martinez, Vanessa (Telefonica Research, Madrid) | Frias-Martinez, Enrique (Telefonica Research, Madrid) | Oliver, Nuria (Telefonica Research, Madrid)
The gender divide in the access to technology in developing economies makes gender characterization and automatic gender identification two of the most critical needs for improving cell phone-based services. Gender identification has been typically solved using voice or image processing. However, such techniques cannot be applied to cell phone networks mostly due to privacy concerns. In this paper, we present a study aimed at characterizing and automatically identifying the gender of a cell phone user in a developing economy based on behavioral, social and mobility variables. Our contributions are twofold: (1) understanding the role that gender plays on phone usage, and (2) evaluating common machine learning approaches for gender identification. The analysis was carried out using the encrypted CDRs (Call Detail Records) of approximately 10,000 users from a developing economy, whose gender was known a priori. Our results indicate that behavioral and social variables, including the number of input/output calls and the in degree/out degree of the social network, reveal statistically significant differences between male and female callers. Finally, we propose a new gender identification algorithm that can achieve classification rates of up to 80% when the percentage of predicted instances is reduced.
Contextual Information Portals
Chen, Jay Chen (New York University) | Karthik, Trishank (New York University) | Subramanian, Lakshminarayanan (New York University)
There is a wealth of information on the Web about any number of topics. Many communities in developing regions are often interested in information relating to specific topics. For example, health workers are interested in specific medical information regarding epidemic diseases in their region while teachers and students are interested in educational information relating to their curriculum. This paper presents the design of Contextual Information Portals, searchable information portals that contain a vertical slice of the Web about arbitrary topics tailored to a specific context. Contextual portals are particularly useful for communities that lack Internet or Web access or in regions with very poor network connectivity. This paper outlines the design space for constructing contextual information portals and describes the key technical challenges involved. We have implemented a proof-of-concept of our ideas, and performed an initial evaluation on a variety of topics relating to epidemiology, agriculture, and education.
An Approach for Mining Accumulated Crop Cultivation Problems and their Solutions
El-Beltagy, Samhaa R. (Cairo University) | Rafea, Ahmed (American University in Cairo) | Mabrouk, Said (The Central Lab for Agricultural Expert Systems) | Rafea, Mahmoud (The Central Lab for Agricultural Expert Systems)
This paper presents an approach for mining agricultural problems that have been accumulated in a textual database over a period of 5 years. The problems, which are accompanied by their solutions, offer a wealth of knowledge that can be used by decision makers, researchers, and farmers alike. However, this wealth of knowledge can not be unlocked without a) representing these problems in a structured format, and b) applying algorithms that can summarize and analyze this information. Towards the achievement of the first goal, a multi-faceted object extraction methodology is presented, and for the achievement of the second, association rules are employed. As a proof of concept, the tool was applied of a set of weed problems. The presented methodology can be modified to work with any help and support textual database where both problems and their solutions are present.
Mining Road Traffic Accident Data to Improve Safety: Role of Road-Related Factors on Accident Severity in Ethiopia
Beshah, Tibebe (Addis Ababa University) | Hill, Shawndra (University of Pennsylvania)
Road traffic accidents (RTAs) are a major public health concern, resulting in an estimated 1.2 million deaths and 50 million injuries worldwide each year. In the developing world, RTAs are among the leading cause of death and injury; Ethiopia in particular experiences the highest rate of such accidents. Thus, methods to reduce accident severity are of great interest to traffic agencies and the public at large. In this work, we applied data mining technologies to link recorded road characteristics to accident severity in Ethiopia, and developed a set of rules that could be used by the Ethiopian Traffic Agency to improve safety.
On Action Theory Change
As historically acknowledged in the Reasoning about Actions and Change community, intuitiveness of a logical domain description cannot be fully automated. Moreover, like any other logical theory, action theories may also evolve, and thus knowledge engineers need revision methods to help in accommodating new incoming information about the behavior of actions in an adequate manner. The present work is about changing action domain descriptions in multimodal logic. Its contribution is threefold: first we revisit the semantics of action theory contraction proposed in previous work, giving more robust operators that express minimal change based on a notion of distance between Kripke-models. Second we give algorithms for syntactical action theory contraction and establish their correctness with respect to our semantics for those action theories that satisfy a principle of modularity investigated in previous work. Since modularity can be ensured for every action theory and, as we show here, needs to be computed at most once during the evolution of a domain description, it does not represent a limitation at all to the method here studied. Finally we state AGM-like postulates for action theory contraction and assess the behavior of our operators with respect to them. Moreover, we also address the revision counterpart of action theory change, showing that it benefits from our semantics for contraction.