Scientific Discovery
Opensource & Machine Learning for GDPR Data Discovery
GDPR (EU General Data Protection Regulation) is around the corner and bigger companies are getting ready to adopt it as they already know what kind of penalties come from non-compliance. It replaces replaces the Data Protection Directive 95/46/EC and was designed to harmonize data privacy laws across Europe and it is the biggest change on data privacy regulation in 20 years for Europe. While GDPR main elements can be a little tricky to understand, one thing is clear as sensitive Data Discovery is mandatory, so you can find the Personal and sensitive information on your data repositories, that can be almost everything from databases to files. Basically, we focus our data discovery on three main areas: column discovery, data discovery and file discovery. Column discovery is easy to understand, based on specific keywords or sentences we find column names on databases and match it with possible sensitive data.
First Horned Dinosaur Remains Found In North America In Chance Discovery From Mississippi
A large body of water separated the present-day North American continent into two halves during most of the late Cretaceous Period, between 95 and 66 million years ago, and because of the seaway linking the Arctic Ocean to the Gulf of Mexico, land animals on one side could not make it to the other, and would therefore evolve independently. One such genus of animals, trapped on the western half, was the horned dinosaur, whose remains have been found in western North America, as well as Asia. However, the discovery of a tooth in Mississippi provides evidence that horned dinosaurs were present in eastern North America as well. The fossil, dated to between 66 and 68 million years ago, is from a dinosaur closely related to Triceratops, the most well-known genus of horned dinosaurs. The find also suggests that there could have existed some land connection between the two land masses thought to be completely separate at the time. This is a tooth of a ceratopsid horned dinosaur from Mississippi.
IoT: New Paradigm for Connected Government @ThingsExpo #AI #DX #IoT
The Internet of Things (IoT) is an uninterrupted connected network of embedded objects/ devices with identifiers without any human intervention using standard and communication protocol. It provides encryption, authorization and identification with different device protocols like MQTT, STOMP or AMQP to securely move data from one network to another. IoT in connected Government helps to deliver better citizen services and provides transparency. It improves the employee productivity and cost savings. It helps in delivering contextual and personalized service to citizens and enhances the security and improves the quality of life.
IoT: A New Paradigm for Connected Government @ThingsExpo #AI #ML #IoT #DX
The Internet of Things (IoT) is an uninterrupted connected network of embedded objects/ devices with identifiers without any human intervention using standard and communication protocol. It provides encryption, authorization and identification with different device protocols like MQTT, STOMP or AMQP to securely move data from one network to another. IoT in connected Government helps to deliver better citizen services and provides transparency. It improves the employee productivity and cost savings. It helps in delivering contextual and personalized service to citizens and enhances the security and improves the quality of life.
IoT: A New Paradigm for Connected Government @ThingsExpo #AI #ML #IoT #M2M
The Internet of Things (IoT) is an uninterrupted connected network of embedded objects/ devices with identifiers without any human intervention using standard and communication protocol. It provides encryption, authorization and identification with different device protocols like MQTT, STOMP or AMQP to securely move data from one network to another. IoT in connected Government helps to deliver better citizen services and provides transparency. It improves the employee productivity and cost savings. It helps in delivering contextual and personalized service to citizens and enhances the security and improves the quality of life.
Seth Meyers makes a major scientific discovery about President Trump
Today in Entertainment: Seth Meyers finds a new law of Trump physics; Jonathan Demme brought out performers' best DMX cancels L.A. performance due to'medical emergency' DMX cancels L.A. performance due to'medical emergency' The science on the Trump administration is a little closer to settled. "Late Night with Seth Meyers" offered a deep dive Wednesday night into the administration's apparent fondness for executive orders -- the president has signed 30 so far -- and highlighted how Trump the candidate was less enamored of the practice than Trump the president appears to be. "It is at this point like a law of physics," Meyers said at the beginning of one of his "A Closer Look" segments. "For every Trump action, there's an equal and opposite Trump clip." Meyers amusingly applied the same science to New Jersey Gov. Chris Christie's stance on executive orders as well.
A Flexible Framework for Hypothesis Testing in High-dimensions
Javanmard, Adel, Lee, Jason D.
Hypothesis testing in the linear regression model is a fundamental statistical problem. We consider linear regression in the high-dimensional regime where the number of parameters exceeds the number of samples ($p> n$) and assume that the high-dimensional parameters vector is $s_0$ sparse. We develop a general and flexible $\ell_\infty$ projection statistic for hypothesis testing in this model. Our framework encompasses testing whether the parameter lies in a convex cone, testing the signal strength, testing arbitrary functionals of the parameter, and testing adaptive hypothesis. We show that the proposed procedure controls the type I error under the standard assumption of $s_0 (\log p)/\sqrt{n}\to 0$, and also analyze the power of the procedure. Our numerical experiments confirms our theoretical findings and demonstrate that we control false positive rate (type I error) near the nominal level, and have high power.
Can Scientific Discovery Be Automated?
Science is in the midst of a data crisis. Last year, there were more than 1.2 million new papers published in the biomedical sciences alone, bringing the total number of peer-reviewed biomedical papers to over 26 million. However, the average scientist reads only about 250 papers a year. Meanwhile, the quality of the scientific literature has been in decline. Some recent studies found that the majority of biomedical papers were irreproducible.
Data-adaptive statistics for multiple hypothesis testing in high-dimensional settings
Cai, Weixin, Hejazi, Nima S., Hubbard, Alan E.
Current statistical inference problems in areas like astronomy, genomics, and marketing routinely involve the simultaneous testing of thousands -- even millions -- of null hypotheses. For high-dimensional multivariate distributions, these hypotheses may concern a wide range of parameters, with complex and unknown dependence structures among variables. In analyzing such hypothesis testing procedures, gains in efficiency and power can be achieved by performing variable reduction on the set of hypotheses prior to testing. We present in this paper an approach using data-adaptive multiple testing that serves exactly this purpose. This approach applies data mining techniques to screen the full set of covariates on equally sized partitions of the whole sample via cross-validation. This generalized screening procedure is used to create average ranks for covariates, which are then used to generate a reduced (sub)set of hypotheses, from which we compute test statistics that are subsequently subjected to standard multiple testing corrections. The principal advantage of this methodology lies in its providing valid statistical inference without the \textit{a priori} specifying which hypotheses will be tested. Here, we present the theoretical details of this approach, confirm its validity via a simulation study, and exemplify its use by applying it to the analysis of data on microRNA differential expression.
Importance of Hypothesis Testing in Quality Management
Essentially good hypotheses lead decision-makers like you to new and better ways to achieve your business goals. When you need to make decisions such as how much you should spend on advertising or what effect a price increase will have your customer base, it's easy to make wild assumptions or get lost in analysis paralysis. A business hypothesis solves this problem, because, at the start, it's based on some foundational information. In all of science, hypotheses are grounded in theory. Theory tells you what you can generally expect from a certain line of inquiry.