Databases


Data Virtualization: Unlocking Data for AI and Machine Learning

#artificialintelligence

Hybrid Execution allows you to "push" queries to a remote system, such as to SQL Server, and access the referential data. However, one can imagine a use case where lots of ETL processing happens in HDInsight clusters and the structured results are published to SQL Server for downstream consumption (for instance, by reporting tools). Note the linear increase in execution time with SQL Server only (blue line) versus when HDInsight is used with SQL Server to scale out the query execution (orange and grey lines). With much larger real-world datasets in SQL Server, which typically runs multiple queries competing for resources, more dramatic performance gains can be expected.


Optimization tips and tricks on Azure SQL Server for Machine Learning Services

#artificialintelligence

By using memory-optimized tables, resume features are stored in main memory and disk IO could be significantly reduced. If the database engine server detects more than 8 physical cores per NUMA node or socket, it will automatically create soft-NUMA nodes that ideally contain 8 cores. We then further created 4 SQL resource pools and 4 external resource pools [7] to specify the CPU affinity of using the same set of CPUs in each node. We can create resource governance for R services on SQL Server [8] by routing those scoring batches into different workload groups (Figure.


Predicting Hospital Length of Stay using SQL Server R Services

#artificialintelligence

Last week, my Microsoft colleagues Bharath Sankaranarayan and Carl Saroufim presented a live webinar showing how you can predict a patient's length of stay at a hospital using SQL Server R Services. The webinar is based on the Machine Learning Solution Template Predicting Length of Stay in Hospitals, which we covered here on the blog back in March. The webinar will take you through the process of using Microsoft R Server (included in the VM) to import the data and upload it to SQL Server. To help the administration manage hospital resources, the Hospital Length of Stay solution estimates the number of days the patient is expected to stay before discharge.


Optimization tips and tricks on Azure SQL Server for Machine Learning Services

#artificialintelligence

By using memory-optimized tables, resume features are stored in main memory and disk IO could be significantly reduced. If the database engine server detects more than 8 physical cores per NUMA node or socket, it will automatically create soft-NUMA nodes that ideally contain 8 cores. We then further created 4 SQL resource pools and 4 external resource pools [7] to specify the CPU affinity of using the same set of CPUs in each node. We can create resource governance for R services on SQL Server [8] by routing those scoring batches into different workload groups (Figure.


R and Python drive SQL Server 2017 into machine learning 7wData

#artificialintelligence

But it was SQL Server's new machine learning tools that grabbed my attention. SQL Server 2016 added support for embedded R code, and SQL Server 2017 continues that evolution by improving its support for R and adding Python. R remains clearly focused on statistical analysis, while Python adds statistical tools to a popular and flexible scripting language. With Python inside SQL Server, you can bring existing data and code together.


Python power comes to SQL Server 2017

#artificialintelligence

The most conventional application of Python with SQL Server is to execute Python scripts as normal, with SQL Server as a data source. Microsoft has also made it possible to embed Python code directly in SQL Server databases by including the code as a T-SQL stored procedure. These behaviors, and the RevoScalePy package, are essentially Python versions of features Microsoft built for SQL Server back when it integrated the R language into the database. Installation also includes packages from the Anaconda distribution of Python, widely used in data science, and Microsoft's RevoScalePy package, a set of data analysis functions that can take advantage of SQL Server's in-memory and column-store index features.


SQL Server 2017 (CTP 2.0)- 'first RDBMS with built-in AI'

#artificialintelligence

Community Technology Preview (CTP) 2.0 is the first production-quality preview of SQL Server 2017, and it is available on both Windows and Linux. In this preview, Microsoft added a number of new capabilities, including the ability to run advanced analytics using Python in a parallelized and highly scalable way, the ability to store and analyze graph data, and other capabilities that help you manage SQL Server for high performance and uptime, including the Adaptive Query Processing family of intelligent database features and resumable online indexing.


Protecting web users' privacy

MIT News

At the USENIX Symposium on Networked Systems Design and Implementation next week, researchers from MIT's Computer Science and Artificial Intelligence Laboratory and Stanford University will present a new encryption system that disguises users' database queries so that they reveal no private information. "The canonical example behind this line of work was public patent databases," says Frank Wang, an MIT graduate student in electrical engineering and computer science and first author on the conference paper. Goldwasser, in turn, is one of Wang's co-authors on the new paper, along with Vinod Vaikuntanathan, an MIT associate professor of electrical engineering and computer science (EECS); Catherine Yun, an EECS graduate student; and Matei Zaharia, an assistant professor of computer science at Stanford. Through a clever combination of software processes and AES encryption, the MIT and Stanford researchers were able to make Splinter 2.5 times as efficient as it would be if it used the AES circuits alone.


The Python Tutorial -- Python 3.6.0 documentation

#artificialintelligence

The Python interpreter and the extensive standard library are freely available in source or binary form for all major platforms from the Python Web site, https://www.python.org/, The same site also contains distributions of and pointers to many free third party Python modules, programs and tools, and additional documentation. For a description of standard objects and modules, see The Python Standard Library. The Python Language Reference gives a more formal definition of the language. After reading it, you will be able to read and write Python modules and programs, and you will be ready to learn more about the various Python library modules described in The Python Standard Library.


Verizon, Yahoo Agree to Reduce Buyout Price to $4.55 Billion

#artificialintelligence

DAILY VIDEO: Verizon negotiates down to $4.55B for Yahoo transaction; Congressional staffers see Russian hacking, FISA vote as priorities; IBM launches machine learning for z System mainframes; and there's more. DAILY VIDEO: White House withholds cyber-security order for further revision; Cortana to help Windows... DAILY VIDEO: Kaspersky discovers new malware designed to stealthily steal data; Microsoft to shield... DAILY VIDEO: Federal court says Google must turn over data in foreign servers; Cisco report: mobile... DAILY VIDEO: Windows 10 Cloud leak points to potential Chrome OS fighter; TiVo's analytics pinpoint... DAILY VIDEO: Google drops hands free mobile payment app; Microsoft Outlook on iOS welcomes Evernote... DAILY VIDEO: Snap Inc. makes it official, will go public next month; Microsoft sharpens Edge browser... DAILY VIDEO: Japan's supreme court backs Google in'right to be forgotten' case; HPE acquires... DAILY VIDEO: Flock adds "fake news" detector to collaboration platform; Google upgrades security... Dell's latest Intel-based PowerEdge servers bring new levels of operational efficiency and... If your IT talent is spending too much time... Dell PowerEdge servers powered by Intel processors include a number of innovative features designed... Agility is a competitive edge that Dell's PowerEdge servers can deliver thanks to dense, storage... Today's topics include reports that Verizon has negotiated a $250 million reduction in the price to acquire Yahoo, congressional committee staffers say investigations of Russian hacking and a vote to reauthorize FISA will keep Congress buy this year, IBM's launch of a machine learning platform for z System mainframes and the start of beta testing on Google's Cloud Spanner database service. Verizon has reportedly negotiated a reduction of price for buying the beleaguered web services company Yahoo from $4.8 billion to $4.55 billion the Web media in the wake of Yahoo's belated disclosure that the data of more than 1 billion Yahoo users were breached in a 2013 cyber-attack.