Thursday, February 05, 2015

Microsoft Azure Machine Learning–the power to predict

Microsoft Azure Machine Learning provides Machine Learning as a Service (on Microsoft Azure) and allows you to make your own applications more intelligent. Microsoft Azure Machine Learning was initially started as as an incubation project in Microsoft Research (codename Passau) and is part of the overall Microsoft Data Platform.
The best definition for Machine Learning – in my opinion – is from the excellent book “Introduction to Machine Learning (MIT Press 2014, Ethem Alpaydin)” (Use it as a reference – this is not an easy “how to” book)
The goal of machine learning is to program computers to use example data or past experience to solve a given problem.
In general when we want to solve a problem on a computer, we need an algorithm to transform using a set of instruction into an output. Unfortunately for some problems we do not know how to program such algorithms – such as for e-mail spam detection or predicting customer behavior. In most cases we have the input and output available e.g. a set of e-mails for which some are marked as spam. Based on this data, we would like a computer (or machine) to automatically extract the algorithm necessary to perform the classification. The algorithm does not need to be perfect but needs to be a good and useful approximation.
The term machine learning is tightly coupled to the domain of analytics (or data science – see Data Scientist: the sexiest job of the 21st century ). Analytics is concerned with the discovery and extraction of useful business patterns or mathematical decision models from a specific data set. For this a number techniques can be used, depending on the practitioners background they will probably favor a technique from their respective domain:
  • Regression, General Linear Models (GLMS), decision trees, etc … (originated out of the statistics domain)
  • Machine learning algorithms such as support vector machines, neural networks, Bayesian methods, … (originated out of the computer science domain)
If we focus specifically on machine learning we make a distinction between supervised learning where we try to find a mapping between a set of input variables and a specific output variable using a set of values to train a specific model and unsupervised learning where we try to find patterns in the input data.

But why should you care about machine learning? I think the picture below shows you how the focus is shifting from traditional reporting (hindsight) to more advanced predictive and prescriptive analytics (foresight) which will provide business with more added value but also requires business intelligence specialist new competencies such as machine learning and data mining. Examples across industries vary but in general predictive analytics has the potential to change the way how businesses make decisions (I will take a look a more in depth definitely pick up Predictive Analytics – The power to predict who will click, buy, lie or die from Eric Siegel)



Microsoft Azure Machine Learning distinguishes itself from other platforms and tools by a number of different characteristics:
  • Allows you to jointly build predictive models from anywhere in the world using only a web browser by making use of visual composition canvas  (called Machine Learning Studio) using modules without requiring you to write code (although you can use R code snippets if you want). You can start quickly from existing sample experiments/models or you can share your own data experiments.
  • Collaborative work together with anyone from anywhere using just your browser.
  • Available as a cloud service, eliminating upfront costs fro hardware resources.
  • The different modules allow you to author an end-to-end machine learning workflow starting with reading data, to training and validating your predictive model.
  • Ability to deploy models as web services. You can quickly operationalize your models by converting them into web services and you even the ability to monetize your machine learning models using Azure Data Market.
The start location is the Microsoft Azure Machine Learning homepage - https://studio.azureml.net/ which contains a number of user guides as well as training videos - http://azure.microsoft.com/en-us/documentation/videos/index/?services=machine-learning . Another great way to get started is by looking at the different Azure Machine Learning Samples - http://azure.microsoft.com/en-us/documentation/services/machine-learning/models/ such as Azure Machine Learning Sample: Credit risk prediction (predict whether an applicant is a good credit risk based on the German Credit Card UCI dataset) and a clustering algorithm to identify similar companies from companies in the S&P 500, using text in published Wikipedia articles for  these companies.




References:

No comments: